The website this time is: [url removed, login to view]
Develop a script to obtain product information from a website, the information we're looking for is: price (numeric format, no symbols, no commas, etc), name, description, category (electronics, tv screens, cars, books, pets, videogames, etc), model, features, rating and image url.
The script must comply the following rules:
a) It must extract the information of every product showcased in the webpage.
b) None of the products have to be repeated in the database.
c) In order to extract every product the script needs to crawl many pages, thus the script needs to be programmed so it will not cycle.
The language in which we require the script is Python, using scrapy and pymongo libraries. If necessary we may provide an example of a script already functional.
This is not the only script we are interested in, so if the script runs smoothly and we are pleased with the experience we will offer new developments.