only for scrapper expert
only for java or PHP expert
we need to scrap wikipedia and get a listing in a flat TXT file CSV
scrapper must process all pages listed here :
[login to view URL]:Paintings_by_painter
to get all paintings of wikimedia
then for each paintings we need to process this fields (some must have cleaning and formating)
URL: [login to view URL]:Anders_Zorn-Porträ[login to view URL]
Artist : Anders Zorn
Title : Portrait of Lisen Lamm
Date : 1885
Medium : watercolor
Dimensions : 77 × 65.5 cm (30.3 × 25.8 in)
Current location : Unknown
largest download image : [login to view URL]
largest download image SIZE : 1,752 × 2,496 pixels.
URL : [login to view URL]:Mona_Lisa,_by_Leonardo_da_Vinci,[login to view URL]
Artist : Leonardo da Vinci
Title : La Joconde
Date : 1505
Medium : oil on poplar wood
Dimensions : 76.8 × 53 cm
Current location : Louvre Museum
largest download image : [login to view URL]
largest download image SIZE : 7,854 × 11,498 pixels
It must be a java or PHP software
text must be unicode
no ascii < 32 in any fields
columns separator = TAB
columns values should be enclosed in quote
PHP/ Java code must be very clean
All variable name must be a minimum of 4 words
Must have 25 lines of relevant comments minimum
Hi, I am a Python/C# developer...I can create this scraper for you in Python or C#...Here are some of the projects that I have done...
https://www.freelancer.pk/projects/Web-Scraping/Scrape-sports-betting-odds.html
https://www.freelancer.pk/projects/php/Web-Scrapping-8590940.html
Hi
I am a professional web scrapper, I will do this using PHP and I may show you a demo if you agree.
Please let me know if you want a demo
Thanks in advance