Data collection (Surf the web and collect the data)
$30-250 USD
In Progress
Posted over 14 years ago
$30-250 USD
Paid on delivery
I have easy but time consuming project to offer. I need quality data on websites from one niche. I will tell to winner of the bid what niche we are talking about. The data has to be collected per country.
The job is to surf the web and collect the following data in a csv file (utf-8 format)
url;Company Name;Zip;City;State;Country;Continent;street or box;phone no.;url to website with contact info;keyword;url for keyword
exemple:
[login to view URL];Nils Oscar Company AB;611 31;Nyköping;Västra Götaland;Sweden;Europe;Fruängsgatan 2;+46(0)15577280;[login to view URL];company;[login to view URL]
Each country has to be saved in different file.
Websites can be found by searching the web on keywords and then visiting the website to gather the information.
Main URL (website adress) has to be without slash at the end and appear just once, can't be used twice for different countries.
States for the different countries can be found on Wikipedia and the names have to be identical for the same stats. Not “NY” and “New York” for other record. Only full names like “New York” in example case.
There have to be a whole address not only a city or country. One missing variable like zip, street or phone number is ok say in 1/20 records. But country, state, city have to always be stated.
If the street address is on different lines the data has to have comma "," between the lines like this: ;Time street 25, Suiet A;
The phone country code has to be in the following format: +32(0) like in +46(0)15577280. No spaces between numbers. No letters in phone number.
The second URL (web address) is the URL to the website where the address can be found. Information that you gather has to be only on one site. I have a script that will check that the information gathered is correct.
The site has therefore to be readable and have the same domain name as the main (first) URL. The website can't be for instance made in flash because you can’t read the code and copy from the website.
Keyword is a qualifier and defines the niche I want to gather information about. It can be different for different languages. You can use Google translate to find out the translation name. What the keywords are I will tell to the winning bidder.
The last URL has to be from the same domain as the main url and is pointing to the website where the keyword was found.
You may not find listings in all countries but probably most of them. You can gather and check about 150 records in one day (I have done 3000 myself). You have to deliver at least 1000 listings every second week to keep up the momentum and speed if you want to continue.
In average I would like to get 120 listings for each country. But for the large countries there will be more records and the small less (the extreme cases from 30 to 1000 I guess).
It's important to save in correct format utf-8 not to mess up encoding of the letters. I can give you more details on free software to help you and other details.
The payment will be escrow for each 1000 records which is equal to about 7-8 days work, after quality check.
The amount of wrong records can't be more than 30 defects per set (1000 record). One defect is one wrong variable. Example wrong state, wrong phone format, etc
If there is more defects then 30 I will return the record, but you will have to find the defects for you self. Quality is important and if not followed it can break the agreement.
There is a lot of work even if it's little bit boring. I guess in to the next year.
You have to ask if there is anything else.
I'm interested in information from following countries
(The smallest countries can be skipped, less than 1 million people):
Brazil
Japan
China
Hong Kong
Singapore
South Korea
Taiwan
Russia
Afghanistan
Albania
Algeria
American Samoa
Andorra
Angola
Anguilla
Antiguaand Barbuda
Argentina
Armenia
Aruba
Azerbaijan
Bahamas
Bahrain
Bangladesh
Barbados
Belarus
Belize
Benin
Bermuda
Bhutan
Bolivia
Bosniaand Herzegovina
Botswana
Bouvet Island
British Indian Ocean Territory
Brunei Darussalam
Bulgaria
Burkina Faso
Burundi
Cambodia
Cameroon
Cape Verde
Cayman Islands
Central African Republic
Chad
Chile
Christmas Island
Cocos (Keeling) Islands
Colombia
Comoros
Congo
Congo,Democratic Republic
Cook Islands
Costa Rica
Coted, Ivoire
Croatia
Cyprus
Czech Republic
Djibouti
Dominica
Dominican Republic
East Timor
Ecuador
Egypt
El Salvador
Equatorial Guinea
Eritrea
Estonia
Ethiopia
Falkland Islands (Malvinas)
Faroe Islands
Fiji
French Guiana
French Polynesia
French Southern Territories
Gabon
Gambia
Georgia
Ghana
Gibraltar
Greece
Greenland
Grenada
Guadeloupe
Guam
Guatemala
Guinea
Guinea-Bissau
Guyana
Haiti
HeardandMcDonaldIslands
Honduras
HongKong
Hungary
Iceland
Indonesia
Iraq
Ireland
Israel
Jamaica
Jordan
Kazakhstan
Kenya
Kiribati
Kuwait
Kyrgyzstan
Lao People's Democratic Republic
Latvia
Lebanon
Lesotho
Liberia
Libya
Liechtenstein
Lithuania
Luxembourg
Macau
Macedonia
Madagascar
Malawi
Malaysia
Maldives
Mali
Malta
Marshall Islands
Martinique
Mauritania
Mauritius
Mayotte
Mexico
Micronesia
Moldova
Monaco
Mongolia
Montserrat
Morocco
Mozambique
Myanmar
Namibia
Nauru
Nepal
Netherlands Antilles
New Caledonia
New Zealand
Nicaragua
Niger
Nigeria
Niue
Norfolk Island
Northern Mariana Islands
Oman
Pakistan
Palau
Palestinian Territory
Panama
Papua New Guinea
Paraguay
Peru
Philippines
Pitcairn
Poland
Puerto Rico
Qatar
Reunion
Romania
Rwanda
Saint Kittsand Nevis
Saint Lucia
Saint Vincent and the Grenadines
Samoa
San Marino
Sao Tomeand Principe
Saudi Arabia
Senegal
SerbiaandMontenegro
Seychelles
Sierra Leone
Slovakia
Slovenia
Solomon Islands
Somalia
South Georgiaand The South Sandwich Islands
SouthAfrica
Sri Lanka
[login to view URL]
[login to view URL] Miquelon
Suriname
Swaziland
Tajikistan
Tanzania
Thailand
Togo
Tokelau
Tonga
Trinidadand Tobago
Tunisia
Turkey
Turkmenistan
Turksand Caicos Islands
Tuvalu
Uganda
Ukraine
United Arab Emirates
United States Minor Outlying Islands
Uruguay
Uzbekistan
Vanuatu
Vatican
Venezuela
Vietnam
Virgin Islands (British)
Virgin Islands (U.S.)
Wallisand Futuna Islands
Western Sahara
Yemen
Zambia
Zimbabwe
Hi,
I have extensive experience in doing website research and data mining. Contact me if you are looking for quality work on a long term basis.
Warm regards,
$250 USD in 365 days
0.0 (0 reviews)
0.0
0.0
46 freelancers are bidding on average $419 USD for this job