Find Jobs
Hire Freelancers

Google News crawler

$250-750 USD

Closed
Posted over 7 years ago

$250-750 USD

Paid on delivery
Google News Crawler Need a program to crawl [login to view URL] -- the program should be able to run unattended and log its results to a log file. It should automatically create a new log file every day (even if the program doesn't stop at the end of the day), so that the log may have a filename format of news[date].log. The program should use a SQL Server database (you can create the db structure). It should be written in C# It should crawl each of these top-level categories: Business, Entertainment, Sports, Technology, Science, Health, Society, Politics, World In addition, it should crawl the top 200 metros in North America using the &geo= querystring parameter (we will provide a list of the top 200 metros) The first step the crawler should take when it navigates to a new top-level category is to parse the list of subcategories (which appear in the left-side navigation in the Google News interface) Then it should navigate to each of the subcategories to retrieve the matching articles. If Google News returns new subcategories, those new subcategories should be saved to a DB table and associated with their Master category. The primary objective of the crawler is to obtain contact information for Journalists. So the program should save the following information to the database: 1. Journalist name 2. Journalist’s title 3. Journalist’s beats 4. Journalist’s sub-beats 5. Journalist’s tags 6. Journalist’s location (city, state, country) 7. Journalist’s email 8. Journalist’s social media links (LinkedIn, Tweeter, Facebook pages) 9. Journalist’s website 10. Journalist’s association (news media) 11. Journalist’s About info 12. Dates of publications and Titles The crawler should use Proxies. The configuration of the crawler should specify if it should only focus on news articles added within the past day, or if it should crawl articles more than 1 day old. There should also be an ability to specify which top level beats to crawl. This way, we could potentially have multiple instances of the crawler running simultaneously. We may want to have one instance that crawls just recent articles, and then two or more additional instances that crawl historical articles (which could each be assigned half of the beats). The program should be able to crawl multiple pages of Google News results. Some news sites include journalist contact information in the articles themselves. Some have a link to the journalist's bio & contact details from the article. And these are in different places on different sites. Because different news sites put contact info for journalists in different places, the program should have some flexibility to be able to try a number of the likely places where the contact info may be to try to retrieve this information. However, if the crawler checks more than 3 articles on a given news site and does not find an email address associated with the author of the article any of those times, it should stop trying to search for journalist contact info on that site going forward (in other words, ignore that site when it appears in Google News results).
Project ID: 11626449

About the project

10 proposals
Remote project
Active 7 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
10 freelancers are bidding on average $667 USD for this job
User Avatar
Hi I would like to show you a demo first BEFORE starting the actual work. Let's discuss the details of this project. I can deliver some parts of the project to you, once you are satisfied with the results, then I will continue and then communicate about cost/payment at the end. Usually, I don't place a blind-bid before a demo, but its freelancer.com restriction to place a bid first in order to start communication with the employer. I have completed projects of several thousand dollar in net worth. Please spend 3 minutes to view the comments and testimonials given to me by other employers, please also have a look on my past projects and customers feedback here: https://www.freelancer.com/u/shakeelsoft.html#/reviews Thanks Shakeel
$825 USD in 25 days
5.0 (13 reviews)
5.9
5.9
User Avatar
I mastered c/c++/c# & java programing and reverse techniques. I read your project description carefully and decided your project. I have experienced a lot of such projects. I will satisfy you fully Please lookup my portfolio carefully. I am going to consult with many friends sincerely. Best regards.
$526 USD in 10 days
4.9 (9 reviews)
4.5
4.5
User Avatar
Hello, I understood the initial scope of this project. Although i want to discuss further this job in order to prepare the final concept for this project. After Complete discussion over the call or in chat, i will prepare following things for you - Technical Project Proposal - Flow chart for this Project - Execution plan (Step by step procedure with explanation how and at what that we are going to execute a particular task)
$773 USD in 20 days
4.6 (7 reviews)
4.6
4.6
User Avatar
Hi, I am interested. Thanks Narendra
$666 USD in 20 days
4.9 (8 reviews)
4.5
4.5
User Avatar
Hello, I hope you doing very well! I have gone through the description and would like to provide a quality solution using my 5+ years of professional experience in required skills. (.NET/ASP.NET - MVC, EF, SQL, C#). Awaiting for your positive response so we can have final talk and start project immediately :)
$350 USD in 10 days
5.0 (11 reviews)
4.0
4.0
User Avatar
Respected Sir, We are available 24/7 on SKY PE for communication and support. We create your idea into reality and specialize in creating awesome WEB, ANDRIOD, IOS Applications and marvelous designs. Happy to have your attention, I read all the project requirements and want to start communication with you for more clarification and deep understanding of your project requirements. Here’s a link to our feedback in the FREELANCER system http://www.freelancer.com/users/feedback_843582.html In short, if you want a service who offers fixed timing, down-to-earth advises, beautiful design & optimized code, some extra work for your comfort, premium after-service, AND on a reasonable budget, then we are the smart choice. We deliver you guaranteed results and complete work within mentioned timeframe. Quality is our promise. Take a look and tell me what you think Good or bad, We'd love to hear from you ;) Talk soon, Faisal Malik Hexamilesoft Lead Developer Sky pe: Hexamilesoft
$833 USD in 15 days
3.5 (5 reviews)
4.2
4.2
User Avatar
This looks like a very challenging problem that I would be very interested in! I have extensive experience in web crawlers and web automation, including some open source libraries that I would like to test to see if they work for this type of problem.
$750 USD in 14 days
0.0 (0 reviews)
0.0
0.0
User Avatar
Hi there, I’d like to be considered for this project. I’m a software developer with a strong background developing projects like All Lines Insurance Suits, Online Shopping, Library Management System, WCF services for authentication and integrating data from third party, School Management System, Agent portal and Business Analysis. I can complete your task on or before time with almost 100% satisfactory work. For 3+ years I’ve worked in web and desktop application development field and so I am accustomed to working with all sorts of projects and services, and in a variety of industries. I have a deep passion for coding and satisfactory work so I will give it 100% satisfactory work on or before time. I highly value professionalism and hold myself strictly accountable to represent my client’s brand. I aim to form a long-term working relationship. If you are interested in a larger volume of work each month, I can offer a lower rate. Please, let me know what is needed to secure this bid! Thank you for your consideration. Bhavin Pokiya
$666 USD in 15 days
0.0 (1 review)
1.8
1.8

About the client

Flag of UNITED STATES
New York, United States
4.7
5
Payment method verified
Member since May 10, 2016

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.