Find Jobs
Hire Freelancers

Big Data Programmer/Scientist Needed Badly!!

$25-50 USD / hour

Closed
Posted about 9 years ago

$25-50 USD / hour

I am looking for someone to help me take my news aggregation site to the next level. I have a large list of keywords and websites that I would like to use to gather all the news on a given subject matter each day (a test revealed this to be approx. 4,500 articles). I would like someone, first and foremost, to design a framework for collecting these articles. I would then like to run text analytics to determine what people and places* are mentioned in those 4,500 articles. I then need to be able to determine "source" articles. In other words, of those 4,500 stories, there are only several hundred original stories, the rest are simply "re-writes" by other publications. I would then like to create a system for taking the events referenced in those "source articles" and create a "timeline" of that event. This way, people can click on today's headline and read the story, but they can also read past headlines and get a quick understanding of what has happened. Finally, I need a simple way to view the "source stories", have the ability to select certain stories for the home page of my website, and edit the headline of that story as well. *A secondary piece to this project is finding some way to "scrape" the data for people and places. When someone clicks on a person or place mentioned in an article, I'd like them to get a small bio/map/picture of that person or place. There are encyclopedias of places and of world leaders across the web, I am hoping we can find that data somewhere there.
Project ID: 7514269

About the project

13 proposals
Remote project
Active 9 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
13 freelancers are bidding on average $35 USD/hour for this job
User Avatar
Greetings, I have read Project Description & understand that you have a news aggregator site, so can I see that URL please? Please drop me a message thru' PMB to start further discussion about your project, as we cannot write much on this initial message bard due to limitation of characters. Please check out our Freelancer profile https://www.freelancer.com/u/leadconcept.html, as we have delivered many successfully projects lately and let me know, if you want to see some of our recently developed projects. Look forward to hearing from you and talking to you further thru' Freelancer private message board. Regards, YK LEADconcept
$36 USD in 40 days
5.0 (6 reviews)
6.1
6.1
User Avatar
I have a team of developers and have worked already on similar project
$25 USD in 20 days
0.0 (0 reviews)
0.0
0.0
User Avatar
Hi, I am a Big Data Architect by trade, having built several applications from inception to production I have a career of experience in problem solving and solution development. Off the top of my head I believe that an appropriately considered ElasticSearch index will provide the collection and text analysis with relative ease. Source articles should also be fairly straightforward once collection and index configuration are completed. Once they have been identified they can be marked as "source stories" and will then be available to search/list by this attribute. People and place data will probably be best served by using exterior links - for data storage purposes, however, if this is not an issue then a script to collect people and place data into a separate index could also be written with relatively little complexity. I would be interested to know whether this is to be built for pure HTML/CSS/JS or whether you have any back-end systems already in place? Feel free to contact me to discuss this further.
$27 USD in 20 days
0.0 (0 reviews)
0.0
0.0
User Avatar
Hi! I got 10 year academician experience in text analytics and news parsing, providing new theories and algos. During last 5 years me and my team finished dozens local projects on news analysis, tagging, aggregation, opinion and sentiment mining, fact extraction for marketing/banking and political/safety purposes. We really could provide you mature technical solution and full workflow of the thing you want to be done: 1. scrapping 2. cleaning 3. anomaly detection & QA of scrappers 4. reloading with classification and fact extraction 5. advanced statistic and reports 6. your customer segmentation. Some of parts could be built on your server or outsourced from our services - depending on your budget. I really would glad you to help in that task with whole my and my team members experience, using our toolkit in text analytics. Please PM me to discuss details or see some demoes of our experience. P.S. We are using enterprise level text/stat analytical software and writing code in python only in special cases. It means that we could start to build analytical framework to you from first minute with all power of modern software in textual analytics. In contrast to other freelancer we do not coding and wasting a time for initial data processing, code writing, hunt on bugs and so on!
$30 USD in 25 days
0.0 (0 reviews)
0.0
0.0
User Avatar
A proposal has not yet been provided
$33 USD in 20 days
0.0 (0 reviews)
0.0
0.0
User Avatar
Please find below my short experience summary. * 10+ years experience developing Data Mining, Text Mining, Machine Learning, Knowledge Discovery, Information Extraction and Analytics for web crawling, scraping, extraction and aggregation from unstructured big data such as web-pages and text corpus ( freebase, cia factbook). * Have worked extensively on developing data extractors from various web-sources such as DailyDeals website, IMDB movie database ( text analysis and mining of movies, actors, genres ) etc and assembling and populating them into databases, datastores and search-indexes(Lucene, Solr) for analysis, search, reporting and dashboard. * Have worked extensively on Data Mining/Machine Learning techniques for automatically processing, classifying, predicting, clustering, categorization and citation and linkage analysis ( markov-model ) using referential connection and linkage structure (used HMM). * Have independently completed the projects undertaken before in developing Information Extraction, Web Crawling, Scraping, Data Mining, Analytics, Reporting, Dashboard and Statistical Tools. * Extensive experience using Perl, PHP, C, Java, .NET with MySql, Oracle, MS-SQL Server * Data Mining / Machine Learning / Information Extraction Tools : Weka, R, Excel, Perl-CPAN Packages for Extraction. Estimated Budget : ~ 45$/hr - 60$/hr ( @ 20-30 hours/week ) Price,milestones and timelines flexible and negotiable based on exact project specifications and details.
$45 USD in 20 days
0.0 (0 reviews)
0.0
0.0
User Avatar
I graduated from Carnegie Mellon University with a master degree. I have lots of industry experience in big data area. I worked at IBM, Twitter before. I know how to use hadoop related technology to build the pipeline to process the data. I have hands-on experience to use Amazon EMR. Have the ability to implement the machine learning algo using hadoop. Hadoop, Cascading, Scalding, Pig, Hive, Hbase, Giraph. Anything in hadoop ecosystem. I think I will be a good fit for this role :)
$40 USD in 15 days
0.0 (0 reviews)
0.0
0.0
User Avatar
I am writing to you to introduce our company and services.​​ My name is Verdan Mahmood, I am COO and Co-Founder of TresFilios. We are a small team of experienced Web Developers​, ​​recently established our own company. Previously, we were doing jobs in different companies. We've 3 years of working experience of Python, Scrapy and Django. We've the experience that is required for such kind of project. But to tell you honestly, It would be a long term project. It's not that easy and fast to get and architect such huge data. but yeah! it is DOABLE. We've also worked for some of the great clients who used scraping. ​We are always open to discuss more about our services and methodologies. We are new but passionate. Looking forward for a mutual beneficial relationship.
$44 USD in 18 days
0.0 (0 reviews)
0.0
0.0

About the client

Flag of UNITED STATES
Tyngsboro, United States
5.0
4
Payment method verified
Member since Feb 4, 2013

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.