BASIC EMR ON MAP-R V3 AND AWS S3 - repost

€750-1500 EUR

Closed

Posted

over 10 years ago

€750-1500 EUR

Paid on delivery

1- Scope: Providing with a basic Hadoop Map-R V.3 environment over Amazon Web Services. Basic trial environment in this phase. No need to provide 24 x 7 tools or extra code. Main aim is to analyse data from several text S3 input sources and start trial period. 2- Tools: We provide Project AWS account for the Project and Map-R V.3 Hadoop clusters. Free administration for implementing this project. 3- Deliverables: - Scripts code for AWS API based automatic MAP-R V.3 Set-up for a given number of masters and computing nodes. - Set up scripts capable of using EC2 on “demand nodes” o For real time 24x 7 live queries o For batch night processes. - Java basic code for providing basic routines like: o Joints tables form several text sources. o Gauss statistics: Mean, deviation, etc. o Basic counting and basic mathematics routines. o Output text or Mysql computed tables. - skype sessions for 4 hours to train skilled informatics from de php and javascript world. - Documented source code. 4- Input sources: The project is intended for analysing and creating logs joints form distant connected devices and central text tables. - Several TEXT files for remote devices stored on S3 files. o Characteristics of remote devices (>400.000 TV sets) • Brand • Programed parameters • Available channels o • Geo location o Log text of distant • Real time logging of visits • Number of visits • Duration • TV station tuned in in each moment • Type home demographics where the device is installed. o TV Stations programming scheduling • Show type: movie, talk show, debate • Start time, end time. • Celebrities involved in the show. 6- Expected outputs. - Several combinations of the above. - - Mean time per TV set type expend in each type of show. o Mean time o Standard deviation o Top celebrities watched - Samples of joints form several sources. - Real time queries set up in case of need real time response. - Batch set up for long time consuming queries of whole set of queries. 7- Time table. - Needed in four weeks / January end – first September week. - We provide AWS zone with all the text sources inside ready for use. - Week days 9- 18h CET e-mail /skype contact for immediate support for any doubt or clarification needs. 8- References: - No project will be awarded without clear and outstanding references on hadoop implantations over AWS , - MAP-R is a plus.

Project ID: 5280739

About the project

5 proposals

Remote project

Active 10 yrs ago

Looking to make some money?

Email address

Benefits of bidding on Freelancer

Set your budget and timeframe

Get paid for your work

Outline your proposal

It's free to sign up and bid on jobs

5 freelancers are bidding on average €2,422 EUR for this job

@innovese

Hi, I have worked with many startups and previously, and I had worked on a startup which was sold off to the German Bertelsmann Group. Now I have started this company called algoscale focusing on building highly scalable applications for the web and setting up auto-scalable infrastructure for the same. Big-Data In the big-data area, we have worked in Hadoop/ MapReduce where we pulled stock ticker data from Google/ Yahoo feeds & news articles about those stock tickers. We stored the data on top of HDFS. Then we had implemented a querying platform on top of it, using Hive. Also wrote a MapReduce program for parsing the news articles, picking out the sentiment and correlating it with the ticker price at that moment. The whole project was deployed on AWS infrastructure using EMR. Impala We also worked on setting up a querying platform on top of 10TB data stored on AWS”s S3 and connecting it with the CDH’s Impala platform. It also comprised of a Hive platform for intermediary table storage. I have lot of other experience in working with Hadoop on AWS incl. writing MapReduce programs on top of Hadoop platform. Please let me know if you have any questions. Regards, Neeraj Agarwal

€1,444 EUR in 20 days

5.0

(4 reviews)

3.7

@aczireonline

Hi Will do this for you. Being working in the BigData domain for more than 3 years and a Cloudera and IBM certified Hadoop developer, I hope this is my domain. Well, that was to just introduce myself, now on to the project. We can do the same over multiple milestones/phases. Before that, I need to know the exact data model of the input data. I hope the whole schema is somewhat csv, right? Also, what strategy you are using for injesting data onto AWS, using any custom tools or something like sqoop or flume? Also, as per the description; you need to analyze the data; can you please get me some more details. Also, is there any direct dependancy to Map-R or can we use other distributions like Cloudera? Please feel free to contact, Thanks,

€2,888 EUR in 60 days

5.0

(3 reviews)

3.5

@meljux

Hi, I can use my own account to start hadoop on AWS. Training on this topic is very expensive and really wide wide wild.... I belong to a team of web developers and web designers who are excited by all the possibilities of the web today. Whether you need a complete website, full web application or just a graphic design; our highly experienced team will deliver. Our goal is to offer top quality web services that can propel our clients into a new direction of on line success. Our team is comprised of Web Designers, Web Developers, SEOs, Flash Specialists, Content Managers and more. Please, don't hesitate to ask us for a detailed cost estimate for your web project even if you require consulting services, a proof-of-concept on your own infrastructure we offer you all these services at no charge.

€5,000 EUR in 7 days

0.0

(0 reviews)

0.0

@naveenamp

I am expert in Java. I have over 8 years of experience in designing and developing Java, Hadoop,Map Reduce and web applications. I understand your requirement and able to deliver you project as per your expectation within time frame. Don't hesitate to ask me any question regarding your project. Please do visit my profile to see my Java Web application portfolio. Hope to hear from you soon..Thanks

€1,250 EUR in 20 days