Find Jobs
Hire Freelancers

Python project

$10-30 USD

Completed
Posted over 3 years ago

$10-30 USD

Paid on delivery
1. (100 points) Write a python program that fetches movie reviews from 3 major websites: IMDB, Metacritics, and Rotten Tomatoes. On these websites, there is an option to show the top movies. On IMDB, it is called “Top rated movies” On Rotten Tomatoes, it is called “BEST MOVIES OF ALL TIME” On Metacritics, it is called “Movies of All Time” You will first write python script that collect the movie reviews for the top 50 movies from each website and store them in a comma separated file (called [your name][login to view URL]). In addition to the csv file, the data should also be stored in a SQLite database called MovieReviewDatabase in the directory that your Jupyter Notebook code will be executed from. The MovieReviewDatabase should have a table called MovieReviewTable. Next, analyze the data you have collected, remove all punctuations and lower all cases to avoid counting the same word multiple times. There are 2 tasks you need to complete: 1. Compare the review style across the 3 websites. After removing stop words (e.g. a, the, of, and, etc.), find out what are the top 50 most frequent keyword that is used in each of the websites. Using simple word embedding technique, your program should also show the similarity score between the keywords used in these 3 websites. (An example is given below) 2. Compare the top 50 movies that are considered top rated across the 3 websites. Construct a dictionary that uses the movie name as the key, and their genre information as the value. For example: dictionary = {Black panther: [Adventure, Action, Fantasy], Casablanca:[Drama]} Use similar technique like word embedding, your program should show the similarity score between the genres of movies ranked as top 50 across these 3 websites. (An example is given below). Combine your finding with those in point 1, write a short report to make inference on the audience demography their preferences on these 3 websites
Project ID: 28405255

About the project

3 proposals
Remote project
Active 3 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
Awarded to:
User Avatar
Hi sir, Loved the project description. My approach would be use python for web scraping, storing the files locally. Applying NLP to find best reviews , then detect top keywords and reviews. Saving the data in csv + database. Can start immediately working on the project. Thank you, looking forward to your kind response!
$20 USD in 4 days
5.0 (9 reviews)
3.2
3.2
3 freelancers are bidding on average $17 USD for this job
User Avatar
HI there I can extract the review data as you want. I have gone through your project description and it will take 4 working days to complete it. Lets have a conversation if you are interested
$20 USD in 4 days
0.0 (0 reviews)
0.0
0.0

About the client

Flag of UNITED STATES
Arlington, United States
5.0
1
Payment method verified
Member since Dec 3, 2020

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.