- Developed a GCP-hosted product for over 1 million movie investors on HSX.com, aiding online trading with an end-to-end Hive workflow built on MapReduce; box office trends are updated weekly through dynamic web scraping
- Proposed that producers adopt proactive script writing, based on an analysis of ~480K movie reviews on Rotten Tomatoes
Movie Tycoon is a platform that helps movie investors on https://www.hsx.com/ by providing insights into where and in whom to invest their money. It also aims to give creative personnel a tool to analyze reviews and use them as feedback for future projects. This way, investors can identify the right price for investments in the cinema business, and theatre owners can schedule movie shows based on box office predictions.
- Deployed Python web scraping tools to build a corpus of data for NLP modeling
- Used Hive to run queries against the movie database
- Used the Naïve Bayes algorithm to identify the sentiment of the trends
- Understand box office trends: the top box office returns are observed in the Action, Musical and Family genres
- Leverage NLP to understand critics' reviews: the top words for movies with positive reviews are
  - Story
  - Compelling
  - Performance
  - Brilliant Drama
- Movie business landscape analysis: most movies are produced in the Drama, Thriller and Comedy genres
- The entire product refreshes its predictions every Monday at 8 am, using Hive for scheduling the automated workflows
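
The Naïve Bayes sentiment step and the top-word analysis listed above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the class `NaiveBayesSentiment`, the helper `tokenize`, the `STOP_WORDS` set, and the four toy reviews are all hypothetical and stand in for the scraped Rotten Tomatoes corpus. It assumes a multinomial model with add-one smoothing, which may differ from the variant used in the product.

```python
import math
from collections import Counter, defaultdict

# Hypothetical stop-word list; a real pipeline would use a fuller one.
STOP_WORDS = {"a", "an", "and", "the", "with", "of"}


def tokenize(text):
    """Lowercase a review, keep alphabetic tokens, drop stop words."""
    cleaned = "".join(c if c.isalpha() else " " for c in text.lower())
    return [tok for tok in cleaned.split() if tok not in STOP_WORDS]


class NaiveBayesSentiment:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, texts, labels):
        self.word_counts = defaultdict(Counter)  # label -> word -> count
        self.doc_counts = Counter(labels)        # label -> number of reviews
        self.vocab = set()
        for text, label in zip(texts, labels):
            tokens = tokenize(text)
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        return self

    def predict(self, text):
        """Return the label with the highest log-posterior score."""
        total_docs = sum(self.doc_counts.values())
        vocab_size = len(self.vocab)
        # Words never seen in training carry no evidence, so skip them.
        tokens = [tok for tok in tokenize(text) if tok in self.vocab]
        scores = {}
        for label, n_docs in self.doc_counts.items():
            counts = self.word_counts[label]
            total_words = sum(counts.values())
            score = math.log(n_docs / total_docs)  # log prior
            for tok in tokens:
                score += math.log((counts[tok] + 1) / (total_words + vocab_size))
            scores[label] = score
        return max(scores, key=scores.get)

    def top_words(self, label, n=3):
        """Most frequent words among reviews carrying the given label."""
        return [word for word, _ in self.word_counts[label].most_common(n)]


# Toy data invented for illustration only; not taken from the real corpus.
TOY_REVIEWS = [
    ("a compelling story with a brilliant performance", "positive"),
    ("brilliant drama and a compelling story", "positive"),
    ("a dull plot and a weak script", "negative"),
    ("weak acting and a dull boring story", "negative"),
]

texts, labels = zip(*TOY_REVIEWS)
model = NaiveBayesSentiment().fit(texts, labels)

print(model.predict("a compelling and brilliant drama"))
print(model.top_words("positive"))
```

On a corpus of the size described here, a library implementation would likely be a more practical choice than hand-rolled counting; the sketch only shows the mechanics of scoring a review and ranking words per sentiment label.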
Listen to the product pitch here: https://www.youtube.com/watch?v=wpuIuco7MX0