For this studies project we chose an dataset on Kaggle that show us the top 1000 movies from all the time and we perfomed the data analysis and pre-processing, and a detailed, varied and relevant visualization for all the data, using histograms, heat maps, pair plots, bar plot, pie charts, box plot and wordclouds.
- Luísa Balleroni Shimabucoro - 11832385
- Wictor Dalbosco Silva - 11871027
A jupyter notebook can be found for faster visualization, or if you desire to run on a dashboard with streamlit, you can follow these steps:
You will need the files in this repository and some libraries:
- streamlit
- pandas
- matplotlib
- plotly.express
- wordcloud
They can be installed using:
pip install lib-name
Then after all the installation is done, you can run with
streamlit run trab2.py
The streamlit version presents full interation with the user, and you can change values with sliders and such.
A demonstration in portuguese can be found in this link.