This project analyzes WhatsApp chat data and reports insights such as message frequency, word usage, media sharing, and link sharing. The results are shown as graphs and charts so that trends within the chat are easier to see.

Statistics Overview:
- Total number of messages
- Total number of words
- Number of media messages shared
- Number of links shared
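
The four counts above can be sketched in plain Python. This is a minimal illustration, not the project's own code: the sample messages are made up, the `<Media omitted>` placeholder assumes an export made without media, and the regular expression is a simple stand-in for urlextract that only matches http(s) links.

```python
import re

# Hypothetical sample: (sender, message) pairs as they might look after preprocessing.
messages = [
    ("Asha", "Good morning everyone"),
    ("Ravi", "<Media omitted>"),
    ("Asha", "Check this out https://example.com/post"),
    ("Ravi", "Nice one"),
]

# Simple stand-in for urlextract: matches http(s) URLs only.
URL_PATTERN = re.compile(r"https?://\S+")

# Assumed placeholder text that the export writes in place of a media file.
MEDIA_PLACEHOLDER = "<Media omitted>"


def chat_stats(rows):
    """Return (total messages, total words, media messages, links shared)."""
    total_messages = len(rows)
    # Words are whitespace-separated tokens across all messages.
    total_words = sum(len(text.split()) for _, text in rows)
    media_messages = sum(1 for _, text in rows if text.strip() == MEDIA_PLACEHOLDER)
    links = sum(len(URL_PATTERN.findall(text)) for _, text in rows)
    return total_messages, total_words, media_messages, links


print(chat_stats(messages))
```

In the app itself the same counts would more likely come from a Pandas DataFrame; the tuple return here just keeps the sketch short.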

Timeline Analysis:
- Monthly timeline of messages
- Daily timeline of messages
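
One way to build both timelines is to group message timestamps by month and by calendar day. The sketch below uses only the standard library and invented timestamps; the function names are hypothetical, and the project itself may do the grouping with Pandas instead.

```python
from collections import Counter
from datetime import datetime

# Hypothetical message timestamps, one per message.
timestamps = [
    datetime(2023, 1, 30, 9, 15),
    datetime(2023, 1, 30, 21, 40),
    datetime(2023, 2, 1, 8, 5),
    datetime(2023, 2, 14, 19, 30),
    datetime(2023, 2, 14, 19, 31),
]


def monthly_timeline(stamps):
    """Count messages per (year, month), in chronological order."""
    counts = Counter((ts.year, ts.month) for ts in stamps)
    return sorted(counts.items())


def daily_timeline(stamps):
    """Count messages per calendar day, in chronological order."""
    counts = Counter(ts.date() for ts in stamps)
    return sorted(counts.items())


for (year, month), count in monthly_timeline(timestamps):
    print(f"{year}-{month:02d}: {count}")
for day, count in daily_timeline(timestamps):
    print(f"{day}: {count}")
```

The sorted (period, count) pairs are already in the shape a line plot needs: periods on the x-axis, counts on the y-axis.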

Activity Maps:
- Most active days of the week
- Most active months
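
The activity maps come down to counting messages per weekday name and per month name. The following sketch shows one possible approach with hypothetical data and helper names. The day and month names are spelled out explicitly so the output does not depend on the system locale.

```python
from collections import Counter
from datetime import datetime

DAY_NAMES = ["Monday", "Tuesday", "Wednesday", "Thursday",
             "Friday", "Saturday", "Sunday"]
MONTH_NAMES = ["January", "February", "March", "April", "May", "June",
               "July", "August", "September", "October", "November", "December"]

# Hypothetical message timestamps, one per message.
timestamps = [
    datetime(2023, 1, 30, 9, 15),
    datetime(2023, 1, 30, 21, 40),
    datetime(2023, 2, 1, 8, 5),
    datetime(2023, 2, 14, 19, 30),
    datetime(2023, 2, 14, 19, 31),
    datetime(2023, 2, 14, 19, 45),
]


def busiest_days(stamps):
    """Weekday names ordered from most to least active."""
    # datetime.weekday() returns 0 for Monday through 6 for Sunday.
    return Counter(DAY_NAMES[ts.weekday()] for ts in stamps).most_common()


def busiest_months(stamps):
    """Month names ordered from most to least active."""
    return Counter(MONTH_NAMES[ts.month - 1] for ts in stamps).most_common()


print(busiest_days(timestamps))
print(busiest_months(timestamps))
```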

WordCloud:
- Visualization of the most used words in the chat

Most Common Words:
- List of the most commonly used words
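
A word-frequency list of this kind is typically produced by lowercasing each message, dropping stop words, and counting what remains. The sketch below assumes a tiny inline stop-word set in place of stop_hinglish.txt and skips the assumed `<Media omitted>` placeholder. The sample messages and the function name are hypothetical.

```python
import re
from collections import Counter

# Hypothetical stop words; the project reads its list from stop_hinglish.txt.
STOP_WORDS = {"the", "is", "a", "to", "hai", "ka"}

# Hypothetical message texts.
messages = [
    "The match is tonight",
    "Match ka score kya hai",
    "<Media omitted>",
    "Tonight the match starts late",
]


def most_common_words(texts, stop_words, top_n=2):
    """Return the top_n (word, count) pairs after stop-word filtering."""
    counter = Counter()
    for text in texts:
        if text.strip() == "<Media omitted>":
            continue  # media placeholders carry no real words
        for word in re.findall(r"[a-z']+", text.lower()):
            if word not in stop_words:
                counter[word] += 1
    return counter.most_common(top_n)


print(most_common_words(messages, STOP_WORDS))
```

The same filtered counts could also feed a word cloud, where each word's size reflects how often it appears.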

The project consists of the following files:
- app.py: Main Streamlit application file where the app logic resides.
- preprocessor.py: Module for preprocessing the WhatsApp chat data.
- helper.py: Module containing helper functions for data analysis and visualization.
- stop_hinglish.txt: File containing stop words for filtering out common words.
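
To give a sense of what the preprocessing step involves, here is a minimal sketch of parsing an exported chat into records. It is not the code in preprocessor.py. It assumes lines of the form `DD/MM/YYYY, HH:MM - Sender: message` (the real export format varies by device and locale), and the `group_notification` label and the function name are hypothetical.

```python
import re
from datetime import datetime

# Assumed line format: "31/12/2022, 21:16 - Asha: Happy new year"
LINE_PATTERN = re.compile(r"^(\d{1,2}/\d{1,2}/\d{4}), (\d{1,2}:\d{2}) - (.*)$")


def parse_chat(raw_text):
    """Turn raw export text into a list of timestamp/sender/message dicts."""
    records = []
    for line in raw_text.splitlines():
        match = LINE_PATTERN.match(line)
        if match is None:
            # No timestamp prefix: treat as a continuation of the previous message.
            if records:
                records[-1]["message"] += "\n" + line
            continue
        date_part, time_part, body = match.groups()
        timestamp = datetime.strptime(f"{date_part} {time_part}", "%d/%m/%Y %H:%M")
        sender, separator, message = body.partition(": ")
        if not separator:
            # System lines (e.g. group created) have no "Sender: " prefix.
            sender, message = "group_notification", body
        records.append({"timestamp": timestamp, "sender": sender, "message": message})
    return records


sample = (
    '31/12/2022, 21:15 - Asha created group "Friends"\n'
    "31/12/2022, 21:16 - Asha: Happy new year\n"
    "to everyone!\n"
    "01/01/2023, 09:05 - Ravi: <Media omitted>\n"
)

for record in parse_chat(sample):
    print(record["timestamp"], record["sender"], repr(record["message"]))
```

A list of dicts like this can be passed straight to `pandas.DataFrame(...)` for the analysis steps.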

The project uses the following technologies:
- Python
- Streamlit: For creating the interactive web app
- Pandas: For data manipulation and analysis
- Matplotlib & Seaborn: For data visualization
- urlextract: For extracting URLs from messages

To run the app:
- Clone the repository
- Install the required dependencies: pip install -r requirements.txt
- Run the Streamlit app: streamlit run app.py
- Upload your WhatsApp chat file
- Analyze the Chat