This project focuses on the detailed analysis of financial news sentiment and its correlation with stock market movements. By leveraging natural language processing (NLP) techniques, sentiment analysis, and advanced financial analytics, this analysis aims to uncover meaningful insights that can enhance financial forecasting accuracy. The ultimate goal is to develop innovative strategies that utilize news sentiment as a predictive tool for stock market trends.
This branch focuses on Task 1: Exploratory Data Analysis (EDA), where we perform initial analysis on the financial news dataset to uncover insights and set the foundation for further analysis.
This project focuses on analyzing a large corpus of financial news data to discover correlations between news sentiment and stock market movements. It encompasses data engineering, financial analytics, and machine learning engineering to enhance predictive analytics capabilities at Nova Financial Solutions.
Nova Financial Solutions aims to enhance its predictive analytics capabilities to significantly boost its financial forecasting accuracy and operational efficiency through advanced data analysis. This involves:
- Performing sentiment analysis on financial news headlines.
- Establishing statistical correlations between sentiment scores and stock price movements.
- Providing actionable insights and investment strategies based on the analysis.
The Financial News and Stock Price Integration Dataset (FNSPID) contains:
- headline: Title of the news article.
- url: Direct link to the full news article.
- publisher: Author/creator of the article.
- date: Publication date and time.
- stock: Stock ticker symbol.
-
Task 1: Exploratory Data Analysis (EDA)
- Descriptive Statistics
- Text Analysis (Sentiment Analysis & Topic Modeling)
- Time Series Analysis
- Publisher Analysis
-
Task 2: Quantitative Analysis using PyNance and TA-Lib
- Load and prepare stock price data
- Apply technical analysis indicators
- Visualize the data
-
Task 3: Correlation Between News and Stock Movement
- Align datasets by dates
- Perform sentiment analysis on news headlines
- Calculate daily stock returns and correlation with sentiment scores
- Interim Report: Summary of initial findings and partial progress (max 3 pages).
- Final Report: Detailed analysis and insights (up to 10 pages).
-
Clone the Repository:
git clone https://github.com/dagiteferi/Financial-News-Sentiment-Stock-Market-Correlation-Analysis.git cd Financial-News-Sentiment-Stock-Market-Correlation-Analysis
-
Install Dependencies:
pip install -r requirements.txt
** Contributing** Contributions are welcome! Please fork the repository and use a feature branch. Pull requests are warmly welcomed.
- Fork the repository: Click the "Fork" button at the top right of this page to create your own copy of the repository.
- Clone your fork: Clone the forked repository to your local machine.
git clone https://github.com/your-username/your-repository.git
- Create a new branch: Create a new branch for your feature or bugfix.
git checkout -b feature/AmazingFeature
- Make your changes: Implement your feature or fix the bug. Ensure your code adheres to the project's coding standards and style.
- Commit your changes: Commit your changes with a descriptive message.
git add . git commit -m 'Add some AmazingFeature'
- Push your branch: Push your branch to your forked repository.
git push origin feature/AmazingFeature
- Create a Pull Request: Go to the repository on GitHub, switch to your branch, and click the
New Pull Request
button. Provide a detailed description of your changes and submit the pull request.
License Distributed under the MIT License. See LICENSE for more information.
=======
- Descriptive Statistics: Calculate basic statistics such as headline length, article counts per publisher, and publication dates.
- Text Analysis: Perform sentiment analysis on headlines and identify common keywords or phrases.
- Time Series Analysis: Analyze the frequency of article publication over time and identify trends.
- Publisher Analysis: Identify top publishers and analyze their reporting patterns.
- EDA Report: Summary of initial findings and insights from the exploratory data analysis.