-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
1 parent
44c093b
commit 954dc2a
Showing
1 changed file
with
50 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,50 @@ | ||
# Financial News Sentiment and Stock Market Correlation Analysis - Task 1 | ||
|
||
This branch focuses on **Task 1: Exploratory Data Analysis (EDA)**, where we perform initial analysis on the financial news dataset to uncover insights and set the foundation for further analysis. | ||
|
||
## Table of Contents | ||
|
||
1. [Project Overview](#project-overview) | ||
2. [Business Objective](#business-objective) | ||
3. [Dataset Overview](#dataset-overview) | ||
4. [Task 1 Overview](#task-1-overview) | ||
5. [Installation](#installation) | ||
6. [Usage](#usage) | ||
7. [Contributing](#contributing) | ||
8. [License](#license) | ||
9. [Acknowledgements](#acknowledgements) | ||
|
||
## Project Overview | ||
|
||
This project focuses on analyzing a large corpus of financial news data to discover correlations between news sentiment and stock market movements. It encompasses data engineering, financial analytics, and machine learning engineering to enhance predictive analytics capabilities at Nova Financial Solutions. | ||
|
||
## Business Objective | ||
|
||
Nova Financial Solutions aims to enhance its predictive analytics capabilities to significantly boost its financial forecasting accuracy and operational efficiency through advanced data analysis. This involves: | ||
- Performing sentiment analysis on financial news headlines. | ||
- Establishing statistical correlations between sentiment scores and stock price movements. | ||
- Providing actionable insights and investment strategies based on the analysis. | ||
|
||
## Dataset Overview | ||
|
||
The Financial News and Stock Price Integration Dataset (FNSPID) contains: | ||
- **headline**: Title of the news article. | ||
- **url**: Direct link to the full news article. | ||
- **publisher**: Author/creator of the article. | ||
- **date**: Publication date and time. | ||
- **stock**: Stock ticker symbol. | ||
|
||
## Task 1 Overview | ||
|
||
### Task 1: Exploratory Data Analysis (EDA) | ||
|
||
- **Descriptive Statistics**: Calculate basic statistics such as headline length, article counts per publisher, and publication dates. | ||
- **Text Analysis**: Perform sentiment analysis on headlines and identify common keywords or phrases. | ||
- **Time Series Analysis**: Analyze the frequency of article publication over time and identify trends. | ||
- **Publisher Analysis**: Identify top publishers and analyze their reporting patterns. | ||
|
||
### Deliverables for Task 1: | ||
|
||
- **EDA Report**: Summary of initial findings and insights from the exploratory data analysis. | ||
|
||
|