ArticleStatsInsight

Objective

The objective of this project is to extract textual data from articles provided in given URLs and perform text analysis to compute various metrics. The metrics include sentiment scores, readability scores, and other textual statistics.

Data Extraction

Input

The URLs of the articles are provided in the Input.xlsx file. For each URL, the program extracts the article text and saves it in a text file named after the URL_ID.

Extraction Process

Only the article title and text are extracted.

Data Analysis

For each extracted text, perform textual analysis to compute the variables as specified in the Output Data Structure.xlsx file.

Dependencies:

 BeautifulSoup
 NLTK
 Pandas
 Requests

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
data_analysis.py		data_analysis.py
data_scraping.py		data_scraping.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ArticleStatsInsight

Objective

Data Extraction

Input

Extraction Process

Data Analysis

About

Releases

Packages

Languages

shubhamparmar1/ArticleStatsInsight

Folders and files

Latest commit

History

Repository files navigation

ArticleStatsInsight

Objective

Data Extraction

Input

Extraction Process

Data Analysis

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages