Skip to content
This repository has been archived by the owner on Aug 27, 2024. It is now read-only.

Commit

Permalink
[v2.0]
Browse files Browse the repository at this point in the history
  • Loading branch information
areebahmeddd committed Nov 12, 2023
1 parent 3293a8e commit c985eca
Show file tree
Hide file tree
Showing 3 changed files with 12 additions and 6 deletions.
Binary file removed Frontend/src/images/demo-1.gif
Binary file not shown.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
18 changes: 12 additions & 6 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,6 +4,12 @@ The [Press Information Bureau](https://pib.gov.in/) (PIB) automated feedback sys

The primary functionality of the system is to send timely notifications for negative articles while providing a user-friendly dashboard for data visualization. Additionally, there is a separate Chrome extension for real-time fake news detection.

---

<p align = "center">
<img width = "100%" src = "https://github.com/areebahmeddd/Insight-Ink/blob/main/Frontend/src/images/demo.gif">
</p>

## Overview

<p align = "center">
Expand All @@ -12,9 +18,9 @@ The primary functionality of the system is to send timely notifications for nega

- **Asynchronous Web Scraping**: Utilized [BeautifulSoup](https://beautiful-soup-4.readthedocs.io/en/latest/) library along with asynchronous libraries such as [aiohttp](https://docs.aiohttp.org/en/stable/) and [asyncio](https://docs.python.org/3/library/asyncio.html) to efficiently scrape articles from various national and regional media websites.

- **Text Extraction** & **Language Translation**: Implemented Google's Optical Character Recognition engine ([Pytesseract](https://pytesseract.readthedocs.io/en/latest/)) to extract text from scanned or image-based regional newspaper articles and integrated [Google Translator API](https://py-googletrans.readthedocs.io/en/latest/) to translate the extracted text into English, supporting cross-language analysis.
- **Text Extraction** & **Language Translation**: Implemented Google's Optical Character Recognition engine ([Pytesseract](https://pytesseract.readthedocs.io/en/latest/)) to extract text from scanned or image-based regional newspaper articles and integrated [Google Translator](https://py-googletrans.readthedocs.io/en/latest/) API to translate the extracted text into English, supporting cross-language analysis.

- **Video Content Breakdown**: Leveraged [OpenAI Whisper API](https://platform.openai.com/docs/guides/speech-to-text) for an in-depth analysis of closed captioning in YouTube videos from selected news channels, enhancing media monitoring capabilities.
- **Video Content Breakdown**: Leveraged [OpenAI Whisper](https://platform.openai.com/docs/guides/speech-to-text) API for an in-depth analysis of closed captioning in YouTube videos from selected news channels, enhancing media monitoring capabilities.

🗂️ Processed data is stored automatically in [JSON](https://docs.python.org/3/library/json.html) format with well-defined key-value pairs, ensuring compatibility for frontend integration and wider accessibility across various applications.

Expand All @@ -28,23 +34,23 @@ The primary functionality of the system is to send timely notifications for nega

- **Sentiment Analysis**: Trained a Bidirectional Encoder Representations [from Transformers](https://proceedings.neurips.cc/paper_files/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf) ([BERT](https://huggingface.co/docs/transformers/model_doc/bert)) model within the [PyTorch](https://pytorch.org/docs/stable/torch.html) framework, on a [dataset](https://github.com/areebahmeddd/Insight-Ink/blob/main/Backend/Models/data-2.csv) comprising articles classified as positive, neutral, and negative. The [test model](https://github.com/areebahmeddd/Insight-Ink/tree/main/Backend/Models/Sentiment) achieved an accuracy of ~81%, closely matching the ground truth labels.

📊 [Matplotlib](https://matplotlib.org/stable/index.html) library is applied automatically to generate bar graphs to visually represent the correlation between government departments and the sentiment expressed in news articles, making it easier to identify trends, patterns, and areas of concern.
📊 [Matplotlib](https://matplotlib.org/stable/index.html) library is applied automatically to generate graphs to visually represent the correlation between government departments and the sentiment expressed in news articles, making it easier to identify trends, patterns, and areas of concern.

---

<p align = "center">
<strong>Data Presentation</strong>
</p>

- **Cross-Platform User Interface**: Designed a [website](https://insight-ink.org/) using frameworks such as [React](https://react.dev/reference/react) and [Bootstrap](https://getbootstrap.com/docs/5.3/getting-started/introduction/), with integrated [Twilio API](https://www.twilio.com/docs/sms) for real-time notifications to government officials regarding negative articles, thereby improving the ability to monitor and respond proactively.
- **Cross-Platform User Interface**: Designed a [website](https://insight-ink.org/) using frameworks such as [React](https://react.dev/reference/react) and [Bootstrap](https://getbootstrap.com/docs/5.3/getting-started/introduction/), with integrated [SMTPlib](https://docs.python.org/3/library/smtplib.html) library and [Twilio](https://www.twilio.com/docs/sms) API for real-time notifications to government officials regarding negative articles, thereby improving the ability to monitor and respond proactively.

- **Chrome Extension**:

📦
📦 Hosted the website on GoDaddy and configured the Frontend to send POST requests via the [Axios](https://axios-http.com/docs/api_intro) library and the Backend to process them securely with the [CORS](https://flask-cors.readthedocs.io/en/latest/) extension provided by the [Flask](https://flask.palletsprojects.com/en/latest/quickstart/) framework.

## Getting Started

Follow these steps to set up and run the Insight Ink software on your local machine, or you can watch the [demo video](https://www.youtube.com/watch?v=dQw4w9WgXcQ)
Follow these steps to set up and run the Insight Ink software on your local machine, or you can watch the [demo video](https://www.youtube.com/watch?v=GFApJyF8yc0)

### Prerequisites

Expand Down

0 comments on commit c985eca

Please sign in to comment.