GitHub - Saksham-21/InShorts-News-Classification-via-fine-Tuning-Bert-model

Project Overview

Objective: Scrape Hindi news headlines and their content from InShorts across five categories, build a custom
tokenizer for the Hindi corpus, and fine-tune a BERT model for three-class classification.
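As a rough illustration of the scraping step, the sketch below extracts headline and content pairs from an HTML fragment using only the Python standard library. The markup in `SAMPLE_HTML`, the `itemprop` attribute values, the `NewsCardParser` and `parse_cards` names, and the `"sports"` label are all assumptions made for this example; the README does not describe the real page structure or name the five categories, and the repository's own scraper notebook may work differently.

```python
from html.parser import HTMLParser

# Hypothetical markup: the real page structure is not described in this README.
SAMPLE_HTML = """
<div class="card">
  <span itemprop="headline">पहली खबर</span>
  <div itemprop="articleBody">पहली खबर का विवरण।</div>
</div>
<div class="card">
  <span itemprop="headline">दूसरी खबर</span>
  <div itemprop="articleBody">दूसरी खबर का विवरण।</div>
</div>
"""


class NewsCardParser(HTMLParser):
    """Collects (headline, content) pairs.

    Assumes the two fields contain plain text with no nested tags.
    """

    def __init__(self):
        super().__init__()
        self.rows = []       # completed (headline, content) pairs
        self._field = None   # name of the field currently being read
        self._current = {}   # fields gathered for the card in progress

    def handle_starttag(self, tag, attrs):
        itemprop = dict(attrs).get("itemprop")
        if itemprop in ("headline", "articleBody"):
            self._field = itemprop
            self._current[itemprop] = ""

    def handle_data(self, data):
        if self._field is not None:
            self._current[self._field] += data

    def handle_endtag(self, tag):
        if self._field is None:
            return
        finished = self._field
        self._field = None
        # A card is complete once its body closes and a headline was seen.
        if finished == "articleBody" and "headline" in self._current:
            self.rows.append(
                (self._current["headline"].strip(),
                 self._current["articleBody"].strip())
            )
            self._current = {}


def parse_cards(html, category):
    """Return one dict per news card, tagged with the given category label."""
    parser = NewsCardParser()
    parser.feed(html)
    return [
        {"headline": headline, "content": content, "category": category}
        for headline, content in parser.rows
    ]


rows = parse_cards(SAMPLE_HTML, "sports")  # "sports" is a placeholder label
for row in rows:
    print(row["category"], "|", row["headline"], "|", row["content"])
```

Rows in this shape could then be written out with `csv.DictWriter` to produce a single labelled file, which is presumably the role of `combined.csv` in the repository.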

Key Steps and Achievements

1. Data Collection:
    Scraped Hindi news headlines and content from five different categories.

2. Tokenizer Development:
    Built a custom tokenizer for the Hindi corpus.
    Published the tokenizer on Hugging Face for public use.

3. Model Fine-Tuning:
    Fine-tuned a BERT model on the scraped dataset for three-class classification.
    Achieved an accuracy of 0.9832.
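To make the tokenizer step more concrete, here is a minimal sketch of how a WordPiece-style vocabulary splits Hindi words into subword pieces by greedy longest-match. The README does not say which algorithm the published tokenizer uses; WordPiece is assumed here only because it is the scheme used by the standard BERT tokenizer. `TOY_VOCAB` and the function names are hypothetical, and the vocabulary is far smaller than anything trained on a real corpus.

```python
# Tiny hypothetical vocabulary; "##" marks a piece that continues a word.
TOY_VOCAB = {"[UNK]", "खेल", "सम", "##ाचार"}


def wordpiece_tokenize(word, vocab, unk_token="[UNK]"):
    """Split one word into the longest vocabulary pieces, left to right."""
    pieces = []
    start = 0
    while start < len(word):
        end = len(word)
        match = None
        # Shrink the candidate from the right until it is in the vocabulary.
        while end > start:
            candidate = word[start:end]
            if start > 0:
                candidate = "##" + candidate
            if candidate in vocab:
                match = candidate
                break
            end -= 1
        if match is None:
            # No piece fits at this position: the whole word becomes unknown.
            return [unk_token]
        pieces.append(match)
        start = end
    return pieces


def tokenize(text, vocab):
    """Whitespace-split the text, then apply WordPiece to each word."""
    tokens = []
    for word in text.split():
        tokens.extend(wordpiece_tokenize(word, vocab))
    return tokens


print(tokenize("खेल समाचार", TOY_VOCAB))
print(tokenize("राजनीति", TOY_VOCAB))
```

With this toy vocabulary the first word is kept whole, the second is split into a leading piece plus a `##` continuation, and a word with no matching pieces maps to `[UNK]`. A vocabulary trained on the scraped corpus would reduce how often Hindi words fall back to `[UNK]`, which is the usual motivation for building a custom tokenizer before fine-tuning.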

Repository files:
    NLP_Project.ipynb
    README.md
    WebScraper(from InShorts).ipynb
    combined.csv
