Skip to content

Saksham-21/InShorts-News-Classification-via-fine-Tuning-Bert-model

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 

Repository files navigation

Project Overview

Objective: Scrape Hindi news headlines and their content from five different categories, build a custom tokenizer,
and fine-tune a model for three-class classification.

Key Steps and Achievements

1 Data Collection:
    Scraped Hindi news headlines and content from five different categories.

2 Tokenizer Development:
    Built a custom tokenizer for the Hindi corpus.
    Published the tokenizer on Hugging Face for public use.

3 Model Fine-Tuning:
    Fine-tuned the dataset for three-class classification on Bert.
    Achieved an accuracy of 0.9832.

Releases

No releases published

Packages

No packages published