Skip to content

Latest commit

 

History

History
29 lines (20 loc) · 1.19 KB

README.md

File metadata and controls

29 lines (20 loc) · 1.19 KB

Kosovo News Articles Dataset

Getting started

This is the first publicly available dataset for the Albanian language. It contains more than 3 million news articles from various albanian news sources (see list below).

Kosovo News Articles Dataset Header

Content

After having scraped all of the newspages through their Wordpress API’s we merged all of the data into this file, where to separate the origin of each news article we’ve also added the source to each post.

All available articles from the first one posted on each page until 27.08.2020 are stored in the file.

These articles were taken from these news pages:

Download the Dataset: The Kosovo News Articles Dataset is available for download on Kaggle. Access it here.