Trove newspapers in languages other than English

Current version: v1.1

This dataset contains information about newspapers published in languages other than English that have been digitised and made available through Trove. The language data was generated by harvesting a sample of articles from each newspaper using the Trove API, then running language detection software over the OCR'd text of each article. The method is documented in a notebook in the GLAM Workbench.
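
As a rough illustration of the final step of that method, the sketch below turns per-article language detections into per-language proportions for one newspaper's sample. It is not the code from the notebook: `language_proportions` and `toy_detect` are hypothetical names, and `toy_detect` is a deliberately crude stand-in for real language detection software.

```python
from collections import Counter
from typing import Callable, Iterable


def language_proportions(
    texts: Iterable[str], detect: Callable[[str], str]
) -> dict[str, float]:
    """Return the share of sampled articles detected in each language."""
    counts = Counter(detect(text) for text in texts)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {lang: n / total for lang, n in counts.items()}


def toy_detect(text: str) -> str:
    # Hypothetical stand-in for a real detector: treats any text containing
    # the word "und" as German and everything else as English.
    return "de" if " und " in f" {text.lower()} " else "en"


# Four invented snippets standing in for the OCR'd text of sampled articles.
sample = ["Kaffee und Kuchen", "Tea and cake", "Brot und Butter", "Bread and butter"]
proportions = language_proportions(sample, toy_detect)
print(proportions)
```

Passing the detector in as a function keeps the counting logic independent of whichever language detection library is actually used.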

There are two files:

- `newspapers_non_english.csv` – list of the main languages detected for each newspaper with non-English language content
- `non-english-newspapers.md` – a markdown formatted list of all the newspapers with non-English language content

`newspapers_non_english.csv`

The dataset contains the following columns:

| Column | Contents |
|---|---|
| `id` | newspaper id |
| `title` | newspaper title |
| `language` | language code |
| `proportion` | proportion of articles in this language |
| `number` | number of articles sampled |
| `language_full` | full language name |
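
A minimal sketch of reading these columns with Python's standard `csv` module follows. It assumes the file has a header row using the column names above, and that `proportion` and `number` parse as a float and an integer. The rows in `SAMPLE_CSV` are invented placeholders that only show the layout; they are not taken from the dataset.

```python
import csv
import io

# Placeholder rows, invented only to show the assumed column layout.
SAMPLE_CSV = """id,title,language,proportion,number,language_full
1,Example Zeitung,de,0.9,100,German
1,Example Zeitung,en,0.1,100,English
2,Example Courier,it,0.6,50,Italian
"""


def load_rows(handle):
    """Read rows from an open CSV file, converting the numeric columns."""
    rows = []
    for row in csv.DictReader(handle):
        row["proportion"] = float(row["proportion"])
        row["number"] = int(row["number"])
        rows.append(row)
    return rows


rows = load_rows(io.StringIO(SAMPLE_CSV))

# Select every row for one language, using the full language name.
german = [row for row in rows if row["language_full"] == "German"]
print([row["title"] for row in german])
```

To read the real file instead, pass `open("newspapers_non_english.csv", newline="", encoding="utf-8")` to `load_rows` in place of the `io.StringIO` object.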

`non-english-newspapers.md`

This is a markdown-formatted list created by grouping the dataset by newspaper title. It includes details of the main languages in each newspaper.
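
The grouping step might look something like the sketch below, which collects rows by newspaper title and writes each newspaper's languages in descending order of proportion. The heading level, bullet format, and function name `to_markdown` are assumptions made for illustration, so the output will not necessarily match the layout of the published file. The input rows are invented placeholders.

```python
from collections import defaultdict


def to_markdown(rows):
    """Group rows by newspaper title and format them as a markdown list."""
    by_title = defaultdict(list)
    for row in rows:
        by_title[row["title"]].append(row)

    lines = []
    for title in sorted(by_title):
        lines.append(f"### {title}")  # heading level is an assumption
        # Largest share first, so the main language leads each list.
        ranked = sorted(by_title[title], key=lambda r: r["proportion"], reverse=True)
        for row in ranked:
            lines.append(f"- {row['language_full']}: {row['proportion']:.0%}")
        lines.append("")
    return "\n".join(lines)


# Placeholder rows, invented for illustration only.
rows = [
    {"title": "Example Zeitung", "language_full": "English", "proportion": 0.1},
    {"title": "Example Zeitung", "language_full": "German", "proportion": 0.9},
    {"title": "Example Courier", "language_full": "Italian", "proportion": 0.6},
]
markdown = to_markdown(rows)
print(markdown)
```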

Repository: GLAM-Workbench/trove-newspapers-non-english

Folders and files

Latest commit

History

Repository files navigation

Trove newspapers in languages other than English

newspapers_non_english.csv

non-english-newspapers.md

About

Resources

License

Stars

Watchers

Forks

Releases 2

Packages 0

Languages

`newspapers_non_english.csv`

`non-english-newspapers.md`

Packages