This dataset contains translations structured as I am [demonym]
in English, with corresponding translations into German (deu), Spanish (spa), French (fra), and Italian (it). The dataset is organized by language in the data/
folder.
The translations were sourced from the following references:
- English https://github.com/mledoze/countries/tree/master
- French https://github.com/mledoze/countries/tree/master
- German https://deutsch.lingolia.com/en/vocabulary/laender-nationalitaeten
- Italian https://www.theintrepidguide.com/nationalities-in-italian/?utm_source=chatgpt.com
- Spanish https://espanol.lingolia.com/en/vocabulary/countries
Each file contains the following columns:
Column Name | Description |
---|---|
eng |
The source sentence in English |
<lang>_m |
The masculine form of the translation (if applicable) |
<lang>_f |
The feminine form of the translation (if applicable) |
<lang>_n |
The neuter form of the translation (if applicable) |
eng | it_m | it_f | it_n |
---|---|---|---|
I am Austrian. | Sono austriaco. | Sono austriaca. | |
I am Belgian. | Sono belga. |