Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create dataset loader for multilingual-NLI-26lang-2mil7 #583

Closed
SamuelCahyawijaya opened this issue Apr 1, 2024 · 1 comment · Fixed by #598
Closed

Create dataset loader for multilingual-NLI-26lang-2mil7 #583

SamuelCahyawijaya opened this issue Apr 1, 2024 · 1 comment · Fixed by #598
Assignees
Labels
pr-ready A PR that closes this issue is Ready to be reviewed

Comments

@SamuelCahyawijaya
Copy link
Collaborator

Dataset multilingual_nli_26lang
Description This dataset contains 2 730 000 NLI text pairs in 26 languages spoken by more than 4 billion people. The dataset can be used to train models for multilingual NLI (Natural Language Inference) or zero-shot classification. The dataset is based on the English datasets MultiNLI, Fever-NLI, ANLI, LingNLI and WANLI and was created using the latest open-source machine translation models.
Subsets -
Languages ind, vie, eng
Tasks Natural Language Inference
License Unknown (unknown)
Homepage https://huggingface.co/datasets/MoritzLaurer/multilingual-NLI-26lang-2mil7
HF URL https://huggingface.co/datasets/MoritzLaurer/multilingual-NLI-26lang-2mil7
Paper URL https://www.cambridge.org/core/journals/political-analysis/article/less-annotating-more-classifying-addressing-the-data-scarcity-issue-of-supervised-machine-learning-with-deep-transfer-learning-and-bertnli/05BB05555241762889825B080E097C27
@akhdanfadh
Copy link
Collaborator

#self-assign

@akhdanfadh akhdanfadh added the pr-ready A PR that closes this issue is Ready to be reviewed label Apr 2, 2024
yongzx added a commit that referenced this issue May 5, 2024
Closes #583 | Add Dataloader multilingual-NLI-26lang-2mil7
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
pr-ready A PR that closes this issue is Ready to be reviewed
Projects
Status: Done
Development

Successfully merging a pull request may close this issue.

2 participants