Data and code for the paper "NER4ID at SemEval-2022 Task 2: Named Entity Recognition for Idiomaticity Detection".
-
Updated
Feb 1, 2023 - Jupyter Notebook
Data and code for the paper "NER4ID at SemEval-2022 Task 2: Named Entity Recognition for Idiomaticity Detection".
Multilingual Vec2Text + Ad-hoc Translation + Masking Defense Mechanism
TransMI: A Framework to Create Strong Baselines from Multilingual Pretrained Language Models for Transliterated Data
TransliCo: A Contrastive Learning Framework to Address the Script Barrier in Multilingual Pretrained Language Models
A Cross-Lingual, Context-Aware and Fully-Neural Sentence Alignment System for Long Texts.
Official implementation of "CONCRETE: Improving Cross-lingual Fact Checking with Cross-lingual Retrieval" (COLING'22)
Source Code for "Adapters for Enhanced Modeling of Multilingual Knowledge and Text"
Resources for the paper "PARE: A Simple and Strong Baseline for Monolingual and Multilingual Distantly Supervised Relation Extraction"
Data and code for the paper "ID10M: Idiom Identification in 10 Languages" (NAACL 2022).
Semeval-2013 and -2015 multilingual WSD datasets for BabelNet 4.0
CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates
Framework for probing tasks
Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2021).
This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages" published in Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021.
Add a description, image, and links to the multilinguality topic page so that developers can more easily learn about it.
To associate your repository with the multilinguality topic, visit your repo's landing page and select "manage topics."