Multilingual Vec2Text + Ad-hoc Translation + Masking Defense Mechanism
-
Updated
Sep 17, 2024 - Python
Multilingual Vec2Text + Ad-hoc Translation + Masking Defense Mechanism
Data and code for the paper "NER4ID at SemEval-2022 Task 2: Named Entity Recognition for Idiomaticity Detection".
TransMI: A Framework to Create Strong Baselines from Multilingual Pretrained Language Models for Transliterated Data
TransliCo: A Contrastive Learning Framework to Address the Script Barrier in Multilingual Pretrained Language Models
Semeval-2013 and -2015 multilingual WSD datasets for BabelNet 4.0
Data and code for the paper "ID10M: Idiom Identification in 10 Languages" (NAACL 2022).
A Cross-Lingual, Context-Aware and Fully-Neural Sentence Alignment System for Long Texts.
Source Code for "Adapters for Enhanced Modeling of Multilingual Knowledge and Text"
Resources for the paper "PARE: A Simple and Strong Baseline for Monolingual and Multilingual Distantly Supervised Relation Extraction"
Official implementation of "CONCRETE: Improving Cross-lingual Fact Checking with Cross-lingual Retrieval" (COLING'22)
Framework for probing tasks
CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates
Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2021).
This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages" published in Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021.
Add a description, image, and links to the multilinguality topic page so that developers can more easily learn about it.
To associate your repository with the multilinguality topic, visit your repo's landing page and select "manage topics."