A curated list of research papers and resources on code-switching
-
Updated
Nov 3, 2024
A curated list of research papers and resources on code-switching
This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalence Constant Theory and Matrix Language Theory.
This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The text that includes words from two languages such as Hindi written in roman script, mixed with English.
A pipeline for transliteration, spell correction, POS tagging and word sense disambiguation of Hinglish code mixed data to Hindi Devanagari script.
Code for the paper "Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots" (NAACL-HLT 2021)
Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.
This repo contains the source code of HIT: A Hierarchically Fused Deep Attention Network for RobustCode-mixed Language Representation (Accepted in ACL 2021)
Repository containing Abusive Tweet Detection, Location Detection and Gender Detection codes
Jopara (Guarani-dominant mixed with Spanish) sentiment analysis corpus
MaskLID: Code-Switching Language Identification through Iterative Masking -- ACL 2024
Indonesian-English code-mixed Twitter dataset
Psycholinguistic Analysis of Code Mixing - Speech and Natural Language Processing Term Project: CS60057. Department of Computer science and Engineering, Indian Institute of Technology Kharagpur
A word level Language Identification (LID) tool for Tagalog-English (Taglish) text.
A language detection model for code-switched texts in es/en/zh
Tweet ids for code-mixed Russian-German and Russian-Hebrew tweets
A Centralized Frenglish Benchmark from Naturally Occurring Code-Switching and Code-Mixing
Add a description, image, and links to the code-mixing topic page so that developers can more easily learn about it.
To associate your repository with the code-mixing topic, visit your repo's landing page and select "manage topics."