EDC

  • EDC is the Egyptian Dialect Corpus.
  • It contains 218,149 words and is 2,024 KB in size.
  • It was collected from the social media platform Facebook.
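
The figures above can be checked against a local copy of the corpus. The following is a minimal sketch, assuming the corpus is a single UTF-8 plain-text file and that a "word" is a whitespace-delimited token. The file name `EDC.txt` and the helper `corpus_stats` are hypothetical, and the published count may have been produced with a different tokenisation.

```python
import os


def corpus_stats(path, encoding="utf-8"):
    """Return (word_count, size_in_kb) for a plain-text corpus file.

    Assumption: words are whitespace-delimited tokens.
    """
    with open(path, encoding=encoding) as f:
        # str.split() with no arguments splits on any run of whitespace.
        word_count = sum(len(line.split()) for line in f)
    size_kb = os.path.getsize(path) / 1024
    return word_count, size_kb


if __name__ == "__main__":
    corpus_path = "EDC.txt"  # hypothetical file name; adjust to the local copy
    if os.path.exists(corpus_path):
        words, kb = corpus_stats(corpus_path)
        print(f"{words:,} words, {kb:,.0f} KB")
```

Small differences from the stated numbers would most likely come from the tokenisation rule or the file encoding rather than from missing data.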

If you use the EDC, please cite this paper:

Tarmom, T., Teahan, W., Atwell, E. and Alsalka, M.A., 2020. Compression versus traditional machine learning classifiers to detect code-switching in varieties and dialects: Arabic as a case study. Natural Language Engineering, 26(6), pp.663-676.