This repository hosts scripts and materials related to the Pangloss Collection. Note that the official repository of the Pangloss Collection is at https://github.com/CNRS/Pangloss and that the interface to consult the Pangloss Collection is at https://pangloss.cnrs.fr/.
The first script hosted in this repository is DOREMIFA2XML, a Praat script that takes as input (i) a Praat TextGrid and (ii) the EFEO-CNRS-SOAS word list. It generates as output an XML file in the Pangloss format for word lists (<WORDLIST>). This script was written by MẠC Đăng Khoa as part of a digitization project funded by the Digital Scientific Library. The project is presented in a PowerPoint slideshow. A step-by-step guide in Vietnamese is provided in the PDF file HowToProduceXMLFile_Vietnamese.pdf.
DOREMIFA2XML is the tool that was used to produce the XML documents for Arem, Muong and other Vietic languages in the Pangloss Collection.
There are also some scripts of mine at https://github.com/alexis-michaud/na/tree/master/SCRIPTS
References: EFEO-CNRS-SOAS Word List for Linguistic Fieldwork in Southeast Asia https://halshs.archives-ouvertes.fr/halshs-01632084/ (by Frédéric Pain, Michel Ferlus, Alexis Michaud, Thị Thu Hà Phạm, Ryan Gehrmann and Minh-Châu Nguyễn)