This module is designed to convert Indonesian graphemes (spelling) into phonemes (pronunciation). Fortunately, most Indonesian word pronunciations can be inferred from their spelling.
Big thanks to Wilson Wongso for sharing about Predicting Phonemes with BERT. I used his code to implement the predictor used in this module.
pip install git+https://github.com/Wikidepia/g2p-id
from g2p_id import G2P
g2p = G2P()
g2p("Rumah Agus terbakar.") # ˈrumah ˈaɡʊs tərˈbakar.
- Variasi Bunyi Vokal - Narabahasa
- Predicting Phonemes with BERT - Wilson Wongso
- Moeliono, Anton M., dkk. 2017. Tata Bahasa Baku Bahasa Indonesia Edisi Keempat. Jakarta: Badan Pengembangan dan Pembinaan Bahasa.
- Add test cases
- Better model for predicting "e"
- Handle heteronym