The Kinyarwanda common voice dataset is a dataset of Kinyarwanda sentences collected in order to train the kinyarwanda deepspeech model. The Kinyarwanda common voice dataset is made of 1,200,000 million + sentences
To check out the common voice dataset go to the common voice website select Kinyarwanda in the language option and you can choose releases depending on the amount of data you want. Note: the latest release have more data