The data was recorded by 290 children from the U.S.A., with a balanced male-female ratio. The recorded content comes mainly from children's books and textbooks, which match children's language usage habits. The recordings were made in a relatively quiet indoor environment, and the text was manually transcribed with high accuracy.
For more details, please refer to the link: https://www.nexdata.ai/datasets/speechrecog/1197?source=Github
Format: 16 kHz, 16-bit, uncompressed WAV, mono channel
Recording environment: quiet indoor environment, without echo
Recording content: children's books and textbooks
Speakers: 286 American children, 53% of whom are female; all are 5-12 years old
Devices: Android mobile phone, iPhone
Language: American English
Application scenarios: speech recognition, voiceprint recognition
Accuracy: 95% sentence accuracy
License: Commercial License
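
The audio format stated above (16 kHz, 16-bit, mono, uncompressed WAV) can be checked on a downloaded file before it is used for training. The following is a minimal sketch using Python's standard `wave` module; the function name `check_wav_format` and the file path in the usage note are illustrative assumptions, not part of the dataset's tooling.

```python
import wave

# Expected values, taken from the format line of this description.
EXPECTED_RATE_HZ = 16000
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16-bit samples
EXPECTED_CHANNELS = 1            # mono


def check_wav_format(path):
    """Return a list of mismatches between a WAV file and the stated format.

    An empty list means the file header matches the description.
    (Hypothetical helper, written for illustration only.)
    """
    problems = []
    with wave.open(path, "rb") as wav:
        if wav.getframerate() != EXPECTED_RATE_HZ:
            problems.append(f"sample rate is {wav.getframerate()} Hz")
        if wav.getsampwidth() != EXPECTED_SAMPLE_WIDTH_BYTES:
            problems.append(f"sample width is {8 * wav.getsampwidth()} bits")
        if wav.getnchannels() != EXPECTED_CHANNELS:
            problems.append(f"channel count is {wav.getnchannels()}")
        # The wave module reports "NONE" for uncompressed PCM data.
        if wav.getcomptype() != "NONE":
            problems.append(f"compression type is {wav.getcomptype()}")
    return problems
```

For example, `check_wav_format("sample.wav")` (a placeholder path) returns an empty list when the header matches, and a list of human-readable mismatches otherwise. The check only inspects the WAV header; it does not assess recording quality or transcription accuracy.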