This dataset contains 97 hours of spontaneous speech from German-speaking children, manually screened and processed. Annotations include the transcription text, speaker identification, gender, and other information. The dataset can be used for speech recognition (acoustic model or language model training), caption generation, voice content moderation, and other AI algorithm research.
For more details, please refer to the link: https://www.nexdata.ai/datasets/speechrecog/1299?source=Github
Format: 16 kHz, 16 bit, WAV, mono channel
Age: children aged 12 and younger
Content category: self-media, conversation, live broadcasts, lectures, variety shows
Language: German
Annotation: transcription text, speaker identification, gender
Accuracy: Word Accuracy Rate (WAR) of at least 98%
License: Commercial License
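
The stated audio format (16 kHz, 16 bit, mono WAV) can be checked on a downloaded file before training. The following is a minimal sketch using only the Python standard library; the function name `matches_spec` and the constant names are hypothetical and not part of any Nexdata tooling.

```python
import wave

# Expected values taken from the format description of this dataset.
EXPECTED_RATE_HZ = 16000          # 16 kHz sampling rate
EXPECTED_SAMPLE_WIDTH_BYTES = 2   # 16 bit = 2 bytes per sample
EXPECTED_CHANNELS = 1             # mono


def matches_spec(path):
    """Return True if the WAV file at `path` is 16 kHz, 16-bit, mono."""
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == EXPECTED_RATE_HZ
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
            and wav.getnchannels() == EXPECTED_CHANNELS
        )
```

Calling `matches_spec("some_file.wav")` on each file (the file name here is a placeholder) gives a quick way to catch clips that were resampled or converted after download.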