97-Hours-German-Children-Spontaneous-Speech-Data

Description

97 hours of spontaneous speech from German-speaking children, manually screened and processed. Annotations include transcription text, speaker identification, gender, and other information. The dataset can be used for speech recognition (acoustic model or language model training), caption generation, voice content moderation, and other AI algorithm research.

For more details, please refer to the link: https://www.nexdata.ai/datasets/speechrecog/1299?source=Github

Specifications

Format

16 kHz, 16-bit, WAV, mono channel;
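
The format above can be checked against a file's header before training. The following is a minimal sketch using Python's standard `wave` module; the function name `wav_format_mismatches` and the `EXPECTED` table are illustrative and not part of the dataset's tooling.

```python
import wave

# Expected header values, taken from the Format line above
# (16 kHz, 16-bit = 2 bytes per sample, mono).
EXPECTED = {"sample_rate": 16000, "sample_width_bytes": 2, "channels": 1}


def wav_format_mismatches(path):
    """Return {field: (expected, actual)} for every header field that differs."""
    with wave.open(str(path), "rb") as wav:
        actual = {
            "sample_rate": wav.getframerate(),
            "sample_width_bytes": wav.getsampwidth(),
            "channels": wav.getnchannels(),
        }
    return {
        field: (EXPECTED[field], actual[field])
        for field in EXPECTED
        if actual[field] != EXPECTED[field]
    }
```

An empty result means the file matches the stated format; for example, `wav_format_mismatches("000003_1.wav")` could be run on one of the sample files.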

Age

Children aged 12 and younger;

Content category

Self-media, conversation, live broadcast, lecture, and variety show;

Language

German;

Annotation

Transcription text, speaker identification, and gender;

Accuracy

Word Accuracy Rate (WAR) of at least 98%.
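
This README does not spell out how WAR is computed. A common convention, assumed here, is WAR = 1 - (S + D + I) / N, where S, D, and I are the word substitutions, deletions, and insertions needed to turn the reference transcript into the compared transcript, and N is the number of reference words. The sketch below implements that assumed definition with a word-level edit distance; `word_accuracy_rate` is an illustrative name.

```python
def word_accuracy_rate(reference: str, hypothesis: str) -> float:
    """WAR under the assumed definition 1 - (S + D + I) / N, on whitespace tokens."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    return 1.0 - prev[-1] / len(ref)
```

Under this definition, one wrong word in a four-word reference gives 0.75, and the value can drop below zero when the compared transcript contains many extra words.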

Licensing Information

Commercial License

Sample files

000003_1 through 000003_11 and GM000003_1 through GM000003_11, each provided as a .wav and .txt pair;
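
The sample file names come in same-stem `.wav` and `.txt` pairs. Assuming each `.txt` file holds the annotation for the `.wav` file with the same stem (the README does not state this explicitly), the pairs can be collected with a short sketch like the one below; `pair_recordings` is an illustrative helper, not part of the dataset.

```python
from pathlib import Path


def pair_recordings(directory):
    """Return sorted (wav_path, txt_path) pairs that share a file stem.

    Assumption: a .txt file annotates the .wav file with the same stem.
    Recordings without a matching .txt file are skipped.
    """
    root = Path(directory)
    pairs = []
    for wav_path in sorted(root.glob("*.wav")):
        txt_path = wav_path.with_suffix(".txt")
        if txt_path.is_file():
            pairs.append((wav_path, txt_path))
    return pairs
```

Calling `pair_recordings(".")` from the directory that holds the sample files returns one tuple per recording that has a matching text file.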
