Skip to content

Nexdata-AI/50-People-Chinese-English-Mixed-Average-Tone-Speech-Synthesis-Corpus-Customer-Service

Repository files navigation

50-People-Chinese-English-Mixed-Average-Tone-Speech-Synthesis-Corpus-Customer-Service

Description

50 People - Chinese-English Mixed Average Tone Speech Synthesis Corpus-Customer Service. It is recorded by Chinese native speakers,customer service text, and the syllables, phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

For more details, please refer to the link: https://www.nexdata.ai/datasets/tts/1118?source=Github

Format

48,000Hz, 16bit, uncompressed wav, mono channel;

Recording environment

professional recording studio;

Recording content

customer service text, and the syllables, phonemes and tones are balanced;

Speaker

50 speakers totally, with 50% male and 50% female;

Device

microphone;

Language

Chinese/English mixed;

Annotation

word and Pinyin transcription, four-level prosodic boundary annotation;

Application scenarios

speech synthesis.

Licensing Information

Commercial License