19.46 Hours - American English Speech Synthesis Corpus-Female
Female American English audio data, 19,841 sentences in total, about 20 hours of audio. It was recorded by native speakers of American English, with an authentic accent and a pleasant voice. Phoneme coverage is balanced, and professional phoneticians participated in the annotation. The corpus is designed to meet the research and development needs of speech synthesis.
502 Hours - Chinese Speaking English Speech Data by Mobile Phone
1,279 Chinese speakers from major dialect regions participated in the recording, so the data reflects the characteristic accents of Chinese speakers of English. The recording scripts cover many categories, such as spoken English, speeches, and human-computer interaction; they are rich in content, span a wide range of fields, and are phonemically balanced. The data can be used to improve the accuracy of automatic speech recognition systems on English spoken by Chinese speakers.