1,796 Hours - German Speech Data by Mobile Phone
German audio data captured by mobile phone, 1,796 hours in total, recorded by 3,442 German native speakers. The recorded text is designed by linguistic experts, covering generic, interactive, on-board, home and other categories. The text has been proofread manually with high accuracy; this data can be used for automatic speech recognition, machine translation, and voiceprint recognition.
GermanGermanyMobile phoneReadingSample
52,483 Shanghai Dialect Pronunciation Dictionary
The data contains more than 50,000 entries. All words and pronunciations are produced by Shanghai dialect linguists, including 410 international phonemes and 74 Shanghai phonemes. The pinyin of Shanghai dialect consists of five single tones, namely, yin ping, yin qu, yang qu, yin ru, yang ru, with accurate pronunciation. It can be used in the research and development of Shanghai dialect identification technology.
DictionaryDialectShanghai DialectIPASample
98 Hours - Taiwan Mandarin Speech Data by Mobile Phone_Reading
The data collects 204 Taiwan residents with 450 sentences for each speaker. The recorded is rich in content, including economy, entertainment, news, spoken language, numbers, letters, etc., covering general scenes and human-computer interaction scenes. Manual transcription of text to make sure the high accuracy. Recording devices are mainstream Android phones and iPhones.
MandarinTAIWANReadingSample
769 Hours - French Speech Data by Mobile Phone
The data volumn is 769 hours and is recorded by 1623 French native speakers. The recording text is designed by linguistic experts, which covers general interactive, in-car and home category. The texts are manually proofread with high accuracy. Recording devices are mainstream Android phones and iPhones.
FrenchFranceMobile phoneReadingSample
20 People-English Emotional Speech Data by Microphone
English emotional audio data captured by microphone, 20 American native speakers participate in the recording, 2,100 sentences per person; the recorded script covers 10 emotions such as anger, happiness, sadness; the voice is recorded by high-fidelity microphone therefore has high quality; it is used for analytical detection of emotional speech.
EnglishUSAEmotionMicrophoneReadingSample
23,349 People Multi-race and Multi-pose Face Images Data
23,349 People Multi-race and Multi-pose Face Images Data. This data includes Asian race, Caucasian race, black race, brown race and Indians. Each subject were collected 29 images under different scenes and light conditions. The 29 images include 28 photos (multi light conditions, multiple poses and multiple scenes) + 1 ID photo. This data can be used for face recognition related tasks.
Multi-raceMultiple light conditionsMulti-poseSample
388 Hours - Spanish Speaking English Speech Data by Mobile Phone
891 Spanish native speakers participated in the recording with authentic accent. The recorded script is designed by linguists and cover a wide range of topics including generic, interactive, on-board and home. The text is manually proofread with high accuracy. It matches with mainstream Android and Apple system phones. The data set can be applied for automatic speech recognition, and machine translation scenes.
EnglishSpainCellphoneReadingSample
CUSTOMIZED COLLECTION & ANNOTATION SERVICES
1,000,000+ crowdsourcing to perform complex and professional projects