en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

https://www.datatang.com/

https://www.datatang.ai/

m.datatang.ai

1240

_AI数据集产品_数据堂

330 Hours - Dari Conversational Speech Data by Telephone_330 Hours - Dari Conversational Speech Data by Telephone

330 Hours - Dari Conversational Speech Data by Telephone

  • Licensed Off-the-shelf Datasets to Boost AI Projects Development.

The 330 Hours - Dari Conversational Speech Data collected by telephone involved 452 native speakers, developed with proper balance of gender ratio, Speakers would choose a few familiar topics out of the given list and start conversations to ensure dialogues' fluency and naturalness. The recording devices are various mobile phones. The audio format is 8kHz, 8bit, WAV, and all the speech data was recorded in quiet indoor environments. All the speech audio was manually transcribed with text content, the start and end time of each effective sentence, and speaker identification.

Ask For a Quote Get Data Sample

Specifications

Format
8kHz, 8bit, ulaw/alaw pcm, mono channel;
Recording Environment
quiet indoor environment, without echo;
Recording content
dozens of topics are specified, and the speakers make dialogue under those topics while the recording is performed;
Demographics
452 speakers totally, with 94% male and 6% female;
Annotation
annotating for the transcription text, speaker identification and gender
Device
Telephony recording system;
Language
Dari
Application scenarios
speech recognition; voiceprint recognition;
Accuracy rate
the word accuracy rate is not less than 95%

  • مکتبهای دولتی مکتبهای شخصی وجود داره

  • دیگه ده همی جای شما

  • کلشان در تعلیم مصروف هستن و مکتبا همینطور فعلا خوب مکتبام شروع شدن

  • دیگه مکتبا پوهنتونا همینطور مدرسه ها کلشان ده همیجه فعال هستن

  • پاچای پیشینگی ما بود و حالا که هسته

The explicitly authorized and high-quality training dataset of the acquiree helps you start your AI project quickly

Get Started Now

Recommended Dataset

104 Hours - Brazilian Portuguese Conversational Speech Data by Telephone
104 Hours - Brazilian Portuguese Conversational Speech Data by Telephone
58 Hours - European Portuguese Child's Spontaneous Speech Data-Nexdata
58 Hours - European Portuguese Child's Spontaneous Speech Data-Nexdata
97 Hours - Brazilian Portuguese Child's Spontaneous Speech Data
97 Hours - Brazilian Portuguese Child's Spontaneous Speech Data
100 Hours - Thai Child's Spontaneous Speech Data
100 Hours - Thai Child's Spontaneous Speech Data

数据亮点

330 Hours - Dari Conversational Speech Data by Telephone

*姓名:

*手机:

*公司名称:

*企业邮箱:

*需求:

330 Hours - Dari Conversational Speech Data by Telephone