

157 Hours - Pushtu Conversational Speech Data by Telephone

  • Licensed off-the-shelf datasets to boost AI project development.

The 157 Hours - Pushtu Conversational Speech Data was collected by telephone from 224 native speakers, recruited with a view to gender balance. Speakers chose a few familiar topics from a given list and started conversations, which keeps the dialogues fluent and natural. The recording devices were various mobile phones. The audio format is 8 kHz, 8-bit WAV, and all speech was recorded in quiet indoor environments. All audio was manually transcribed, with the text content, the start and end time of each effective sentence, and speaker identification.
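Since the specifications list 8-bit u-law/a-law PCM at 8 kHz, most toolchains will need the samples expanded to 16-bit linear PCM before feature extraction. The following is a minimal sketch of standard G.711 u-law decoding in pure Python. It assumes the raw u-law payload bytes have already been extracted from the file; the function names (`ulaw_to_pcm16`, `decode_ulaw`, `duration_seconds`) are illustrative and not part of any dataset tooling, and a-law files would need a different expansion table.

```python
# Sketch: expand 8-bit G.711 u-law samples to signed 16-bit linear PCM.
# Assumption: `data` holds raw u-law payload bytes (no container header).

BIAS = 0x84  # bias added by the G.711 u-law encoder


def ulaw_to_pcm16(byte: int) -> int:
    """Decode one u-law byte to a signed 16-bit linear sample."""
    u = ~byte & 0xFF              # u-law bytes are stored bit-inverted
    sign = u & 0x80               # top bit: sign
    exponent = (u >> 4) & 0x07    # next three bits: segment number
    mantissa = u & 0x0F           # low four bits: step within the segment
    magnitude = (((mantissa << 3) + BIAS) << exponent) - BIAS
    return -magnitude if sign else magnitude


# Precompute all 256 values once so decoding is a table lookup.
ULAW_TABLE = [ulaw_to_pcm16(b) for b in range(256)]


def decode_ulaw(data: bytes) -> list[int]:
    """Decode a buffer of u-law bytes to a list of 16-bit samples."""
    return [ULAW_TABLE[b] for b in data]


def duration_seconds(data: bytes, sample_rate: int = 8000) -> float:
    """Duration of a mono, one-byte-per-sample buffer."""
    return len(data) / sample_rate
```

For example, `decode_ulaw(bytes([0xFF, 0x80]))` yields a zero sample followed by the largest positive value, and one second of audio at 8 kHz occupies 8000 bytes.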


Specifications

Format: 8 kHz, 8-bit, u-law/a-law PCM, mono channel
Recording environment: quiet indoor environment, without echo
Recording content: dozens of topics are specified, and speakers hold conversations on those topics while being recorded
Demographics: 224 speakers in total, 92% male and 8% female
Annotation: transcription text, speaker identification, and gender
Device: telephony recording system
Language: Pushtu
Application scenarios: speech recognition; voiceprint recognition
Accuracy rate: word accuracy of at least 95%
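The page does not say how the word accuracy rate is measured. One common convention, assumed here, is word accuracy = 1 - (substitutions + deletions + insertions) / reference word count, i.e. one minus the word error rate. The sketch below computes that figure for a single sentence pair using whitespace tokenization; the function names are hypothetical, and real Pushtu text may need a different tokenizer.

```python
# Sketch: word accuracy as 1 - WER, under the assumption stated above.

def word_edit_distance(ref: list[str], hyp: list[str]) -> int:
    """Levenshtein distance between two word sequences."""
    prev = list(range(len(hyp) + 1))  # distances against an empty reference
    for i, r in enumerate(ref, 1):
        curr = [i]                    # distance against an empty hypothesis
        for j, h in enumerate(hyp, 1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def word_accuracy(reference: str, hypothesis: str) -> float:
    """Return 1 - WER for one reference/hypothesis pair."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    return 1.0 - word_edit_distance(ref, hyp) / len(ref)
```

With a four-word reference and one substituted word, `word_accuracy("a b c d", "a b x d")` gives 0.75. Under this definition a heavily corrupted hypothesis can score below zero, because insertions count as errors.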

  • ته

  • واده له راتلل او ترڅنګ یی د خپلوان راتک

  • وخت کی خو ډیر د خوشحالی احساس وو

  • بعضی وخت ده انسان باید

  • دا بیا نه رازی یا ډیر کم یی


Recommended Dataset

104 Hours - Brazilian Portuguese Conversational Speech Data by Telephone
58 Hours - European Portuguese Child's Spontaneous Speech Data-Nexdata
97 Hours - Brazilian Portuguese Child's Spontaneous Speech Data
100 Hours - Thai Child's Spontaneous Speech Data
