en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

https://www.datatang.com/

https://www.datatang.ai/

m.datatang.ai

1242

_AI数据集产品_数据堂

196 Hours - Urdu Conversational Speech Data by Telephone_196 Hours - Urdu Conversational Speech Data by Telephone

196 Hours - Urdu Conversational Speech Data by Telephone

  • Licensed Off-the-shelf Datasets to Boost AI Projects Development.

The 196 Hours - Urdu Conversational Speech Data collected by telephone involved 270 native speakers, developed with proper balance of gender ratio, Speakers would choose a few familiar topics out of the given list and start conversations to ensure dialogues' fluency and naturalness. The recording devices are various mobile phones. The audio format is 8kHz, 8bit, WAV, and all the speech data was recorded in quiet indoor environments. All the speech audio was manually transcribed with text content, the start and end time of each effective sentence, and speaker identification.

Ask For a Quote Get Data Sample

Specifications

Format
8kHz, 8bit, u-law/a-law pcm, mono channel;
Recording Environment
quiet indoor environment, without echo;
Recording content
dozens of topics are specified, and the speakers make dialogue under those topics while the recording is performed;
Demographics
270 speakers totally, with 56% male and 44% female;.
Annotation
annotating for the transcription text, speaker identification and gender
Device
Telephony recording system;
Language
Urdu
Application scenarios
speech recognition; voiceprint recognition;
Accuracy rate
the word accuracy rate is not less than 95%

  • کہ یہ کرو، اور وہ کرو، اور احتیاطی تدابیر وغیرہ بھی نا۔

  • اچھا اور جب دیکھیں، ہمارا پہلے ہوتا تھا، یہ نزلہ وغیرہ اور ہم کہتے تھے، چلو ہم،

  • ہمم، اور دیکھیں ذرا، covid کا اور کورونا کا آج کل جو ہے، وہ اخبار اور ٹی وی میں بھی اتنا بتا رہے ہیں،

  • ہاں تمہیں پتہ ہے نا، پچھلے دو سالوں سے سردی زیادہ ہوتی ہے، تو سردی کے موسم میں کورونا کے پھیلنے کا خطرہ بھی زیادہ ہوتا ہے۔

The explicitly authorized and high-quality training dataset of the acquiree helps you start your AI project quickly

Get Started Now

Recommended Dataset

104 Hours - Brazilian Portuguese Conversational Speech Data by Telephone
104 Hours - Brazilian Portuguese Conversational Speech Data by Telephone
58 Hours - European Portuguese Child's Spontaneous Speech Data-Nexdata
58 Hours - European Portuguese Child's Spontaneous Speech Data-Nexdata
97 Hours - Brazilian Portuguese Child's Spontaneous Speech Data
97 Hours - Brazilian Portuguese Child's Spontaneous Speech Data
100 Hours - Thai Child's Spontaneous Speech Data
100 Hours - Thai Child's Spontaneous Speech Data

数据亮点

196 Hours - Urdu Conversational Speech Data by Telephone

*姓名:

*手机:

*公司名称:

*企业邮箱:

*需求:

196 Hours - Urdu Conversational Speech Data by Telephone