The Beijing TTS Recording Center has one master control room and two professional recording rooms, each equipped with an independent control system. The recording studios passed testing by Tsinghua University's Building Environment and meet the professional-grade NR15 acoustic standard: reverberation time is under 0.1 seconds, and background noise is below 30 dB(A). The center supports TTS data production with both professional and amateur voice actors, as well as front-end model data production.
The Hefei Data Center is located in the “Big Data Town” of the Shushan Economic Development Zone. It covers 1,500 square meters and can accommodate 500 professional annotators. Since its establishment, the Hefei Data Center has worked continuously with enterprises across the artificial intelligence industry chain. It provides image and audio data collection and labeling services.
The Baoding Data Center has 1,200 square meters of workspace and 200 full-time annotators, 60% of whom are senior labelers with more than 3 years of labeling experience. It supports multiple data annotation scenarios, including speech recognition, face recognition, OCR, and autonomous driving.
All of our staff have more than 5 years of work experience, so they are familiar with different kinds of data requirements and can deeply understand clients’ application scenarios.
Our annotators have more than 3 years of data annotation experience and are skilled in 3D point cloud annotation, segmentation annotation, and TTS annotation. New annotators complete a 90-day training program.
The data facility is equipped with dual access control systems, 24-hour network monitoring, and dual network backups to ensure data security.
Our professional QA team has more than 7 years of experience in project management and quality control. After multiple rounds of QA, data accuracy reaches 96% to 99%. We apply timely, dynamic quality control throughout the annotation process to ensure that data is delivered on time.
Human: liveness detection, key points (face, body, and gesture), attributes
Scenario: 3D point cloud, LiDAR data annotation
OCR: Q&A, games, multiple languages
Mandarin: natural dialogue, reading, interactive
Dialect: natural dialogue, reading
Foreign language: natural conversation, reading
NLP: multiple interactive annotation, entity annotation, text pronunciation annotation (polyphone, character, number)
TTS: fine annotation, coarse annotation (Pinyin, mixed Chinese and English), prosody annotation (audio prosody, text prosody)