3,255 Hours-Chinese Children Speech data by Mobile phone
- 3,255 hours
- 9,780 people
- 6-12 aged children
Datatang has passed the certification of ISO27001 Information Security Management System and ISO9001 Quality Management System.
Mobile phone captured audio data of Chinese children, with total duration of 3,255 hours. 9,780 speakers are children aged 6 to 12, with accent covering seven dialect areas; the recorded text contains common children languages such as essay stories, numbers, and their interactions on cars, at home, and with voice assistants, precisely matching the actual application scenes. All sentences are manually transferred with high accuracy.
- 16kHz/44.1kHz (mobile phone/microphone), 16bit, uncompressed wav, mono channel
- Recording environment
- quiet indoor environment, without echo
- Recording content (read speech)
- kids' stories; human-machine interaction category; smart home command and control category; numbers; general category
- 9,780 speakers totally, with 51% males and 49% females, all children are 6-12 years old
- Android mobile phone, iPhone; part of the speaker has data recorded by microphone
- Application scenarios
- speech recognition; voiceprint recognition.
- Accuracy rate
- 97% (the accuracy rate of the noise symbols and pinyin is not included)