
3,255 Hours-Chinese Children Speech data by Mobile phone
- 3,255 hours
- 9,780 people
- 6-12 aged children
Datatang has passed the certification of ISO27001 Information Security Management System and ISO9001 Quality Management System.


Data Introduction
Mobile phone captured audio data of Chinese children, with total duration of 3,255 hours. 9,780 speakers are children aged 6 to 12, with accent covering seven dialect areas; the recorded text contains common children languages such as essay stories, numbers, and their interactions on cars, at home, and with voice assistants, precisely matching the actual application scenes. All sentences are manually transferred with high accuracy.
Data Specification
- Format
- 16kHz/44.1kHz (mobile phone/microphone), 16bit, uncompressed wav, mono channel
- Recording environment
- quiet indoor environment, without echo
- Recording content (read speech)
- kids' stories; human-machine interaction category; smart home command and control category; numbers; general category
- Demographics
- 9,780 speakers totally, with 51% males and 49% females, all children are 6-12 years old
- Device
- Android mobile phone, iPhone; part of the speaker has data recorded by microphone
- Language
- mandarin
- Application scenarios
- speech recognition; voiceprint recognition.
- Accuracy rate
- 97% (the accuracy rate of the noise symbols and pinyin is not included)
Sample
-
00:00/00:00
[S]心平气和地跟她讲啦三次
-
00:00/00:00
[N]一个小时后提醒我
-
00:00/00:00
[N]他们又立刻照啦一张照片
-
00:00/00:00
[N]说一是一是什么意思
-
00:00/00:00
[S]十一月十一日是什么节日