797 People – Young Children Chinese Speech Data
- 120 sentences for each person
- children at the age of 3-5
- 16kHz, 16bit, wav
Datatang has passed the certification of ISO27001 Information Security Management System and ISO9001 Quality Management System.
The data were recorded by 797 Chinese children aged 3 to 5, of whom 39% were children aged 5. The recording content conforms to the characteristics of children, mainly storybooks, children's songs, spoken language. Around 120 sentences for each speaker. It is simultaneously recorded by hi-fi microphone and cellphone. The vaild data are 41.8 hours. Texts are manually transcribed with high accuracy.
- Microphone 44.1kHz, 16bit, wav, mono; Android phone 16kHz, 16bit, wav, mono; Apple mobile phone 22.05kHz, 16bit, wav, mono
- Recording Environment
- Quiet room
- Recording Content
- Daily language; children's songs; storybooks; instruction interaction sentences; numbers; letters; weekday names
- 797 people, about 120 sentences/person; 51% are males; all of them are children aged from 3~5, among which 39% are aged at 5
- Simultaneous recording by the microphone and mobile phone; Android : IOS=3 :1
- Application Scene
- Speech recognition; machine translation; voiceprint recognition