
200 People - Chinese Wake-up Words Speech Data by Mobile Phone
- 200 people, 180 sentences per person
- 24.6 hours
- covering seven dialect regions
Datatang has passed the certification of ISO27001 Information Security Management System and ISO9001 Quality Management System.


Data Introduction
Chinese wake-up words audio data captured by mobile phone, collected from 200 people, 180 sentences per person, a total length of 24.5 hours; recording staff come from seven dialect regions with balanced gender distribution; collection environment was diversified; recorded text includes wake-up words and colloquial sentences.
Data Specification
- Data Size
- 200 People, about 180 sentences per person, 24.6 Hour in total
- Recording Content
- Wake-up Word
- Recording Environment
- Quiet Environment, Noisy Environment: Indoor, In-car, Street side
- Storage Format
- 16kHz, 16bit, mono channel, wav
- Data Source
- Customize copyrighted data
- Speaker
- Chinese, 200 People, female account for 53%
- Age
- 108 People between 18-35 account for 54%, 69 people between 36-45 account for 34%, 23 people between 46-59 account for 12%
- Covered Area
- 7 large dialects area are covered
- Device
- HuaWei Honor8 : HuaWei G9 = 1.2:1
- Related Area
- Waking-up Word, Colloquial Sentences
Sample
-
00:00/00:00
T0253G0001S0122.wavfgf成吉思汗
-
00:00/00:00
T0253G0004S0148.wavfgf天下无敌
-
00:00/00:00
T0253G0149S0152.wavfgf天下无敌
-
00:00/00:00
T0253G0149S0168.wavfgf天下无敌
-
00:00/00:00