245 Hours – Mandarin Speech Data in Cars by Mobile Phone
- 695 participants
- 300 sentences for each person
- 16kHz, 16 bit, wav
Datatang has passed the certification of ISO27001 Information Security Management System and ISO9001 Quality Management System.
695 Chinese native speakers participated in the recording, with 245 hours of valid data, covering many regions of the country. The recording was carried out in the car environment, covering various scenarios such as different road types, different vehicle models, window opening and closing situations, whether music was turned on or not, etc.
- 16kHz, 16bit, wav, mono channel
- Recording Environment
- Recording Content
- customer consultancy(more than 30 fields), SMS, news
- 695 people, about 300 sentence/person, male: 324 female: 371 ,613 aged ≤25, 77 aged from 26~40岁, 5 aged over 40 , mainly from 28 provinces such as Jiangsu, Henan, Shandong, Anhui, Hubei,Hunan, Shaanxi and so on
- Android mobile phone
- Application Scenario
- speech recognition; machine translation; voiceprint recognition
- Annotation Content
- text, noise symbol