1,025 Hours - Mandarin Strong Accent Speech Data by Mobile Phone
- 2,000 people
- balanced gender distribution
- covering some strong northern accent provinces
Datatang has passed the certification of ISO27001 Information Security Management System and ISO9001 Quality Management System.
More than 2,000 Chinese native speakers participated in the recording with equal gender. Speakers are mainly from the southern China, and some of them are from the provinces of northern China with Strong accents. The recording content is rich, covering mobile phone voice assistant interaction, smart home command and control, In-car command and control, numbers and other fields, which is accurately matching the smart home, intelligent car and other practical application scenarios.
- mobile phone, 16kHz, 16bit, uncompressed wav, mono channel
- Recording Environment
- moderately quiet indoor environment, without echo
- Recording Content
- generic category and control category; human-machine interaction category; smart home command and control category; in-car command and control category; numbers;
- 2,444 speakers totally, with 54% males and 46% females, and 59% speakers of all are in the age group of 18~25, 37% speakers of all are in the age group of 26~45, 4% speakers of all are in the age group of 46~60; speakers are mainly from the southern China, and some of them are from the provinces of northern China with Strong accents.
- iPhone, Android mobile phone
- Strong Accented Mandarin
- Application scene
- speech recognition, machine translation; voiceprint recognition