303 Hours - Mixed Speech with Chinese and English Data by Mobile Phone
- 1,113 speakers
- seven main dialect zones
- sentences with Chinese and English
Datatang is certified under the ISO27001 Information Security Management System and the ISO9001 Quality Management System.
The data was recorded by 1,113 native Chinese speakers whose accents cover seven major dialect areas. The recorded text consists of sentences that mix Chinese and English, covering general scenes and human-computer interaction scenes. The content is varied and the transcriptions are accurate. The data can be used to improve the performance of speech recognition systems on read speech that mixes Chinese and English.
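When scoring a recognizer on mixed Chinese-English text like this, a common convention is to count each Chinese character and each English word as one unit. The sketch below illustrates that convention only. The description does not say how the transcripts are tokenized, so the function names and the character range (basic CJK Unified Ideographs, U+4E00 to U+9FFF) are assumptions.

```python
import re

# Assumption: one token per CJK ideograph (U+4E00..U+9FFF only), or one token
# per run of Latin letters, digits, and apostrophes. Punctuation is dropped.
_TOKEN = re.compile(r"[\u4e00-\u9fff]|[A-Za-z0-9']+")


def tokenize_mixed(text):
    """Split a mixed transcript into Chinese characters and English words."""
    return _TOKEN.findall(text)


def language_tags(tokens):
    """Tag each token as 'zh' or 'en' based on its first character."""
    return ["zh" if "\u4e00" <= tok[0] <= "\u9fff" else "en" for tok in tokens]


if __name__ == "__main__":
    tokens = tokenize_mixed("我说放一首Stop The Drama.")
    print(tokens)
    print(language_tags(tokens))
```

Treating characters and words as separate unit types keeps Chinese text, which has no spaces, from collapsing into a single token.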
- Format
- 16 kHz, 16-bit, uncompressed WAV, mono channel
- Recording environment
- quiet indoor environment, without echo
- Recording content (read speech)
- general category; human-machine interaction category
- Speakers
- 1,113 speakers in total, 45% male and 55% female; 75% of speakers are aged 14-25 and 25% are aged 26-46
- Device
- Android mobile phone; iPhone
- Language
- Mandarin; English
- Application scenarios
- speech recognition; voiceprint recognition.
- Accuracy rate
我说放一首Stop The Drama.
我叫你帮我打开Sha DOW Rocket.
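
The audio format stated above (16 kHz, 16-bit, uncompressed mono WAV) can be checked before training. The following is a minimal sketch using Python's standard `wave` module. The function names and constants are illustrative and not part of the dataset, and the file layout of the delivery is not assumed.

```python
import wave

# Expected values taken from the format line of the description.
EXPECTED_RATE_HZ = 16000
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16-bit samples
EXPECTED_CHANNELS = 1            # mono


def check_wav_format(path):
    """Return a list of mismatches between a WAV file and the stated format."""
    problems = []
    with wave.open(path, "rb") as wav:
        if wav.getframerate() != EXPECTED_RATE_HZ:
            problems.append(f"sample rate is {wav.getframerate()} Hz")
        if wav.getsampwidth() != EXPECTED_SAMPLE_WIDTH_BYTES:
            problems.append(f"sample width is {8 * wav.getsampwidth()} bit")
        if wav.getnchannels() != EXPECTED_CHANNELS:
            problems.append(f"{wav.getnchannels()} channels")
        if wav.getcomptype() != "NONE":
            problems.append(f"compression type is {wav.getcomptype()}")
    return problems


def duration_seconds(path):
    """Return the duration of a WAV file in seconds."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()
```

An empty list from `check_wav_format` means the file matches the stated format. Summing `duration_seconds` over all files gives a total to compare against the advertised 303 hours.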