
203 People - Taiwanese Mandarin Speech Data by Mobile Phone_Guiding
- 203 people
- Taiwan locals
- 16kHz, 16bit, wav
Datatang has passed the certification of ISO27001 Information Security Management System and ISO9001 Quality Management System.


Data Introduction
The data collected 203 Taiwan people, covering Taipei, Kaohsiung, Taichung, Tainan, etc. 137 females, 66 males. It is recorded in quiet indoor environment. It can be used in speech recognition, machine translation, voiceprint recognition model training and algorithm research.
Data Specification
- Format
- 16kHz, 16bit, uncompressed wav, mono channel
- Recording environment
- quiet indoor environment, without echo
- Recording content (read speech )
- smart car, smart home, speech assistant
- Speaker
- 203 Taiwanese, 67% of which are female
- Device
- Android mobile phone, iPhone
- Language
- Mandarin
- Transcription content
- text, noise symbols, special identifiers
- Accuracy rate
- 95%(the accuracy rate of noise symbols and other identifiers is not included)
- Application scenarios
- speech recognition, voiceprint recognition
Sample
-
00:00/00:00
查詢餘額
-
00:00/00:00
前排右側座位調前
-
00:00/00:00
放低靠背
-
00:00/00:00
把電壓轉換至兩百二十福特
-
00:00/00:00
今天晚上哪一台播甄嬛傳