
435 Hours - Spanish Speech Data by Mobile Phone
- 435 hours
- 989 speakers, balanced age distribution
- various recording contents
Datatang has passed the certification of ISO27001 Information Security Management System and ISO9001 Quality Management System.


Data Introduction
The data volumn is 435 hours and is recorded by 989 Spanish native speakers. The recording text is designed by linguistic experts, which covers general interactive, in-car and home category. The texts are manually proofread with high accuracy. Recording devices are mainstream Android phones and iPhones.
Data Specification
- Format
- 16kHz, 16bit, uncompressed wav, mono channel
- Recording Environment
- quiet indoor environment, low background noise, without echo
- Recording content (read speech)
- oral category; human-machine interaction category; smart home command and in-car command category; numbers; news category
- Demographics
- 989 speakers totally, with 49% male and 51% female ; and 57% speakers of all are in the age group of 17-25,39% speakers of all are in the age group of 26-45, 4% speakers of all are in the age group of 46-60;
- Device
- Android mobile phone, iPhone
- Language
- Spanish
- Application scenario
- speech recognition, voiceprint recognition
Sample
-
00:00/00:00
Quiero que apagues la televisión.
-
00:00/00:00
El comentario que acompaña la foto es claro a este respecto: Mamá, por favor, dónde está ese peto
-
00:00/00:00
Las siete horas,veinticuatro minutos y treinta y cinco segundos.
-
00:00/00:00
No se imagina lo que significa para mí.
-
00:00/00:00
¿Qué temática tiene la película? Tenemos que hablar de Kevin