1,000 Hours - Brazilian Portuguese Speech Data by Mobile Phone
- 1000 hours
- 2000 speakers
- balanced distribution for gender and age
Datatang has passed the certification of ISO27001 Information Security Management System and ISO9001 Quality Management System.
The data volumn is 1000 hours and is recorded by 2000 Brazilian native speakers. The recording text is designed by linguistic experts, which covers general interactive, in-car and home category. The texts are manually proofread with high accuracy. Recording devices are mainstream Android phones and iPhones.
- 16kHz, 16bit, uncompressed wav, mono channel
- Recording environment
- quiet indoor environment, low background noise, without echo
- Recording content (read speech)
- oral category; news category ;human-machine interaction category; smart home command and control category; in-car command and control category; numbers;
- 2,000 speakers totally, with male and female accounting within ±5% of the half; and 60% speakers of all are in the age group of 18-25,35% speakers of all are in the age group of 26-45, 5% speakers of all are in the age group of 46-60, with a floating rate of 5%;
- Android mobile phone, iPhone
- Application scenarios
- speech recognition; voiceprint recognition
Porque Douradoquara é tão famoso para os viajantes
Ao chegar sentou-se na cama abaixo de pôsteres de Dirk Nowitzki e Porzingis
quatrocentos e quarenta e um mil ducentos e trinta e dois reais
Na comunicação ela cita artigos das leis russas que apontam para punição quanto à humilhação ou insulto.
Joy nós estamos casados há vinte anos.