
1,002 Hours - Russian Speech Data by Mobile Phone
- 1,960 people
- balanced in gender
Datatang has passed the certification of ISO27001 Information Security Management System and ISO9001 Quality Management System.


Data Introduction
1960 Russian native speakers participated in the recording with authentic accent. The recorded script is designed by linguists and cover a wide range of topics including generic, interactive, in-vehicle and home. The text is manually proofread with high accuracy. It matches with mainstream Android and Apple system phones.
Data Specification
- Format
- 16kHz, 16bit, uncompressed wav, mono channel
- Recording Environment
- quiet indoor environment, low background noise, without echo
- Recording content (read speech)
- generic category; human-machine interaction category; smart home command and in-car command category; numbers
- Demographics
- 1,960 speakers totally, with 50% male and 50% female; and 61% speakers of all are in the age group of 18-25,35% speakers of all are in the age group of 26-45, 4% speakers of all are in the age group of 46-60;
- Device
- Android mobile phone, iPhone
- Language
- Russian
- Application scenario
- speech recognition, voiceprint recognition
Sample
-
00:00/00:00
Сколько времени добираться от моего дома до ближайшего банка?
-
00:00/00:00
Пожалуйста, установи таймер обратного отсчета для мультиварки на полчаса.
-
00:00/00:00
Ещё одно такое замечание и я скажу Ван Хэю переключить на два понастоящему.
-
00:00/00:00
Пожалуйста дай сведения о наличии товара «свитеры» у продавца
-
00:00/00:00
Там отметили, что дети получили медицинскую помощь, в случае необходимости им окажут также психологическую поддержку