1,002 Hours - Kunming Dialect Speech Data by Mobile Phone
2,284 native speakers of Kunming dialect participated in the recording, with authentic accent and from multiple age groups. The recorded script covers a wide range of topics such as generic, interactive, on-board, and home. Local people in Kunming participated in quality check and proofreading, and the text was transferred accurately. It matches with mainstream Android and Apple system phones.
997 Hours – Wuhan Dialect Speech Data by Mobile Phone
Mobile phone captured audio data of Wuhan dialect, 997 hours in total, recorded by more than 2,000 Wuhan dialect native speakers. The recorded text covers generic, interactive, on-board, home and other categories, with rich contents. Wuhan locals participate in quality check and proofreading. Sentence accuracy rate reaches 95 %; this data set can be used for automatic speech recognition, machine translation, and voiceprint recognition.