
19.46 Hours - American English Speech Synthesis Corpus-Female
- 20000 sentences
- 20 hours
- Amercian native speakers
Datatang has passed the certification of ISO27001 Information Security Management System and ISO9001 Quality Management System.


Data Introduction
Female audio data of American English, 19,841 sentences in total and 19.46 hours. It is recorded by American English native speakers, with authentic accent and sweet sound. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
Data Specification
- Format
- 44,100Hz, 16bit, uncompressed wav, mono channel.
- Recording environment
- professional recording studio.
- Recording content
- general narrative sentences, interrogative sentences, etc.
- Speaker
- native speaker of American English, female.
- Annotation Feature
- word transcription, part-of-speech, phoneme boundary, four-level accents, four-level prosodic boundary.
- Device
- Microphone
- Language
- American English
- Application scenarios
- speech synthesis
Sample
-
00:00/00:00
Is- it|really take time?$VERB PRON ADV VERB NOUNIH1 Z/IH1 T/R IH1 L IY03/T EY1 K/T AY1 M/
-
00:00/00:00
Was he|start a bird|as he walk$in the wood?$VERB PRON VERB DET NOUN CONJ PRON VERB ADP DET NOUNW AA1 Z/HH IY1/S T* AA1 R T3-1/AX0/B ER1 D3/AE1 Z/HH IY1/W AO1 K3/IH0 N/DH AX1/W UH1 D/
-
00:00/00:00
Look$at the way|they hear operas$and see oil paintings.$VERB ADP DET NOUN PRON VERB NOUN CONJ VERB NOUN NOUNL UH1 K/AE1 T-1/DH AX1/W EY13/DH EY1/HH IH1 R/AA1 P R AX0 Z3/AE1 N D-1/S IY13/OY1 L/P EY1 N T AX0 NG Z/
-
00:00/00:00
The focus-|of this chapter$is the American revolution.$DET NOUN ADP DET NOUN VERB DET ADJ NOUNDH AX1/F OW1 K AX0 S/AH1 V/DH AX1 S/CH AE1 P T ER03/IH1 Z/DH AX1/AX0 M EH1 R AX0 K AX0 N/R EH2 V AX0 L UW1 SH AX0 N/
-
00:00/00:00
Was$the young|birds|feather?$VERB DET ADJ NOUN NOUNW AA1 Z/DH AX1/Y AH1 NG3/B ER1 D Z/F EH1 DH ER0/