
ACCENTED ENGLISH AUTOMATIC SPEECH RECOGNITION WORKSHOP & CHALLENGE

Organizers

  • CCF Task Force on Speech Dialogue and Auditory Processing

  • Audio, Speech and Language Processing Group, Northwestern Polytechnical University

  • Xi'an Software Park

  • Shaanxi Kunpeng Ecological Innovation Center

  • Speech Lab, Shanghai Jiao Tong University, China

  • School of Computer Science and Engineering, Nanyang Technological University, Singapore

  • Center for Language and Speech Processing, Johns Hopkins University, United States

  • Datatang (Beijing) Technology Co., Ltd.

  • CHALLENGE BACKGROUND

    INTERSPEECH 2020 Accented English Speech Recognition Workshop

    INTERSPEECH has grown into the world's largest technical conference focused on speech processing and applications. The conference emphasizes interdisciplinary approaches addressing all aspects of speech science and technology, ranging from basic theories to advanced applications. As a flagship satellite workshop, the Accented English Speech Recognition Challenge (AESRC) will provide a common testbed for researchers in speech recognition, especially recognition of accented speech. The challenge workshop will be held on October 25, 2020 in Shanghai.

  • CHALLENGE INTRODUCTION

    Accented English Speech Recognition Challenge

    English is the most widely used language in the world, and English speech recognition is one of the most active research areas in both academia and industry. Current ASR systems perform well and meet most requirements for standard English. Recognizing accented English, however, remains a challenging task. The difficulties in building an accented English ASR system arise mainly from variation in pronunciation accuracy, intonation, speaking rate, and the pronunciation of particular syllables. In addition, the shortage of accented speech data limits research in this area.

    The Interspeech 2020 Accented English Speech Recognition Challenge (AESRC) will release 8 sets of accented English data from different countries to the participants, covering various pronunciation characteristics and accents, with the aim of promoting discussion and exchange on English language research and accented speech recognition. We hope that researchers from academia and industry can learn from each other and benefit from participating in our challenge and workshop.

    Computing resources will be provided by Huawei


TRACK SETTING

Track1

Accent Identification

Use only the permitted data to train an accent identification model. Submit the accent identification results on the test set.

Note: There is no limit on models or training techniques. Evaluation considers only the identification accuracy on the test set.
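
Since Track 1 is scored on identification accuracy alone, the metric can be sketched as below. This is a minimal illustration, not the organizers' scoring script: the function name, the utterance IDs, the accent labels, and the choice to count a missing prediction as an error are all assumptions made for the example.

```python
from typing import Mapping


def identification_accuracy(reference: Mapping[str, str],
                            predicted: Mapping[str, str]) -> float:
    """Fraction of reference utterances whose predicted accent equals the label.

    Assumption: an utterance with no prediction counts as incorrect.
    """
    if not reference:
        raise ValueError("reference must contain at least one utterance")
    correct = sum(
        1 for utt_id, accent in reference.items()
        if predicted.get(utt_id) == accent
    )
    return correct / len(reference)


# Hypothetical utterance IDs and labels, for illustration only.
reference = {"utt1": "US", "utt2": "UK", "utt3": "India", "utt4": "Korea"}
predicted = {"utt1": "US", "utt2": "US", "utt3": "India", "utt4": "Korea"}

print(identification_accuracy(reference, predicted))  # 0.75
```
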
Track2

Accented English Speech Recognition

Use only the permitted data to train an ASR model that recognizes all kinds of accented English. Submit the recognized text.

Note: Test sets will include accents not present in the training data in order to evaluate the generalization performance of the model. All system combination methods, including ROVER, are strictly prohibited. Language model training should use only the transcripts of the permitted speech training data. Data augmentation should be applied only to the permitted speech data.

Specified data

160 hours of labelled accented speech collected in Russia, Korea, US, Portugal, Japan, India, UK and China (20 hours/country) will be released to the registered teams.

Duration: 20 hours × 8

Language & Accent: Accented English from Russia, Korea, US, Portugal, Japan, India, UK, China

Speaker: 40 – 110 speakers per accent

Audio Format: 16 kHz, 16-bit, single-channel WAV

Recording environment: Indoor, mobile phone

Speech content: Daily communication, interaction with smart devices, etc.
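
The audio format listed above (16 kHz, 16-bit, single channel) can be checked against a file header before training. The following is a minimal sketch using Python's standard `wave` module; the function names and the `write_silence` helper are hypothetical and exist only to make the example self-contained.

```python
import os
import tempfile
import wave

# Values taken from the "Audio Format" entry: 16 kHz, 16-bit, single channel.
EXPECTED_FORMAT = {"sample_rate_hz": 16000, "sample_width_bytes": 2, "channels": 1}


def read_wav_format(path: str) -> dict:
    """Return sample rate, sample width, and channel count from a WAV header."""
    with wave.open(path, "rb") as wav:
        return {
            "sample_rate_hz": wav.getframerate(),
            "sample_width_bytes": wav.getsampwidth(),
            "channels": wav.getnchannels(),
        }


def matches_expected_format(path: str) -> bool:
    """True if the file header matches the format listed for the challenge data."""
    return read_wav_format(path) == EXPECTED_FORMAT


def write_silence(path: str, sample_rate_hz: int, channels: int,
                  seconds: float = 0.1) -> None:
    """Write a short silent 16-bit WAV file (used only for this demonstration)."""
    n_frames = int(sample_rate_hz * seconds)
    with wave.open(path, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit
        wav.setframerate(sample_rate_hz)
        wav.writeframes(b"\x00\x00" * n_frames * channels)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        good = os.path.join(tmp, "good.wav")
        bad = os.path.join(tmp, "bad.wav")
        write_silence(good, 16000, 1)
        write_silence(bad, 44100, 2)
        print(matches_expected_format(good))  # True
        print(matches_expected_format(bad))   # False
```
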

Datasets will be released with metadata files organized in the following format:

FIELD    DESCRIPTION
SEX      Speaker gender
AGE      Speaker age
ACT      Accent type
MIT      Recording device
SCC      Recording environment
LBR      Utterance duration
ORS      Raw text
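
The field codes above can be read into a simple record, as in the sketch below. The page does not specify the on-disk layout of the metadata files, so the one-field-per-line, tab-separated layout assumed here is purely hypothetical, as are the function name and the sample values; adapt the parsing to the files actually released.

```python
from typing import Dict

# Field codes and descriptions as listed in the metadata table.
FIELD_DESCRIPTIONS: Dict[str, str] = {
    "SEX": "Speaker gender",
    "AGE": "Speaker age",
    "ACT": "Accent type",
    "MIT": "Recording device",
    "SCC": "Recording environment",
    "LBR": "Utterance duration",
    "ORS": "Raw text",
}


def parse_metadata(text: str) -> Dict[str, str]:
    """Parse metadata text into a {field code: value} record.

    Assumption: each line holds one field as "CODE<TAB>value".
    Lines with unknown codes and blank lines are ignored.
    """
    record: Dict[str, str] = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        code, _, value = line.partition("\t")
        code = code.strip()
        if code in FIELD_DESCRIPTIONS:
            record[code] = value.strip()
    return record


# Hypothetical sample content, for illustration only.
sample = "SEX\tFemale\nAGE\t25\nACT\tIndia\nORS\tturn on the light\n"
for code, value in parse_metadata(sample).items():
    print(f"{FIELD_DESCRIPTIONS[code]}: {value}")
```
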

LibriSpeech data may also be used in both tracks (http://www.openslr.org/12/).

Challenge Schedule

Awards

Note: All prize amounts include tax.

International Scientific Committee

(Names listed in no particular order)

Lei Xie

Northwestern Polytechnical University, China

Yanmin Qian

Shanghai Jiao Tong University, China

Shinji Watanabe

Johns Hopkins University, United States

Chng Eng Siong

Nanyang Technological University, Singapore

Qiangze Feng

Datatang (Beijing) Technology Co., Ltd., China

Participants

The challenge is open to universities, scientific research institutes, and internet enterprises.

Note: Employees of the challenge organizers and technical support units who have access to business, product, or data information related to the challenge are automatically withdrawn from the challenge and give up their qualification to participate.

Registration

  • If you are interested in the challenge, please contact us by email at interspeech2020@datatang.com.
  • Download the registration form (English or Chinese version), fill in the information, and send it to the email address above. The registration deadline is Aug 31, 2020.
  • The organizing committee will review and verify the qualifications of the participating teams within 5 working days. Teams that pass the review will sign the challenge data usage agreement and will then be qualified to join the challenge.
  • The training data will be announced on Aug 31, 2020, and the download method will be provided to participants who have passed the review and signed the agreement.

Anti-cheating Statement

  • Participants are prohibited from registering more than once.

  • Participants must strictly obey the data usage rules of each track. Teams that break the rules will be disqualified from using the data, and their results will be invalid.

Q&A

Download FAQ for 2020AESRC

All rights reserved by Datatang (Beijing) Technology Co.
