Difference between revisions of "OLR Challenge 2017"
Revision as of 02:58, 22 April 2017 (Saturday)
Oriental Language Recognition (OLR) 2017 Challenge
Oriental languages have interesting special characteristics. The OLR challenge series aims to advance language recognition technology for oriental languages. Following the success of OLR challenge 2016, the 2017 challenge sets more demanding tasks that involve more languages and shorter speech segments.
Data
The challenge is based on two multilingual databases: AP16-OL7, which was designed for the OLR challenge 2016, and a new complementary database, AP17-OL3. Both databases are provided by SpeechOcean (www.speechocean.com).
The features of AP16-OL7 include:
- Mobile channel
- 7 languages in total
- 24 speakers (18 speakers for training/development, 6 speakers for test).
- 71 hours of speech signals in total
- Transcriptions and lexica are provided
- The data profile is in User Agreement-AP16-OL7-Format.pdf
- The licence for the data is on the AP16-OL7-licence page
The features of AP17-OL3 include:
- Mobile channel
- 3 languages in total
- 24 speakers (18 speakers for training/development, 6 speakers for test).
- 30 hours of speech signals in total
- Transcriptions and lexica are provided
- The data profile is in User Agreement-AP17-OL3-Format.pdf
- The licence for the data is on the AP17-OL3-licence page
Evaluation tools
- The Kaldi-based baseline scripts: Baseline.rar
- The evaluation toolkit: Tools.rar
Participation rules
- Participants of both the special session and the AP16-OLR challenge can apply for AP16-OL7 by sending emails to the organizers (see below).
- Agreement for the usage of AP16-OL7 should be signed and returned to the organizer before the data can be downloaded.
- Publications based on AP16-OL7 should cite the following paper:
Dong Wang, Lantian Li, Difei Tang, Qing Chen, "AP16-OL7: a multilingual database for oriental languages and a language recognition baseline", submitted to APSIPA 2016 (pdf).
- The detailed evaluation plan can be found in the above paper as well.
Important dates
- May 20, AP17-OL7 training data release.
- Oct. 1, test data release.
- Oct. 2, 12:00 PM Beijing time, submission deadline.
- APSIPA17, results announcement.
Registration procedure
If you are interested in participating in the challenge, or if you have any questions, comments, or suggestions about it, please email the organizers:
- Dr. Dong Wang (wangdong99@mails.tsinghua.edu.cn)
- Ms. Qing Chen (chenqing@speechocean.com)
Organizers
- Dong Wang, Tsinghua University
- Lantian Li, Tsinghua University
- Qing Chen, SpeechOcean