“ASR-nsfc-data”版本间的差异
来自cslt Wiki
第8行: | 第8行: | ||
==Uyghur== | ==Uyghur== | ||
− | In the second phase, the Uyghur dataset | + | In the second phase, the Uyghur dataset consists of: |
− | * 136h speech audio and 353 speakers | + | * 136h speech audio and 353 speakers (166 males and 187 females). |
* transcription of the speech audio. | * transcription of the speech audio. | ||
* lexicon in word level. | * lexicon in word level. | ||
第16行: | 第16行: | ||
==Kazakh== | ==Kazakh== | ||
− | In the second phase, the Kazakh dataset | + | In the second phase, the Kazakh dataset consists of: |
− | * 78h speech audio and 86 speakers | + | * 78h speech audio and 86 speakers (40 males and 46 females). |
* transcription of the speech audio. | * transcription of the speech audio. | ||
* lexicon in word level. | * lexicon in word level. | ||
第24行: | 第24行: | ||
==Tibetan== | ==Tibetan== | ||
− | In the second phase, the Tibetan dataset | + | In the second phase, the Tibetan dataset consists of: |
− | * 72h speech audio and 147 speakers | + | * 72h speech audio and 147 speakers (66 males and 81 females). |
* transcription of the speech audio. | * transcription of the speech audio. | ||
* lexicon in word level. | * lexicon in word level. |
2020年6月3日 (三) 00:49的版本
Data resources
In order to promote the development of minority speech signal processing technology, we will publish all the M2ASR dataset to scientific research institutions for free. You should ask for license before you can download the dataset.
Please send Email to shiying@cslt.org or lilt@cslt.org to get the license.
Uyghur
In the second phase, the Uyghur dataset consists of:
- 136h speech audio and 353 speakers (166 males and 187 females).
- transcription of the speech audio.
- lexicon in word level.
Kazakh
In the second phase, the Kazakh dataset consists of:
- 78h speech audio and 86 speakers (40 males and 46 females).
- transcription of the speech audio.
- lexicon in word level.
Tibetan
In the second phase, the Tibetan dataset consists of:
- 72h speech audio and 147 speakers (66 males and 81 females).
- transcription of the speech audio.
- lexicon in word level.
Mongolian
Coming soon...
Kirgiz
Coming soon...