ASR-nsfc-resource

From cslt Wiki
Revision as of 03:14, 2 November 2017 (Thursday) by Tangzy

Language resources

Uyghur

M2ASR-UYGH-PRIO Speech Database (XJU)

phone set

lexicon

train set transcription

test set transcription

In the first phase, we release 100 hours of M2ASR-UYGH-PRIO speech audio for free. A license is required.


Kazakh

M2ASR-KAZAK-PRIO Speech Database (XJU)

phone set

lexicon

train transcription

test transcription

In the first phase, we release 50 hours of M2ASR-KAZAK-PRIO speech audio for free. A license is required.


Tibetan

M2ASR-TIBETAN-PRIO Speech Database (NMU)

phone set

lexicon

train transcription

test transcription

In the first phase, we release 50 hours of M2ASR-TIBETAN-PRIO speech audio for free. A license is required.

Note: At present, Tibetan letters in the transcriptions are represented in Unicode.
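
Since the Tibetan transcriptions are Unicode text, a quick sanity check after download can flag characters that fall outside the Tibetan Unicode block (U+0F00 to U+0FFF). The following is only a minimal sketch: the helper names and the sample string are hypothetical and are not taken from the database, and the actual transcription file layout (utterance IDs, separators, encoding) is not described on this page, so adapt it accordingly.

```python
# Tibetan Unicode block boundaries (U+0F00 through U+0FFF).
TIBETAN_START = 0x0F00
TIBETAN_END = 0x0FFF


def is_tibetan_char(ch):
    """Return True if the single character ch lies in the Tibetan block."""
    return TIBETAN_START <= ord(ch) <= TIBETAN_END


def non_tibetan_chars(line):
    """Return the non-whitespace characters of line that lie outside the block."""
    return [ch for ch in line if not ch.isspace() and not is_tibetan_char(ch)]


# Hypothetical sample text, written with escapes so the code points are explicit.
sample = "\u0f56\u0f7c\u0f51\u0f0b\u0f66\u0f90\u0f51\u0f0b"

# Show the code point of every character in the sample.
print([f"U+{ord(ch):04X}" for ch in sample])

# Characters outside the Tibetan block, if any.
print(non_tibetan_chars(sample))
```

If the real transcription lines carry an utterance ID or other Latin-script fields, strip those before running the check, otherwise they will be reported as non-Tibetan characters.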



Mongolian

M2ASR-MONGOLIAN-PRIO Speech Database (NMU)

lexicon

In the first phase, we release 50 hours of M2ASR-MONGOLIAN-PRIO speech audio for free. A license is required.


Kirgiz

Coming soon.


Models

Systems

Platform

License

Contact Dr. Dong Wang (wangdong99@mails.tsinghua.edu.cn)