Differences between revisions of "M2asr-delivery-phaseI"

From cslt Wiki
 
(4 intermediate revisions by the same user not shown)
Line 11: Line 11:
 
[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/2/2c/Uyghur.lexicon.doc lexicon]
 
  
[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/7/71/Uyghur.train_trans.txt train set transcription]
+
In the first phase, we release 100h speech audio of M2ASR-UYGH-PRIO for free. LICENSE is needed.
 
+
[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/5/57/Uyghur.test_trans.txt test set transcription]
+
 
+
In the first phase, we release 100h speech audio of M2ASR-UYGH-PRIO for free. LICENSE is needed.  
+
 
+
  
 
==Kazakh==
 
Line 28: Line 23:
 
[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/3/34/Kazak.lexicon.txt lexicon]
 
  
[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/9/9e/Kazak.train_trans.txt train transcription]
+
In the first phase, we release 50h speech audio of M2ASR-KAZAK-PRIO for free. LICENSE is needed.
 
+
[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/1/17/Kazak.test_trans.txt test transcription]
+
 
+
In the first phase, we release 50h speech audio of M2ASR-KAZAK-PRIO for free. LICENSE is needed.  
+
 
+
 
+
  
 
==Tibetan==
 
Line 60: Line 49:
  
 
[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/7/70/Mongol_dict_v2.0.txt lexicon]
 
 
In the first phase, we release 50h speech audio of M2ASR-MONGOLIAN-PRIO for free. LICENSE is needed.
 
 
 
  
 
==Kirgiz==
 

Latest revision as of 06:58, 28 January 2018 (Sunday)

Language resources

Uyghur

M2ASR-UYGH-PRIO Speech Database (XJU)

phone set

lexicon

In the first phase, we release 100 hours of speech audio from M2ASR-UYGH-PRIO for free. A license is required.

Kazakh

M2ASR-KAZAK-PRIO Speech Database (XJU)

phone set

lexicon

In the first phase, we release 50 hours of speech audio from M2ASR-KAZAK-PRIO for free. A license is required.

Tibetan

M2ASR-TIBETAN-PRIO Speech Database (NMU)

phone set

lexicon

train transcription

test transcription

In the first phase, we release 10 hours of speech audio from M2ASR-TIBETAN-PRIO for free. A license is required.

Note: At present, the transcriptions represent Tibetan letters with Tibetan Unicode characters.
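
As a rough illustration of what the note above implies, the following sketch checks how much of a transcription line falls inside the Unicode Tibetan block (U+0F00 to U+0FFF). The helper names and the sample string are hypothetical, and the layout of the released transcription files is not assumed here; the sketch only operates on a single line of text.

```python
def is_tibetan(ch: str) -> bool:
    """Return True if ch lies in the Unicode Tibetan block (U+0F00 to U+0FFF)."""
    return 0x0F00 <= ord(ch) <= 0x0FFF


def tibetan_ratio(line: str) -> float:
    """Fraction of non-whitespace characters in line that are Tibetan."""
    chars = [c for c in line if not c.isspace()]
    if not chars:
        return 0.0
    return sum(is_tibetan(c) for c in chars) / len(chars)


# Hypothetical sample text (not taken from the corpus): three Tibetan
# characters followed by a tsheg, the syllable delimiter U+0F0B.
sample = "\u0f56\u0f7c\u0f51\u0f0b"
print(tibetan_ratio(sample))
```

A ratio well below 1 on a transcription line may indicate utterance IDs, Latin-script tokens, or a different encoding, so a check like this is best treated as a quick sanity test rather than a validator.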

Mongolian

M2ASR-MONGOLIAN-PRIO Speech Database (NMU)

lexicon

Kirgiz

Coming soon.


Models

Systems

Platform

License

Contact Dr. Dong Wang (wangdong99@mails.tsinghua.edu.cn)