“ASR-nsfc-resource”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
第15行: 第15行:
 
[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/5/57/Uyghur.test_trans.txt test set transcription]
 
[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/5/57/Uyghur.test_trans.txt test set transcription]
  
 +
In the first phase, we release 100h speech audio of M2ASR-UYGH-PRIO for free. LICENSE is needed.
  
  
第28行: 第29行:
  
 
[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/1/17/Kazak.test_trans.txt test transcription]
 
[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/1/17/Kazak.test_trans.txt test transcription]
 +
 +
In the first phase, we release 50h speech audio of M2ASR-KAZAK-PRIO for free. LICENSE is needed.
  
  
第42行: 第45行:
  
 
[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/8/8d/Tibetan.test_trans.txt test transcription]
 
[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/8/8d/Tibetan.test_trans.txt test transcription]
 +
 +
In the first phase, we release 50h speech audio of M2ASR-TIBETAN-PRIO for free. LICENSE is needed.
  
 
Note: We use the unicode of tibetan to represent tibetan letters in the transcriptions at present.
 
Note: We use the unicode of tibetan to represent tibetan letters in the transcriptions at present.
第53行: 第58行:
  
 
[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/7/70/Mongol_dict_v2.0.txt lexicon]
 
[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/7/70/Mongol_dict_v2.0.txt lexicon]
 +
 +
In the first phase, we release 50h speech audio of M2ASR-MONGOLIAN-PRIO for free. LICENSE is needed.
  
  

2017年11月2日 (四) 03:14的版本

Language resources

Uyghur

M2ASR-UYGH-PRIO Speech Database (XJU)

phone set

lexicon

train set transcription

test set transcription

In the first phase, we release 100h speech audio of M2ASR-UYGH-PRIO for free. LICENSE is needed.


Kazakh

M2ASR-KAZAK-PRIO Speech Database (XJU)

phone set

lexicon

train transcription

test transcription

In the first phase, we release 50h speech audio of M2ASR-KAZAK-PRIO for free. LICENSE is needed.


Tibetan

M2ASR-TIBETAN-PRIO Speech Database (NMU)

phone set

lexicon

train transcription

test transcription

In the first phase, we release 50h speech audio of M2ASR-TIBETAN-PRIO for free. LICENSE is needed.

Note: We use the unicode of tibetan to represent tibetan letters in the transcriptions at present.



Mongolian

M2ASR-MONGOLIAN-PRIO Speech Database (NMU)

lexicon

In the first phase, we release 50h speech audio of M2ASR-MONGOLIAN-PRIO for free. LICENSE is needed.


Kirgiz

Coming soon.


Models

Systems

Platform

License

Contact Dr. Dong Wang (WangDong99@mails.tsinghua.edu.cn)