Difference between revisions of "M2asr-release-resource-uyghur-speech-v1"


Revision as of 07:07, 5 November 2017 (Sunday)

Meta info

  • Name: M2ASR-UYGH-PRIOR Database
  • Type: Speech
  • Author: Askar Hamdulla
  • Licence: Xinjiang University

Release Note


M2ASR-UYGHUR speech data is a Uyghur speech database collected by Prof. Askar Hamdulla. The entire database contains more than 100 hours of speech, recorded with
desktop microphones. The sampling rate is 16 kHz, and the sample precision is 16 bits.

Before M2ASR-UYGHUR, we released a Uyghur speech database named THUYG20, which contains about 20 hours of speech. This database is not part of M2ASR (the IP belongs to Xinjiang University),
but Prof. Askar Hamdulla contributed the data to build the Uyghur baseline system. THUYG20 can be downloaded from [ http://data.cslt.org/thuyg20-openslr/README.html here ]
Location: /m2asr/release/resource/uyghur/speech/m2asr-uygh-prior
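
The audio format stated in the release note (16 kHz sampling rate, 16-bit precision) can be checked per file with a short script. The following is only a minimal sketch: it assumes the recordings are stored as PCM WAV files, which this page does not state, and the function name check_wav is hypothetical.

```python
import wave

EXPECTED_RATE_HZ = 16000   # 16 kHz, as stated in the release note
EXPECTED_SAMPLE_BYTES = 2  # 16-bit precision = 2 bytes per sample


def check_wav(path):
    """Return (ok, duration_seconds) for one PCM WAV file.

    ok is True when the file matches the stated sampling rate and
    sample width. Assumes WAV storage, which is not confirmed here.
    """
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        ok = (rate == EXPECTED_RATE_HZ
              and wav.getsampwidth() == EXPECTED_SAMPLE_BYTES)
        duration = wav.getnframes() / rate
    return ok, duration
```

Summing the returned durations over all files would give a rough check of the "more than 100 hours" figure, again assuming the WAV layout above.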