M2asr-release-resource-uyghur-speech-v1

来自cslt Wiki
2016年12月28日 (三) 08:30Cslt讨论 | 贡献的版本

跳转至: 导航搜索

Meta info

  • NAME: M2ASR-UYGH-PRIOR speech database
  • Type: Speech database
  • Author: Askar Hamdulla
  • Licence: Xinjiang University

Release Note


M2ASR-UYGH-PRIO speech data is a Uyghur speech database collected by Prof. Askar Hamdulla in 2012.01-2012.09. The entire database involves more than 100 hours of speech data, recorded by
desktop microphones. The sampling rate is 16k Hz, and the precision is 16 bits. 

The open database THUYG20 is sampled from this dataset. More information can be found in the paper below:

Askar Rouze, Shi Yin, Zhiyong Zhang, Dong Wang, Askar Humdulla, Fang Zheng, "THUYG THUYG-20: A free Uyghur Speech Database", NCMSSC 2015
http://wangd.cslt.org/public/pdf/urghur.pdf


This database is not part of M2ASR (the IP is Xinjiang University), however Prof. Askar Hamdulla contribute this data to build the Uyghur baseline system.

Location: /m2asr/release/resource/uyghur/speech/m2asr-uygh-prior