M2asr-release-resource-uyghur-speech-v1

来自cslt Wiki
2017年11月2日 (四) 00:45Tangzy讨论 | 贡献的版本

跳转至: 导航搜索

Meta info

  • Name: M2ASR-UYGH-PRIOR Database
  • Type: Speech
  • Author: Askar Hamdulla
  • Licence: Xinjiang University

Release Note


M2ASR-UYGH-PRIO speech data is a Uyghur speech database collected by Prof. Askar Hamdulla in 2012.01-2012.09. The entire database involves more than 100 hours of speech data, recorded by
desktop microphones. The sampling rate is 16 kHz, and the precision is 16 bits. 

The open database THUYG20 is sampled from this dataset. More information can be found in the paper below:

Askar Rouze, Shi Yin, Zhiyong Zhang, Dong Wang, Askar Humdulla, Fang Zheng, "THUYG THUYG-20: A free Uyghur Speech Database", NCMSSC 2015
http://wangd.cslt.org/public/pdf/urghur.pdf


This database is not part of M2ASR (the IP is Xinjiang University), however Prof. Askar Hamdulla contributed this data to build the Uyghur baseline system.

Location: /m2asr/release/resource/uyghur/speech/m2asr-uygh-prior