Public data

来自cslt Wiki
2014年9月30日 (二) 08:34Cslt讨论 | 贡献的版本

跳转至: 导航搜索

CCC data resource

CSLT holds a close collaboration with Chinese Corpus Consortium (CCC) to collect and publish databases in China. The aim of the CCC is to provide corpora for Chinese ASR, TTS, NLP, perception analysis, phonetics analysis, linguistic analysis, and other related tasks. The corpora can be speech- or text-based; read or spontaneous; wideband or narrowband; standard or dialectal Chinese; clean or with noise; or of any other kinds which are deemed helpful for the foresaid purposes.

Visit CCC

Uyghur text database

Uyghur database