“Public data”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
Tibetan ASR database
 
(4位用户的14个中间修订版本未显示)
第5行: 第5行:
 
[http://www.cccforum.org visit CCC]
 
[http://www.cccforum.org visit CCC]
  
==Uyghur text database==
 
  
 +
==CNCeleb1==
 +
This data is a large-scale speaker recognition dataset collected 'in the wild'. The dataset contains more than 130,000 utterances from 1,000 Chinese celebrities,
 +
and covers 11 different genres in real world. All the audio files are coded as single channel and sampled at 16kHz with 16-bit precision.
 +
 +
[http://www.openslr.org/82/ download from openSLR]
 +
 +
 +
==Trivial events database==
 +
A free database involving 7 types of human trivial events: cough, laugh, "wei", "hmm", "tsk-tsk", "ahem", sniff. The data is
 +
collected using a recording Android App.
 +
 +
[https://share.weiyun.com/389a55251c59fc4f9740d5c28be380f7 download from Cloud]
 +
 +
 +
==Disguise database==
 +
A free database involving human's normal speech and disguised speech. The data is collected using a recording Android App.
 +
 +
[https://share.weiyun.com/a7355eb4321dafd2887460daa915191d download from Cloud]
 +
 +
 +
==Uyghur text database==
 
CSLT collaborated with the [http://www.xju.edu.cn/ XinJiang University] on a wide range of research including speech recognition, information retrieval and text processing. We published a multitude of resources to boost the research on Uyghur. The text data published here is used for Uyghur text classification tasks, which involves 500 health and non-health documents respectively. It was collected by Mahpirat from XJU when she visited CSLT from 2012-2013.  
 
CSLT collaborated with the [http://www.xju.edu.cn/ XinJiang University] on a wide range of research including speech recognition, information retrieval and text processing. We published a multitude of resources to boost the research on Uyghur. The text data published here is used for Uyghur text classification tasks, which involves 500 health and non-health documents respectively. It was collected by Mahpirat from XJU when she visited CSLT from 2012-2013.  
  
[http://data.cslt.org/uygh/zip/data.tar.gz download] [http://pan.baidu.com/s/1hqKwE00 download from Baidu]
+
Download is under construction....
  
  
 
==Sheik Cantonese lexicon==
 
==Sheik Cantonese lexicon==
 
 
A free Cantonese lexicon collected from Adam Sheik's Cantonese Dict project.  
 
A free Cantonese lexicon collected from Adam Sheik's Cantonese Dict project.  
  
 
[http://data.cslt.org/cantonese/sheik/index.html check details]
 
[http://data.cslt.org/cantonese/sheik/index.html check details]
  
== THUYG-20 database ==
 
  
 +
== THUYG-20 database ==
 
A free speech database for constructing a full-fledged Uyghur ASR system.  
 
A free speech database for constructing a full-fledged Uyghur ASR system.  
  
[http://data.cslt.org/thuyg20/README.html check details]
+
Download is under construction....
  
== THUYG-20 SRE database ==
 
  
 +
== THUYG-20 SRE database ==
 
A free speech database for constructing a full-fledged Uyghur speaker recognition system.  
 
A free speech database for constructing a full-fledged Uyghur speaker recognition system.  
  
[http://data.cslt.org/thuyg20-sre/README.html check details]
+
Download is under construction....
  
  
 
== SUD-12 database ==
 
== SUD-12 database ==
 
 
A speech database used for short utterance speaker recognition
 
A speech database used for short utterance speaker recognition
  
 
[http://data.cslt.org/susr/SUB12/index.html check details]
 
[http://data.cslt.org/susr/SUB12/index.html check details]
  
== THUCH30 database ==
 
  
 +
== THUCH30 database ==
 
A speech database used for Chinese LVCSR. Recorded by Dong Wang many many years ago.
 
A speech database used for Chinese LVCSR. Recorded by Dong Wang many many years ago.
  
 
[http://data.cslt.org/thchs30/README.html check details]
 
[http://data.cslt.org/thchs30/README.html check details]
  
==kazak ASR database==
+
 
 +
==Kazak ASR database==
 
A speech database used for Kazak LVCSR.  
 
A speech database used for Kazak LVCSR.  
  
第50行: 第69行:
  
 
[https://share.weiyun.com/4cf4ec64e4e59f8280de8c7baecaad27  QQ weiyun share link]
 
[https://share.weiyun.com/4cf4ec64e4e59f8280de8c7baecaad27  QQ weiyun share link]
 +
 +
You can send e-mail to shiying@cslt.org to ask for share password.
 +
  
 
==Tibetan ASR database==
 
==Tibetan ASR database==
第59行: 第81行:
 
[https://share.weiyun.com/da691bff0f7c641646ae9fb1154ffdce QQ weiyun share link]
 
[https://share.weiyun.com/da691bff0f7c641646ae9fb1154ffdce QQ weiyun share link]
  
You can send e-mail to shiying@cslt.riit.tsinghua.edu.cn to ask for share password.
+
You can send e-mail to shiying@cslt.org to ask for share password.
 +
 
 +
 
 +
==CSLT-Chronos database==
 +
A time-varying dataset for speaker recognition.
 +
 
 +
The entire package involves the 8 recording sessions selected to evaluate the time-variance effect.
 +
For commercial usage, only the MFCCs/Fbanks are published for research usage.
 +
 
 +
[http://166.111.134.19:7777/data/chronos/README.md Readme]
 +
[http://166.111.134.19:7777/data/chronos/CSLT-Chronos.tar.gz Download]
 +
 
 +
For any query, please contact with lilt@cslt.org
 +
 
 +
 
 +
==CSLT-ESDB database==
 +
A speech dataset for speech emotion recognition.
 +
 
 +
The entire package involves 4 emotions recorded by 30 actors.
 +
 
 +
[http://166.111.134.19:7777/data/cslt-esdb/README.md Readme]
 +
[http://166.111.134.19:7777/data/cslt-esdb/CSLT-ESDB.tar.gz Download]
 +
 
 +
For any query, please contact with shr19@mails.tsinghua.edu.cn

2021年7月28日 (三) 02:48的最后版本

CCC data resource

CSLT holds a close collaboration with Chinese Corpus Consortium (CCC) to collect and publish databases in China. The aim of the CCC is to provide corpora for Chinese ASR, TTS, NLP, perception analysis, phonetics analysis, linguistic analysis, and other related tasks. The corpora can be speech- or text-based; read or spontaneous; wideband or narrowband; standard or dialectal Chinese; clean or with noise; or of any other kinds which are deemed helpful for the foresaid purposes.

visit CCC


CNCeleb1

This data is a large-scale speaker recognition dataset collected 'in the wild'. The dataset contains more than 130,000 utterances from 1,000 Chinese celebrities, and covers 11 different genres in real world. All the audio files are coded as single channel and sampled at 16kHz with 16-bit precision.

download from openSLR


Trivial events database

A free database involving 7 types of human trivial events: cough, laugh, "wei", "hmm", "tsk-tsk", "ahem", sniff. The data is collected using a recording Android App.

download from Cloud


Disguise database

A free database involving human's normal speech and disguised speech. The data is collected using a recording Android App.

download from Cloud


Uyghur text database

CSLT collaborated with the XinJiang University on a wide range of research including speech recognition, information retrieval and text processing. We published a multitude of resources to boost the research on Uyghur. The text data published here is used for Uyghur text classification tasks, which involves 500 health and non-health documents respectively. It was collected by Mahpirat from XJU when she visited CSLT from 2012-2013.

Download is under construction....


Sheik Cantonese lexicon

A free Cantonese lexicon collected from Adam Sheik's Cantonese Dict project.

check details


THUYG-20 database

A free speech database for constructing a full-fledged Uyghur ASR system.

Download is under construction....


THUYG-20 SRE database

A free speech database for constructing a full-fledged Uyghur speaker recognition system.

Download is under construction....


SUD-12 database

A speech database used for short utterance speaker recognition

check details


THUCH30 database

A speech database used for Chinese LVCSR. Recorded by Dong Wang many many years ago.

check details


Kazak ASR database

A speech database used for Kazak LVCSR.

The entire package involves the full set of speech and language resources required to establish a Kazak speech recognition system.

QQ weiyun share link

You can send e-mail to shiying@cslt.org to ask for share password.


Tibetan ASR database

A speech database used for Tibetan LVCSR.

The entire package involves the full set of speech and language resources required to establish a Tibetan speech recognition system.

QQ weiyun share link

You can send e-mail to shiying@cslt.org to ask for share password.


CSLT-Chronos database

A time-varying dataset for speaker recognition.

The entire package involves the 8 recording sessions selected to evaluate the time-variance effect. For commercial usage, only the MFCCs/Fbanks are published for research usage.

Readme Download

For any query, please contact with lilt@cslt.org


CSLT-ESDB database

A speech dataset for speech emotion recognition.

The entire package involves 4 emotions recorded by 30 actors.

Readme Download

For any query, please contact with shr19@mails.tsinghua.edu.cn