2016年5月16日 (一) 23:44的版本

Mixlingual Speech Processing and Chinese-English MixASR Challenge

Organizers: Dong Wang(Tsinghua Univ.), Qing Chen(Speech Ocean)
Email: wangdong99@mails.tsinghua.edu.cn; chenqing@speechocean.com

Introduction

The modern society demonstrates clear mutual influence among languages, e.g., Mandarin to language minor languages in China, and English to other languages in the world. This leads to a clear mixlingual phenomenon, i.e., some words of a foreign (or target, embedded) language are embedded in a host (or source, matrix) language. This mixlingual effect causes significant problems in various speech processing tasks. This special session focuses on recent research on mixlingual speech processing from a broad range of disciplines, including but not limited to speech recognition, speech synthesis, speech analysis, spoken understanding. Particularly, this special session calls for a mixlingual ASR challenge, for which we offer a large Chinese-English mixlingual speech database THT-MCE120 (provided by Speechocean) that involves 120h of speech data and the associated resources.

Scope

This special session is expected to attract papers on recent research progress in the area of mixlingual speech processing. The targeted research topics are, but not limited to, the following:

 Mixlingual phonetic and phonological analysis
 Mixlingual speech recognition
 Mixlingual speech synthesis
 Language turn detection
 Mixlingual language understanding

Chinese-English Mixlingual ASR (MixASR-CHEN) Challenge

The THT-MCE120 database involves 120h of Chinese-English mixlingual data, where English words are embedded in the host Chinese sentences. This special session call for a Chinese-English MixASR challenge based on this database. The data will be free to institutes who (1) participate the MixASR-CHEN challenge; (2) participate this special session and require data to evaluate their research.

2016年5月16日 (一) 23:44的版本（查看源代码） Cslt（讨论 \| 贡献） ←上一编辑		2016年5月16日 (一) 23:44的版本（查看源代码） Cslt（讨论 \| 贡献）下一编辑→
第18行：		第18行：
	==Chinese-English Mixlingual ASR (MixASR-CHEN) Challenge==		==Chinese-English Mixlingual ASR (MixASR-CHEN) Challenge==

−	The THT-MCE120 database involves 120h of Chinese-English mixlingual data, where English words are embedded in the host Chinese sentences. This special session call for a Chinese-English MixASR challenge based on this database. ~~The data~~ '''will be free''' ~~to institutes~~ who (1) participate the MixASR-CHEN challenge; (2) participate this special session and require data to evaluate their research.	+	The THT-MCE120 database involves 120h of Chinese-English mixlingual data, where English words are embedded in the host Chinese sentences. This special session call for a Chinese-English MixASR challenge based on this database. '''The data will be free to institutes''' who (1) participate the MixASR-CHEN challenge; (2) participate this special session and require data to evaluate their research.

“News-201605172”版本间的差异

2016年5月16日 (一) 23:44的版本

目录

Mixlingual Speech Processing and Chinese-English MixASR Challenge

Introduction

Scope

Chinese-English Mixlingual ASR (MixASR-CHEN) Challenge

导航菜单

个人工具

名字空间

变种

查看

操作

搜索

导航

工具