“AP17:OLR-special session”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
第1行: 第1行:
===Title===
+
==Title==
  
 
Minor- and Multilingual speech and language processing
 
Minor- and Multilingual speech and language processing
  
===Organizers===
+
==Organizers==
  
 
Dong Wang: Tsinghua University (wangdong99@mails.tsinghua.edu.cn)
 
Dong Wang: Tsinghua University (wangdong99@mails.tsinghua.edu.cn)
第10行: 第10行:
  
  
===Introduction===
+
==Introduction==
  
  
第25行: 第25行:
  
  
===Potential Papers===
+
==Potential Papers==
  
  
Title: Prior-constrained multilingual speech recognition  
+
===Title: Prior-constrained multilingual speech recognition ===
Author: Ying Shi, Zhiyuan Tang, Dong Wang
+
*Author: Ying Shi, Zhiyuan Tang, Dong Wang
  
Abstract: Conventional multilingual speech recognition follows ether a tandem approach (language identification)  
+
*Abstract: Conventional multilingual speech recognition follows ether a tandem approach (language identification)  
 
or parallel architecture (parallel decoding). This paper presented a novel prior-constrained approach that  
 
or parallel architecture (parallel decoding). This paper presented a novel prior-constrained approach that  
 
conduct the decoding in a multilingual linguistic space, where a prior of the language is used to constrain  
 
conduct the decoding in a multilingual linguistic space, where a prior of the language is used to constrain  
第38行: 第38行:
  
  
Title: Memory-based Uyghur-Chinese Translation
+
===Title: Memory-based Uyghur-Chinese Translation===
Author: Shiyue Zhang, Guli, Mijit Ablimit, Askar Hamdulla
+
*Author: Shiyue Zhang, Guli, Mijit Ablimit, Askar Hamdulla
  
Abstract: Neural machine translation (NMT) has achieved significant performance. However, this NMT approach  
+
*Abstract: Neural machine translation (NMT) has achieved significant performance. However, this NMT approach  
 
has not yet effectively applied to minor languages such as Uyghur to Chinese translation. The main problem here  
 
has not yet effectively applied to minor languages such as Uyghur to Chinese translation. The main problem here  
 
is that the limited training data does not support an end-to-end neural learning. In this paper, we propose to  
 
is that the limited training data does not support an end-to-end neural learning. In this paper, we propose to  
第48行: 第48行:
 
statistical machine translation (SMT) approach.  
 
statistical machine translation (SMT) approach.  
  
Title: Resource construction for Mongolia  
+
===Title: Resource construction for Mongolia ===
Author: Shipeng Xu, Guanyu Li, Hongzhi Yu
+
*Author: Shipeng Xu, Guanyu Li, Hongzhi Yu
  
Abstract: Mongolia is a typical low-resource language. The resource limitation is in various aspects, from acoustic  
+
*Abstract: Mongolia is a typical low-resource language. The resource limitation is in various aspects, from acoustic  
 
analysis, phonetic rules, lexicon, speech and text data. This paper describes our recent progression on Mongolia  
 
analysis, phonetic rules, lexicon, speech and text data. This paper describes our recent progression on Mongolia  
 
resource construction supported by the NSFC project.  
 
resource construction supported by the NSFC project.  
  
Title: Resource construction for tibetan
+
===Title: Resource construction for tibetan===
Author: Guanyu Li, Hongzhi Yu
+
*Author: Guanyu Li, Hongzhi Yu
  
Abstract: Tibetan is an important low-resource language in China. The syllable structure of Tibetan is similar  
+
*Abstract: Tibetan is an important low-resource language in China. The syllable structure of Tibetan is similar  
 
as Chinese, but the composition rules in orthographic forms is highly complex. Additionally, the lexicon  
 
as Chinese, but the composition rules in orthographic forms is highly complex. Additionally, the lexicon  
 
resource is far from standard and rich. This paper describes our recent progression on Tibetan
 
resource is far from standard and rich. This paper describes our recent progression on Tibetan
 
resource construction supported by the NSFC M2ASR project.  
 
resource construction supported by the NSFC M2ASR project.  
  
Title: A large Kazak speech database and a speech recognition baseline
+
===Title: A large Kazak speech database and a speech recognition baseline===
Author: Askar Hamdulla, Ying Shi
+
*Author: Askar Hamdulla, Ying Shi
  
Abstract: We describe the construction process of a large scale Kazak speech database. The database involves  
+
*Abstract: We describe the construction process of a large scale Kazak speech database. The database involves  
 
150 hours of speech signals, recorded by more than 200 speakers. A speech recognition baseline system based  
 
150 hours of speech signals, recorded by more than 200 speakers. A speech recognition baseline system based  
 
on the Kaldi toolkit was also constructed. We hope this database will be a standard  dataset for a multiple  
 
on the Kaldi toolkit was also constructed. We hope this database will be a standard  dataset for a multiple  
 
Kazak speech processing tasks, including ASR, speaker recognition and language understanding.
 
Kazak speech processing tasks, including ASR, speaker recognition and language understanding.

2017年5月2日 (二) 01:56的版本

Title

Minor- and Multilingual speech and language processing

Organizers

Dong Wang: Tsinghua University (wangdong99@mails.tsinghua.edu.cn) Guanyu Li: Northwest National University (guanyu-li@163.com) Mijit Ablimit: Xinjiang University (mijit@xju.edu.cn)


Introduction

Minor- and multilingual phenomenon is a important for modern international societies. This special session focuses on minor- and multilingual speech and language processing, including but not limited to the following topics:

- Minor- and Multilingual phonetic and phonological analysis - Minor- and Multilingual speech recognition - Minor- and Multilingual speaker recognition - Minor- and Multilingual speech synthesis - Minor- and Multilingual language understanding - Resource construction for minor- and multilingual langauges


Potential Papers

Title: Prior-constrained multilingual speech recognition

  • Author: Ying Shi, Zhiyuan Tang, Dong Wang
  • Abstract: Conventional multilingual speech recognition follows ether a tandem approach (language identification)

or parallel architecture (parallel decoding). This paper presented a novel prior-constrained approach that conduct the decoding in a multilingual linguistic space, where a prior of the language is used to constrain the decoding frame by frame. Our experiments found that this approach can realize true simultaneous multilingual speech recognition.


Title: Memory-based Uyghur-Chinese Translation

  • Author: Shiyue Zhang, Guli, Mijit Ablimit, Askar Hamdulla
  • Abstract: Neural machine translation (NMT) has achieved significant performance. However, this NMT approach

has not yet effectively applied to minor languages such as Uyghur to Chinese translation. The main problem here is that the limited training data does not support an end-to-end neural learning. In this paper, we propose to use a memory structure to assist the NMT inference under the condition of limited resource languages. Our experiments demonstrated that the this approach is highly efficient compared to the vanilla NMT, and outperforms the conventional statistical machine translation (SMT) approach.

Title: Resource construction for Mongolia

  • Author: Shipeng Xu, Guanyu Li, Hongzhi Yu
  • Abstract: Mongolia is a typical low-resource language. The resource limitation is in various aspects, from acoustic

analysis, phonetic rules, lexicon, speech and text data. This paper describes our recent progression on Mongolia resource construction supported by the NSFC project.

Title: Resource construction for tibetan

  • Author: Guanyu Li, Hongzhi Yu
  • Abstract: Tibetan is an important low-resource language in China. The syllable structure of Tibetan is similar

as Chinese, but the composition rules in orthographic forms is highly complex. Additionally, the lexicon resource is far from standard and rich. This paper describes our recent progression on Tibetan resource construction supported by the NSFC M2ASR project.

Title: A large Kazak speech database and a speech recognition baseline

  • Author: Askar Hamdulla, Ying Shi
  • Abstract: We describe the construction process of a large scale Kazak speech database. The database involves

150 hours of speech signals, recorded by more than 200 speakers. A speech recognition baseline system based on the Kaldi toolkit was also constructed. We hope this database will be a standard dataset for a multiple Kazak speech processing tasks, including ASR, speaker recognition and language understanding.