AP17:OLR-special session

- Minor- and Multilingual phonetic and phonological analysis - Minor- and Multilingual speech recognition - Minor- and Multilingual speaker recognition - Minor- and Multilingual speech synthesis - Minor- and Multilingual language understanding - Resource construction for minor- and multilingual langauges

Potential Papers

Title: Prior-constrained multilingual speech recognition Author: Ying Shi, Zhiyuan Tang, Dong Wang

Abstract: Conventional multilingual speech recognition follows ether a tandem approach (language identification) or parallel architecture (parallel decoding). This paper presented a novel prior-constrained approach that conduct the decoding in a multilingual linguistic space, where a prior of the language is used to constrain the decoding frame by frame. Our experiments found that this approach can realize true simultaneous multilingual speech recognition.

Title: Memory-based Uyghur-Chinese Translation Author: Shiyue Zhang, Guli, Mijit Ablimit, Askar Hamdulla

Abstract: Neural machine translation (NMT) has achieved significant performance. However, this NMT approach has not yet effectively applied to minor languages such as Uyghur to Chinese translation. The main problem here is that the limited training data does not support an end-to-end neural learning. In this paper, we propose to use a memory structure to assist the NMT inference under the condition of limited resource languages. Our experiments demonstrated that the this approach is highly efficient compared to the vanilla NMT, and outperforms the conventional statistical machine translation (SMT) approach.

Title: Resource construction for Mongolia Author: XXX, Guanyu Li, Hongzhi Yu

Abstract: Mongolia is a typical low-resource language. The resource limitation is in various aspects, from acoustic analysis, phonetic rules, lexicon, speech and text data. This paper describes our recent progression on Mongolia resource construction supported by the NSFC project.

Title: A large Kazak speech database and a speech recognition baseline Author: Askar Hamdulla, Ying Shi

Abstract: We describe the construction process of a large scale Kazak speech database. The database involves 150 hours of speech signals, recorded by more than 200 speakers. A speech recognition baseline system based on the Kaldi toolkit was also constructed. We hope this database will be a standard dataset for a multiple Kazak speech processing tasks, including ASR, speaker recognition and language understanding.

AP17:OLR-special session

目录

Title

Organizers

Introduction

Potential Papers

导航菜单

个人工具

名字空间

变种

查看

操作

搜索

导航

工具