Research introduction

来自cslt Wiki
2022年8月15日 (一) 06:40Chenrenmiao讨论 | 贡献的版本

跳转至: 导航搜索

Research Scheme

CSLT devotes to cut-edging research in speech and language processing. The main research scheme involves understanding patterns of human speech and language, retrieving information from spoken language materials, integration of multilevel resources of speech and languages. The research filed covers psychology, signal processing, linguistics, pattern recognition and machine learning, artificial intelligence and knowledge representation.

Research Labs

Automatic Speech Recognition Lab

The ASR lab focuses on speech signal processing and spoken information retrieval. The research topics involve speech recognition, text to speech, spoken event detection, spoken term detection, spoken document retrieval. The current research focuses on sparse learning for speech signals, deep learning for speech signals and spoken documents, spoken term detection. The team has published more than 100 peer-reviewed papers, including some top-rank journals and international conferences, please check here

We collaborated with Sinovoice. Based on the HCI cloud platform, our ASR technology is now serving more than 10 million users. We were also working with Huilan and Pachira on intelligent QA and spoken information discovery. Previously, we collaborated with Tencent on the project of Tencent ASR system.


Group space

Voice Print Recognition Lab

The VPR lab of CSLT focus on speaker recognition technologies. The current research focuses on robust speaker recognition, DNN-based speaker recognition, factorization approaches, fast verification algorithms. The team has published more than 100 papers on the major journals and conferences, please check here.

The team is active in deliver practical systems and industrial solutions. Our partners involve a multitude of enterprises and organization, such as CCB. The delivered technologies are actively used by millions of people every day.


Natural Language Processing Lab

The NLP Lab focuses on two themes. The first theme involves various topics on natural language processing, including word segmentation, Chinese syntactic parsing, word sense induction, word sense disambiguation, sentiment analysis/opinion mining, and the second theme involves topics related to information retrieval, including query intent analysis, text clustering, text classification, topic model, document representation, etc.