ASR-nsfc-publication
Journal papers
- Yunqi Cai, Lantian Li, Andrew Abel, Xiaoyan Zhu, Dong Wang, "Deep Normalization for Speaker Vectors", IEEE Transactions on Audio, Speech and Language Processing, 2020.
- Dong Wang, "A Simulation Study on Optimal Scores for Speaker Recognition", EURASIP Journal on Audio, Speech, and Music Processing, 2020.
- Zhiyuan Tang, Lantian Li, Dong Wang, Ravichander Vipperla, "Collaborative Joint Training With Multitask Recurrent Model for Speech and Speaker Recognition", IEEE TASLP 2018, vol 25, no.3. online
- Zhiyuan Tang, Dong Wang, Yixiang Chen, Lantian Li, Andrew Abel, "Phonetic Temporal Neural Model for Language Identification", IEEE TASLP 2017. online
Conference papers
- Ying Shi, Haolin Chen, Zhiyuan Tang, Lantian Li, Dong Wang, Jiqing Han, Can We Trust Deep Speech Prior?, SLT 2021 [1]
- Zheng Li, Miao Zhao, Qingyang Hong, Lin Li, Zhiyuan Tang, Dong Wang, Liming Song, Cheng Yang, "AP20-OLR Challenge: Three Tasks and Their Baselines", APSIPA 2020. [2]
- Jiawen Kang, Ruiqi Liu, Lantian Li, Yunqi Cai, Dong Wang, Thomas Fang Zheng, "Domain-Invariant Speaker Vector Projection by Model-Agnostic Meta-Learning", Interspeech 2020. [3]
- Sitong Cheng, Zhixin Liu, Lantian Li, Zhiyuan Tang, Dong Wang, Thomas Fang Zheng, "ASR-Free Pronunciation Assessment", Interspeech 2020. [4]
- Lantian Li, Dong Wang, Thomas Fang Zheng, "Neural Discriminant Analysis for Deep Speaker Embedding", Interspeech 2020. [5]
- Yue Fan, Jiawen Kang, Lantian Li, Kaicheng Li, Haolin Chen, Sitong Cheng, Pengyuan Zhang, Ziya Zhou, Yunqi Cai, Dong Wang, "CN-CELEB: A Challenging Chinese Speaker Recognition Dataset", ICASSP 2020. [6]
- Wupeng Wang, Chao Xing, Dong Wang, Xiao Chen, Fengyu Sun, "A ROBUST AUDIO-VISUAL SPEECH ENHANCEMENT MODEL", ICASSP 2020, [7]
- Yang Zhang, Lantian Li, Dong Wang, "VAE-based regularization for deep speaker embedding", Interspeech 2019 [8].
- Lantian Li, Zhiyuan Tang, Ying Shi, Dong Wang, "Gaussian-Constrained Training for Speaker Verification", ICASSP 2019 [9]
- Jiyuan Zhang, Dong Wang, "Chinese Poetry Generation with Flexible Styles", ISCSLP 2018 [10].
- Jiyuan Zhang, Zheling Zhang, Shiyue Zhang, Dong Wang, "VV-COUPLET: AN OPEN SOURCE CHINESE COUPLET GENERATION SYSTEM", APSIPA 2018. [11]
- Zhiyuan Tang, Dong Wang, Qing Chen, "AP18-OLR CHALLENGE: THREE TASKS AND THEIR BASELINES", APSIPA 2018. [12]
- Ying Shi, Zhiyuan Tang, Lantian Li, Zheling Zhang, Dong Wang, "MAP AND RELABEL: TOWARDS ALMOST-ZERO RESOURCE SPEECH RECOGNITION", APSIPA 2018. [13]
- Miao Zhang, Xiaofei Kang, Yanqing Wang, Lantian Li, Zhiyuan Tang, Haisheng Dai, Dong Wang*, HUMAN AND MACHINE SPEAKER RECOGNITION BASED ON SHORT TRIVIAL EVENT, ICASSP 2018 arXiv
- Lantian Li, Zhiyuan Tang, Dong Wang, Thomas Fang Zheng, FULL-INFO TRAINING FOR DEEP SPEAKER FEATURE LEARNING, ICASSP 2018. arXiv
- Lantian Li, Dong Wang*, Yixiang Chen, Ying Shi, Zhiyuan Tang, Thomas Fang Zheng, DEEP FACTORIZATION FOR SPEECH SIGNAL, ICASSP 2018 arXiv
- Dong Wang, Thomas Fang Zheng, Zhiyuan Tang, Ying Shi, Lantian Li, Shiyue Zhang, Hongzhi Yu, Guanyu Li, Shipeng Xu, Askar Hamdulla, Mijit Ablimit, Gulnigar Mahmut, M2ASR: AMBITIONS AND FIRST YEAR PROGRESS, O-COCOSDA 2017. pdf
- Yang Feng, Shiyue Zhang, Andi Zhang, Dong Wang and Andrew Abel, Memory-augmented Neural Machine Translation, EMNLP 2017 [14].
- Lantian Li, Yixiang Chen, Dong Wang, Thomas Fang Zheng, A Study on Replay Attack and Anti-Spoofing for Automatic Speaker Verification, Interspeech 2017 [15].
- Lantian Li, Yixiang Chen, Ying Shi, Zhiyuan Tang, Dong Wang, "Deep Speaker Feature Learning for Text-independent Speaker Verification", Interspeech 2017[16].
- Jiyuan Zhang, Yang Feng, Dong Wang, Yang Wang, Andrew Abel, Shiyue Zhang, Andi Zhang, "Flexible and Creative Chinese Poetry Generation Using Neural Memory", ACL 2017 [17]
- Zhiyuan Tang, Ying Shi, Dong Wang, Yang Feng, and Shiyue Zhang, "Memory Visualization for Gated Recurrent Neural Networks in Speech Recognition", ICASSP 2017.[18]
- Zhiyuan Tang, Dong Wang, Yixiang Chen, Qing Chen, AP17-OLR Challenge: Data, Plan, and Baseline, APSIPA 2017, link: arXiv
- Shiyue Zhang, Gulnigar Mahmut, Dong Wang, Askar Hamdulla, Memory-augmented Chinese-Uyghur Neural Machine Translation, APSIPA 2017, link: arXiv
- Shipeng Xu, Hongzhi Yu, Thomas Fang Zheng and Jinghao Yan, Language Resource Construction for Mongolian, APSIPA 2017, pdf
- Guanyu Li, Hongzhi Yu, Thomas Fang Zheng, Jinghao Yan, Free Linguistic and Speech Resources for Tibetan, APSIPA 2017, link: pdf
- Ying Shi, Askar Hamdulla, Zhiyuan Tang, Dong Wang, Thomas Fang Zheng, A Free Kazak Speech Database and a Speech Recognition Baseline, APSIPA 2017, link: pdf
- Mijit Ablimit, Sardar Parhat, Askar Hamdulla, Thomas Fang Zheng, A Multilingual Language Processing Tool for Uyghur, Kazak and Kirghiz, APSIPA 2017, link: pdf
- Aodong Li, Shiyue Zhang, Dong Wang and Thomas Fang Zheng, Enhanced Neural Machine Translation by Learning from Draft, APSIPA 2017, link: pdf
- Lantian Li, Dong Wang, Askar Rozi, Thomas Fang Zheng, Cross-lingual Speaker Verification with Deep Feature Learning, APSIPA 2017, link: arXiv
- Dong Wang, Lantian Li, Zhiyuan Tang, Thomas Fang Zheng, Deep Speaker Verification: Do We Need End to End?, APSIPA 2017, link: arXiv
- Miao Zhang, Yixiang Chen, Lantian Li and Dong Wang, Speaker Recognition with Cough, Laugh and “Wei”, APSIPA 2017, link: arXiv
- Jinghao Yan, Hongzhi Yu, Guanyu Li, "Tibetan acoustic model research based on TDNN", APSIPA ASC 2018