“ASR-nsfc-publication”版本间的差异
来自cslt Wiki
第2行: | 第2行: | ||
# Yunqi Cai, Lantian Li, Andrew Abel, Xiaoyan Zhu, Dong Wang, "Deep Normalization for Speaker Vectors", IEEE Transactions on Audio, Speech and Language Processing, 2020. [https://arxiv.org/pdf/2004.04095.pdf] | # Yunqi Cai, Lantian Li, Andrew Abel, Xiaoyan Zhu, Dong Wang, "Deep Normalization for Speaker Vectors", IEEE Transactions on Audio, Speech and Language Processing, 2020. [https://arxiv.org/pdf/2004.04095.pdf] | ||
# Dong Wang, "A Simulation Study on Optimal Scores for Speaker Recognition", EURASIP Journal on Audio, Speech, and Music Processing, 2020. [http://wangd.cslt.org/public/pdf/nl-eurosip.pdf] | # Dong Wang, "A Simulation Study on Optimal Scores for Speaker Recognition", EURASIP Journal on Audio, Speech, and Music Processing, 2020. [http://wangd.cslt.org/public/pdf/nl-eurosip.pdf] | ||
+ | # Analysis of phonemes and tones confusion rules obtained by ASR,Wireless Networks,2020.[https://link.springer.com/article/10.1007%2Fs11276-019-02220-2 link] | ||
# Zhiyuan Tang, Lantian Li, Dong Wang, Ravichander Vipperla, "Collaborative Joint Training With Multitask Recurrent Model for Speech and Speaker Recognition", IEEE TASLP 2018, vol 25, no.3. [http://ieeexplore.ieee.org/document/7782371 online] | # Zhiyuan Tang, Lantian Li, Dong Wang, Ravichander Vipperla, "Collaborative Joint Training With Multitask Recurrent Model for Speech and Speaker Recognition", IEEE TASLP 2018, vol 25, no.3. [http://ieeexplore.ieee.org/document/7782371 online] | ||
# Zhiyuan Tang,Dong Wang,Yixiang Chen,Lantian Li,Andrew Abel, "Phonetic Temporal Neural Model for Language Identification", IEEE TASLP 2017. [http://ieeexplore.ieee.org/document/8070977 online] | # Zhiyuan Tang,Dong Wang,Yixiang Chen,Lantian Li,Andrew Abel, "Phonetic Temporal Neural Model for Language Identification", IEEE TASLP 2017. [http://ieeexplore.ieee.org/document/8070977 online] | ||
+ | |||
+ | ==Journal papers (EI)== | ||
+ | # Siamese Attention-based LSTM for Speech Emotion Recognition,IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, v E103A, n 7, p 937-941, July 1, 2020 | ||
+ | # Uyghur short-text classification based on reliable sub-word morphology, International Journal of Reasoning-based Intelligent Systems,v 11, n 3, p 250-255, 2019 | ||
+ | # A Robust Morpheme Sequence and Convolutional Neural Network-Based Uyghur and Kazakh Short Text Classification, Information (Switzerland), v 10, n 12, December 1, 2019 | ||
+ | # Investigation of the phonological error rules of Mandarin by Uyghur second language learners,Quarterly Journal of Indian Pulp and Paper Technical Association,v 30, n 1, p 492-500, March 1, 2018 | ||
==Conference papers (EI)== | ==Conference papers (EI)== | ||
第15行: | 第22行: | ||
# Yang Zhang and Lantian Li and Dong Wang, "VAE-based regularization for deep speaker embedding", Interspeech 2019 [https://arxiv.org/abs/1904.03617]. | # Yang Zhang and Lantian Li and Dong Wang, "VAE-based regularization for deep speaker embedding", Interspeech 2019 [https://arxiv.org/abs/1904.03617]. | ||
# Lantian Li,Zhiyuan Tang,Ying Shi,Dong Wang, "Gaussian-Constrained Training for Speaker Verification", ICASSP 2019[https://arxiv.org/abs/1811.03258] | # Lantian Li,Zhiyuan Tang,Ying Shi,Dong Wang, "Gaussian-Constrained Training for Speaker Verification", ICASSP 2019[https://arxiv.org/abs/1811.03258] | ||
+ | # A morpheme sequence and convolutional neural network based Kazakh text classification,2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019, p 1903-1906, November 2019 | ||
+ | # A Comparative Analysis of Acoustic Characteristics between Kazak Uyghur Mandarin Learners and Standard Mandarin Speakers,Proceedings of the 2019 International Conference on Asian Language Processing, IALP 2019, p 474-479, November 2019 | ||
+ | # Statistical Analysis of Syllable Duration of Uyghur Language,Proceedings of the 2019 International Conference on Asian Language Processing, IALP 2019, p 468-473, November 2019 | ||
# Lantian Li,Zhiyuan Tang,Ying Shi,Dong Wang, "Phonetic-Attention Scoring for Deep Speaker Features in Speaker Verification", APSIPA 2019 [https://arxiv.org/pdf/1811.03255.pdf] | # Lantian Li,Zhiyuan Tang,Ying Shi,Dong Wang, "Phonetic-Attention Scoring for Deep Speaker Features in Speaker Verification", APSIPA 2019 [https://arxiv.org/pdf/1811.03255.pdf] | ||
# Lantian Li*,Xueyi Wang*,Dong Wang, "VAE-based Domain Adaptation for Speaker Verification", APSIPA 2019. [https://arxiv.org/pdf/1908.10092.pdf] | # Lantian Li*,Xueyi Wang*,Dong Wang, "VAE-based Domain Adaptation for Speaker Verification", APSIPA 2019. [https://arxiv.org/pdf/1908.10092.pdf] | ||
第26行: | 第36行: | ||
# Miao Zhang, Xiaofei Kang, Yanqing Wang, Lantian Li, Zhiyuan Tang, Haisheng Dai, Dong Wang*, HUMAN AND MACHINE SPEAKER RECOGNITION BASED ON SHORT TRIVIAL EVENT, ICASSP 2018 [https://arxiv.org/pdf/1711.05443 arXiv] | # Miao Zhang, Xiaofei Kang, Yanqing Wang, Lantian Li, Zhiyuan Tang, Haisheng Dai, Dong Wang*, HUMAN AND MACHINE SPEAKER RECOGNITION BASED ON SHORT TRIVIAL EVENT, ICASSP 2018 [https://arxiv.org/pdf/1711.05443 arXiv] | ||
# Jinghao Yan, Hongzhi Yu, Guanyu Li,"Tibetan acoustic model research based on TDNN", APSIPA ASC 2018 | # Jinghao Yan, Hongzhi Yu, Guanyu Li,"Tibetan acoustic model research based on TDNN", APSIPA ASC 2018 | ||
+ | # Acoustic Features of Mandarin Diphthongs by Uyghur Learners at Primary Level,2018 Oriental COCOSDA - International Conference on Speech Database and Assessments, ICSDA 2018 - Proceedings, p 60-66, July 2, 2018, | ||
+ | # Multilingual Stemming and Term extraction for Uyghur, Kazak and Kirghiz,2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2018 - Proceedings, p 587-590, July 2, 2018 | ||
# Lantian Li, Zhiyuan Tang, Dong Wang, Thomas Fang Zheng, FULL-INFO TRAINING FOR DEEP SPEAKER FEATURE LEARNING, ICASSP 2018.[https://arxiv.org/pdf/1711.00366 arXiv] | # Lantian Li, Zhiyuan Tang, Dong Wang, Thomas Fang Zheng, FULL-INFO TRAINING FOR DEEP SPEAKER FEATURE LEARNING, ICASSP 2018.[https://arxiv.org/pdf/1711.00366 arXiv] | ||
# Lantian Li, Dong Wang*, Yixiang Chen, Ying Shing, Zhiyuan Tang, Thomas Fang Zheng, DEEP FACTORIZATION FOR SPEECH SIGNAL, ICASSP 2018 [https://arxiv.org/pdf/1803.00886 arXiv] | # Lantian Li, Dong Wang*, Yixiang Chen, Ying Shing, Zhiyuan Tang, Thomas Fang Zheng, DEEP FACTORIZATION FOR SPEECH SIGNAL, ICASSP 2018 [https://arxiv.org/pdf/1803.00886 arXiv] | ||
第44行: | 第56行: | ||
# Dong Wang, Lantian Li, Zhiyuan Tang, Thomas Fang Zheng, Deep Speaker Verification: Do We Need End to End?, APSIPA 2017, link: [https://arxiv.org/abs/1706.07859 arXiv] | # Dong Wang, Lantian Li, Zhiyuan Tang, Thomas Fang Zheng, Deep Speaker Verification: Do We Need End to End?, APSIPA 2017, link: [https://arxiv.org/abs/1706.07859 arXiv] | ||
# Miao Zhang, Yixiang Chen, Lantian Li and Dong Wang, Speaker Recognition with Cough, Laugh and “Wei”, APSIPA 2017, link: [https://arxiv.org/abs/1706.07860 arXiv] | # Miao Zhang, Yixiang Chen, Lantian Li and Dong Wang, Speaker Recognition with Cough, Laugh and “Wei”, APSIPA 2017, link: [https://arxiv.org/abs/1706.07860 arXiv] | ||
+ | # A rule and statistical modeling based stem extraction method for Kazakh words,Proceedings of the 2017 International Conference on Asian Language Processing, IALP 2017, v 2018-January, p 231-234, July 2, 2017 | ||
+ | |||
+ | ===Other papers=== | ||
+ | # 沙尔旦尔·帕尔哈提,米吉提·阿不里米提,艾斯卡尔·艾木都拉. 基于稳健词素序列和LSTM的维吾尔语短文本分类[J]. 中文信息学报,2020,34(01):63-70. | ||
+ | # 沙尔旦尔·帕尔哈提,米吉提·阿不里米提,艾斯卡尔·艾木都拉. 词干单元和卷积神经网络的哈萨克短文本分类[J]. 小型微型计算机系统,2020,41(08):1627-1633. | ||
+ | # 维-哈-柯多语言词素切分集成环境研究[J]. 电视技术,2020,44(06):46-51+63. |
2020年12月18日 (五) 12:50的版本
Journal papers (SCI)
- Yunqi Cai, Lantian Li, Andrew Abel, Xiaoyan Zhu, Dong Wang, "Deep Normalization for Speaker Vectors", IEEE Transactions on Audio, Speech and Language Processing, 2020. [1]
- Dong Wang, "A Simulation Study on Optimal Scores for Speaker Recognition", EURASIP Journal on Audio, Speech, and Music Processing, 2020. [2]
- Analysis of phonemes and tones confusion rules obtained by ASR,Wireless Networks,2020.link
- Zhiyuan Tang, Lantian Li, Dong Wang, Ravichander Vipperla, "Collaborative Joint Training With Multitask Recurrent Model for Speech and Speaker Recognition", IEEE TASLP 2018, vol 25, no.3. online
- Zhiyuan Tang,Dong Wang,Yixiang Chen,Lantian Li,Andrew Abel, "Phonetic Temporal Neural Model for Language Identification", IEEE TASLP 2017. online
Journal papers (EI)
- Siamese Attention-based LSTM for Speech Emotion Recognition,IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, v E103A, n 7, p 937-941, July 1, 2020
- Uyghur short-text classification based on reliable sub-word morphology, International Journal of Reasoning-based Intelligent Systems,v 11, n 3, p 250-255, 2019
- A Robust Morpheme Sequence and Convolutional Neural Network-Based Uyghur and Kazakh Short Text Classification, Information (Switzerland), v 10, n 12, December 1, 2019
- Investigation of the phonological error rules of Mandarin by Uyghur second language learners,Quarterly Journal of Indian Pulp and Paper Technical Association,v 30, n 1, p 492-500, March 1, 2018
Conference papers (EI)
- Ying Shi, Haolin Chen, Zhiyuan Tang, Lantian Li, Dong Wang, Jiqing Han, Can We Trust Deep Speech Prior?, SLT 2021[3]
- Zheng Li, Miao Zhao, Qingyang Hong, Lin Li, Zhiyuan Tang, Dong Wang, Liming Song, Cheng Yang, "AP20-OLR Challenge: Three Tasks and TheirBaselines", APSIPA 2020. [4]
- Jiawen Kang,Ruiqi Liu,Lantian Li,Yunqi Cai,Dong Wang,Thomas Fang Zheng, "Domain-Invariant Speaker Vector Projection by Model-Agnostic Meta-Learning", Interspeech 2020. [5]
- Sitong Cheng,Zhixin Liu,Lantian Li,Zhiyuan Tang,Dong Wang,Thomas Fang Zheng, "ASR-Free Pronunciation Assessment", Interspeech 2020. [6]
- Lantian Li,Dong Wang,Thomas Fang Zheng, "Neural Discriminant Analysis for Deep Speaker Embedding", Interspeech 2020. [7]
- Yue Fan, Jiawen Kang, Lantian Li, Kaicheng Li, Haolin Chen, Sitong Cheng, Pengyuan Zhang, Ziya Zhou, Yunqi Cai, Dong Wang, "CN-CELEB: A Challenging Chinese Speaker Recognition Dataset", ICASSP 2020. [8]
- Wupeng Wang, Chao Xing, Dong Wang, Xiao Chen, Fengyu Sun, "A ROBUST AUDIO-VISUAL SPEECH ENHANCEMENT MODEL", ICASSP 2020, [9]
- Yang Zhang and Lantian Li and Dong Wang, "VAE-based regularization for deep speaker embedding", Interspeech 2019 [10].
- Lantian Li,Zhiyuan Tang,Ying Shi,Dong Wang, "Gaussian-Constrained Training for Speaker Verification", ICASSP 2019[11]
- A morpheme sequence and convolutional neural network based Kazakh text classification,2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019, p 1903-1906, November 2019
- A Comparative Analysis of Acoustic Characteristics between Kazak Uyghur Mandarin Learners and Standard Mandarin Speakers,Proceedings of the 2019 International Conference on Asian Language Processing, IALP 2019, p 474-479, November 2019
- Statistical Analysis of Syllable Duration of Uyghur Language,Proceedings of the 2019 International Conference on Asian Language Processing, IALP 2019, p 468-473, November 2019
- Lantian Li,Zhiyuan Tang,Ying Shi,Dong Wang, "Phonetic-Attention Scoring for Deep Speaker Features in Speaker Verification", APSIPA 2019 [12]
- Lantian Li*,Xueyi Wang*,Dong Wang, "VAE-based Domain Adaptation for Speaker Verification", APSIPA 2019. [13]
- Jiayao Wu, Zhiyuan Tang and Dong Wang, "Structure Growth for Small-Footprint Speech Recognition", APSIPA 2019. [14]
- Zhiyuan Tang, Dong Wang, Liming Song, "AP19-OLR Challenge: Three Tasks and Their Baselines", APSIPA 2019. [15]
- Yunqi Cai, Dong Wang, "Question Mark Prediction By Bert", APSIPA 2019 [16]
- Jiyuan Zhang,Dong Wang, "Chinese Poetry Generation with Flexible Styles", ISCSLP 2018[17].
- Jiyuan Zhang,Zheling Zhang,Shiyue Zhang, Dong Wang,"VV-COUPLET: AN OPEN SOURCE CHINESE COUPLET GENERATION SYSTEM", APSIPA 2018. [18]
- Zhiyuan Tang,Dong Wang,Qing Chen, "AP18-OLR CHALLENGE: THREE TASKS AND THEIR BASELINES",APSIPA 2018.[19]
- Ying Shi,Zhiyuan Tang, Lantian Li,Zheling Zhang,Dong Wang, "MAP AND RELABEL: TOWARDS ALMOST-ZERO RESOURCE SPEECH RECOGNITION",APSIPA 2018.[20]
- Miao Zhang, Xiaofei Kang, Yanqing Wang, Lantian Li, Zhiyuan Tang, Haisheng Dai, Dong Wang*, HUMAN AND MACHINE SPEAKER RECOGNITION BASED ON SHORT TRIVIAL EVENT, ICASSP 2018 arXiv
- Jinghao Yan, Hongzhi Yu, Guanyu Li,"Tibetan acoustic model research based on TDNN", APSIPA ASC 2018
- Acoustic Features of Mandarin Diphthongs by Uyghur Learners at Primary Level,2018 Oriental COCOSDA - International Conference on Speech Database and Assessments, ICSDA 2018 - Proceedings, p 60-66, July 2, 2018,
- Multilingual Stemming and Term extraction for Uyghur, Kazak and Kirghiz,2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2018 - Proceedings, p 587-590, July 2, 2018
- Lantian Li, Zhiyuan Tang, Dong Wang, Thomas Fang Zheng, FULL-INFO TRAINING FOR DEEP SPEAKER FEATURE LEARNING, ICASSP 2018.arXiv
- Lantian Li, Dong Wang*, Yixiang Chen, Ying Shing, Zhiyuan Tang, Thomas Fang Zheng, DEEP FACTORIZATION FOR SPEECH SIGNAL, ICASSP 2018 arXiv
- Dong Wang, Thomas Fang Zheng, Zhiyuan Tang, Ying Shi, Lantian Li, Shiyue Zhang Hongzhi Yu, Guanyu Li, Shipeng Xu, Askar Hummdulla, Mijit Ablimit, Gulnigar Mahmut, M2ASR: AMBITIONS AND FIRST YEAR PROGRESS, O-COCOSDA 2017. pdf
- Yang Feng, Shiyue Zhang, Andy Zhang, Dong Wang and Andrew Abel, Memory-augmented Neural Machine Translation, EMNLP 2017 [21] .
- Lantian Li, Yixiang Chen, Dong Wang, Thomas Fang Zheng, A Study on Replay Attack and Anti-Spoofing for Automatic Speaker Verification, Interspeech 2017 [22].
- Lantian Li, Yixiang Chen, Ying Shi, Zhiyuan Tang, Dong Wang, "Deep Speaker Feature Learning for Text-independent Speaker Verification", Interspeech 2017[23].
- Jiyuan Zhang, Yang Feng, Dong Wang, Yang Wang, Andrw Abel, Shiyue Zhang, Andi Zhangi, "Flexible and Creative Chinese Poetry Generation Using Neural Memory", ACL 2017 [24]
- Zhiyuan Tang, Ying Shi, Dong Wang, Yang Feng, and Shiyue Zhang, "Memory Visualization for Gated Recurrent Neural Networks in Speech Recognition", ICASSP 2017.[25]
- Zhiyuan Tang, Dong Wang, Yixiang Chen, Qing Chen, AP17-OLR Challenge: Data, Plan, and Baseline, APSIPA 2017, link: arXiv
- Shiyue Zhang, Gulnigar Mahmut, Dong Wang, Askar Hamdulla, Memory-augmented Chinese-Uyghur Neural Machine Translation, APSIPA 2017, link: arXiv
- Shipeng Xu , Hongzhi Yu, Thomas Fang Zheng and Jinghao Yan, Language Resource Construction for Mongolian, APSIPA 2017, pdf
- Guanyu Li, Hongzhi Yu, Thomas Fang Zheng, Jinghao Yan, Free Linguistic and Speech Resources for Tibetan, APSIPA 2017, link: pdf
- Ying Shi, Askar Hamdulla, Zhiyuan Tang, Dong Wang, Thomas Fang Zheng, A Free Kazak Speech Database and a Speech Recognition Baseline, APSIPA 2017, link: pdf
- Mijit Ablimit, Sardar Parhat, Askar Hamdulla, Thomas Fang Zheng , A Multilingual Language Processing Tool for Uyghur, Kazak and Kirghiz, APSIPA 2017, link: pdf
- Aodong Li, Shiyue Zhangy, Dong Wangz and Thomas Fang Zheng, Enhanced Neural Machine Translation by Learning from Draft, APSIPA 2017, link: pdf
- Lantian Li, Dong Wang, Askar Rozi, Thomas Fang Zheng, Cross-lingual Speaker Verification with Deep Feature Learning, APSIPA 2017, link: arXiv
- Dong Wang, Lantian Li, Zhiyuan Tang, Thomas Fang Zheng, Deep Speaker Verification: Do We Need End to End?, APSIPA 2017, link: arXiv
- Miao Zhang, Yixiang Chen, Lantian Li and Dong Wang, Speaker Recognition with Cough, Laugh and “Wei”, APSIPA 2017, link: arXiv
- A rule and statistical modeling based stem extraction method for Kazakh words,Proceedings of the 2017 International Conference on Asian Language Processing, IALP 2017, v 2018-January, p 231-234, July 2, 2017
Other papers
- 沙尔旦尔·帕尔哈提,米吉提·阿不里米提,艾斯卡尔·艾木都拉. 基于稳健词素序列和LSTM的维吾尔语短文本分类[J]. 中文信息学报,2020,34(01):63-70.
- 沙尔旦尔·帕尔哈提,米吉提·阿不里米提,艾斯卡尔·艾木都拉. 词干单元和卷积神经网络的哈萨克短文本分类[J]. 小型微型计算机系统,2020,41(08):1627-1633.
- 维-哈-柯多语言词素切分集成环境研究[J]. 电视技术,2020,44(06):46-51+63.