Difference between revisions of "Delivery-2016"
From cslt Wiki
(→Patent) |
(→Author Statistics for TRP) |
||
(36 intermediate revisions by 3 users not shown) | |||
Line 1: | Line 1: | ||
=Technical report= | =Technical report= | ||
+ | #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/5/53/TRP-20160039.pdf TRP-20160039 Speaker Segmentation Using Deep Speaker Vectors for Fast Speaker Change Scenarios, Renyu Wang, Mingliang Gu, Lantian Li, Mingxing Xu, Thomas Fang Zheng] | ||
+ | #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/0/06/TRP-20160038.pdf TRP-20160038 Overview of Biometric Recognition Technology (in Chinese), Thomas Fang Zheng, Askar Rozi, Renyu Wang, Lantian Li] | ||
+ | #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/3/3d/TRP-20160037.pdf TRP-20160037 Overview of Voiceprint Recognition Technology and Applications (in Chinese), Thomas Fang Zheng, Lantian Li, Hui Zhang, Askar Rozi] | ||
+ | #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/5/5f/Dtq.pdf TRP-20160036 Deep Q-trading, Yang Wang, Dong Wang, Shiyue Zhang, Yang Feng, Shiyao Li, and Qiang Zhou] | ||
+ | #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/9/92/Moses%E6%93%8D%E4%BD%9C%E6%89%8B%E5%86%8C--%E5%86%AF%E6%B4%8B.pdf TRP-20160035 Moses Operation Manual (in Chinese), Yang Feng] | ||
+ | #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/3/3d/CCF-ASR.pdf TRP-20160034 The Present and Future of Speech Recognition, Dong Wang] | ||
+ | #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/a/a2/Memory.pdf TRP-20160033 Memoryless Document Vector, Dongxu Zhang, Dong Wang] | ||
+ | #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/7/7a/Turing.pdf TRP-20160032 Can Machine Generate Traditional Chinese Poetry? A Turing Test, Qixin Wang, Tianyi Luo, Dong Wang] | ||
+ | #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/c/ce/TRP-20160031.pdf TRP-20160031 OC16-CE80: A Chinese-English Mixlingual Database and A Speech Recognition Baseline, Dong Wang, Zhiyuan Tang, Difei Tang and Qing Chen] | ||
+ | #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/9/9e/TRP-20160030.pdf TRP-20160030 Collaborative Joint Training with Multi-task Recurrent Model for Speech and Speaker Recognition, Zhiyuan Tang, Lantian Li, Dong Wang and Ravichander Vipperla] | ||
+ | #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/0/09/TRP-20160029.pdf TRP-20160029 Multi-task Recurrent Model for Speech and Speaker Recognition, Zhiyuan Tang, Lantian Li and Dong Wang] | ||
+ | #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/5/55/TRP-20160028.pdf TRP-20160028 Multi-task Recurrent Model for True Multilingual Speech Recognition, Zhiyuan Tang, Lantian Li and Dong Wang] | ||
#[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/2/27/TRP-20160027.pdf TRP-20160027 Collaborative Learning for Language and Speaker Recognition, Lantian Li, Zhiyuan Tang, Dong Wang, Yang Feng and Shiyue Zhang] | #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/2/27/TRP-20160027.pdf TRP-20160027 Collaborative Learning for Language and Speaker Recognition, Lantian Li, Zhiyuan Tang, Dong Wang, Yang Feng and Shiyue Zhang] | ||
#[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/9/9e/TRP-20160026.pdf TRP-20160026 Weakly Supervised PLDA Training, Lantian Li, Dong Wang, Yixiang Chen and Chenghui Zhao] | #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/9/9e/TRP-20160026.pdf TRP-20160026 Weakly Supervised PLDA Training, Lantian Li, Dong Wang, Yixiang Chen and Chenghui Zhao] | ||
Line 10: | Line 22: | ||
#[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/7/76/TRP-20160020.pdf TRP-20160020 Feature Transformation For Speaker Verification Under Speaking Rate Mismatch Condition, Askar Rozi, Lantian Li, Dong Wang and Thomas Fang Zheng] | #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/7/76/TRP-20160020.pdf TRP-20160020 Feature Transformation For Speaker Verification Under Speaking Rate Mismatch Condition, Askar Rozi, Lantian Li, Dong Wang and Thomas Fang Zheng] | ||
#[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/c/c9/TRP-20160019.pdf TRP-20160019 Language-aware PLDA for Multilingual Speaker Recognition, Askar Rozi, Dong Wang, Lantian Li and Thomas Fang Zheng] | #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/c/c9/TRP-20160019.pdf TRP-20160019 Language-aware PLDA for Multilingual Speaker Recognition, Askar Rozi, Dong Wang, Lantian Li and Thomas Fang Zheng] | ||
− | #[TRP-20160018] | + | #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/b/b5/Ijcai16.pdf TRP-20160018 Chinese Song Iambics Generation with Neural Attention-based Model, Qixin Wang, Tianyi Luo, Dong Wang] |
#[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/7/7e/Nnet3_config.pdf TRP-20160017 How to Config Kaldi nnet3 (in Chinese), Zhiyuan Tang and Dong Wang] | #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/7/7e/Nnet3_config.pdf TRP-20160017 How to Config Kaldi nnet3 (in Chinese), Zhiyuan Tang and Dong Wang] | ||
#[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/e/e6/Joint_training_config.pdf TRP-20160016 How to deploy joint training in Kaldi (in Chinese), Hang Luo, Zhiyuan Tang and Dong Wang] | #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/e/e6/Joint_training_config.pdf TRP-20160016 How to deploy joint training in Kaldi (in Chinese), Hang Luo, Zhiyuan Tang and Dong Wang] | ||
Line 28: | Line 40: | ||
#[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/3/39/How_to_deal_with_low_frequency_words.pdf TRP-20160002: Low-Frequency Words Embedding, Chao Xing, Yiqiao Pan, Dong Wang] | #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/3/39/How_to_deal_with_low_frequency_words.pdf TRP-20160002: Low-Frequency Words Embedding, Chao Xing, Yiqiao Pan, Dong Wang] | ||
#[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/b/b6/Max-margin.pdf TRP-20160001: Max-margin metric learning for speaker recognition, Lantian Li, Chao Xing, Dong Wang] | #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/b/b6/Max-margin.pdf TRP-20160001: Max-margin metric learning for speaker recognition, Lantian Li, Chao Xing, Dong Wang] | ||
+ | |||
+ | =Author Statistics for TRP= | ||
+ | {|class="wikitable" | ||
+ | ! rank !! name !! TRP number !! first author number | ||
+ | |- | ||
+ | | 1 || Dong Wang || 32 || 6 | ||
+ | |- | ||
+ | | 2 || Lantian Li || 19 || 8 | ||
+ | |- | ||
+ | | 3 || Zhiyuan Tang || 9 || 5 | ||
+ | |- | ||
+ | | 4 || Thomas Fang Zheng || 8 || 2 | ||
+ | |- | ||
+ | | 5 || Yang Feng || 5 || 1 | ||
+ | |- | ||
+ | | 6 || Tianyi Luo || 3 || 1 | ||
+ | |- | ||
+ | | 7 || Shiyue Zhang || 3 || 1 | ||
+ | |- | ||
+ | | 8 || Chao Xing || 2 || 1 | ||
+ | |- | ||
+ | | 9 || Ying Shi || 2 || 1 | ||
+ | |- | ||
+ | | 10 || Qixin Wang || 2 || 2 | ||
+ | |- | ||
+ | | 11 || Difei Tang || 2 || 0 | ||
+ | |- | ||
+ | | 12 || Askar Rozi || 5 || 2 | ||
+ | |- | ||
+ | | 13 || Qiang Zhou || 2 || 1 | ||
+ | |- | ||
+ | | 14 || Yixiang Chen || 2 || 0 | ||
+ | |- | ||
+ | | 15 || Chenghui Zhao || 2 || 1 | ||
+ | |- | ||
+ | | 16 || Javier Tejedor || 2 || 0 | ||
+ | |- | ||
+ | | 17 || Qing Chen || 2 || 0 | ||
+ | |- | ||
+ | | 18 || Nurbolat || 1 || 0 | ||
+ | |- | ||
+ | | 19 || Hang Luo || 1 || 1 | ||
+ | |- | ||
+ | | 20 || Renyu Wang || 1 || 0 | ||
+ | |- | ||
+ | | 21 || Chenhao Zhang || 1 || 0 | ||
+ | |- | ||
+ | | 22 || Askar Humdulla || 1 || 0 | ||
+ | |- | ||
+ | | 23 || Xi Ma || 1 || 1 | ||
+ | |- | ||
+ | | 24 || Guozhen Zhao || 1 || 0 | ||
+ | |- | ||
+ | | 25 || April Pu || 1 || 0 | ||
+ | |- | ||
+ | | 26 || Yiqiao Pan || 1 || 0 | ||
+ | |- | ||
+ | | 27 || Gang Wang || 1 || 0 | ||
+ | |- | ||
+ | | 28 || Jiyuan Zhang || 1 || 1 | ||
+ | |- | ||
+ | | 29 || Caixia Wang || 1 || 0 | ||
+ | |- | ||
+ | | 30 || Ravichander Vipperla || 1 || 0 | ||
+ | |- | ||
+ | | 31 || Shiyao Li || 1 || 0 | ||
+ | |- | ||
+ | | 32 || Rayilam Parhat || 1 || 0 | ||
+ | |- | ||
+ | | 33 || Ralph Grishman || 1 || 0 | ||
+ | |- | ||
+ | | 34 || Yang Wang || 1 || 1 | ||
+ | |- | ||
+ | | 35 || Dongxu Zhang || 1 || 1 | ||
+ | |- | ||
+ | |} | ||
+ | |||
+ | =Book= | ||
+ | |||
+ | #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/6/61/Book.pdf Introduction to Modern Machine Learning Techniques (in Chinese)] | ||
+ | |||
+ | =Paper= | ||
+ | |||
+ | Journal: | ||
+ | |||
+ | #Zhiyuan Tang, Lantian Li, Dong Wang, and Ravichander Vipperla, "Collaborative Joint Training with Multi-task Recurrent Model for Speech and Speaker Recognition", IEEE/ACM Transactions on Audio, Speech, and Language Processing. Preprint, 2016. (DOI: 10.1109/TASLP.2016.2639323) | ||
+ | #Linlin Wang, Jun Wang, Lantian Li, Thomas Fang Zheng, Frank K. Soong, "Improving Speaker Verification Performance against Long-Term Speaker Variability", Speech Communication, 79 (2016), 14-29, Mar. 2016. | ||
+ | #Lantian Li, Dong Wang, Chenhao Zhang, Thomas Fang Zheng, "Improving Short Utterance Speaker Recognition by Modeling Speech Unit Classes", In IEEE/ACM Transactions on Audio, Speech, and Language Processing (Volume:PP, Issue:99) DOI:10.1109/TASLP 2016. | ||
+ | #Thomas Fang Zheng, Rozi Askar, Renyu Wang, Lantian Li, "Overview of Biometric Recognition Technology", Journal of Information Security Research, 2(1): 12-26, Jan. 2016. | ||
+ | #Thomas Fang Zheng, Lantian Li, Hui Zhang, Rozi Askar, "Overview of Voiceprint Recognition Technology and Applications", Journal of Information Security Research, 2(1): 44-57, Jan. 2016. | ||
+ | #Xi Ma, Dong Wang, Javier Tejedor, "Similar Word Model for Unfrequent Word Enhancement in Speech Recognition", IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 24, no. 10, 2016. | ||
+ | |||
+ | Conference: | ||
+ | |||
+ | #Dongxu Zhang, Dong Wang, "Relation Classification: CNN or RNN?", NLPCC-ICCPOL 2016 | ||
+ | #Dongxu Zhang, Tianyi Luo, Dong Wang, "Learning from LDA using Deep Neural Networks", NLPCC-ICCPOL 2016 | ||
+ | #Askar Rozi, Dong Wang, Lantian Li, Thomas Fang Zheng, "Language-aware PLDA for Multilingual Speaker Recognition", O-COCOSDA 2016 (best student paper) | ||
+ | #Dong Wang, Zhiyuan Tang, Difei Tang, Qing Chen, "OC16-CE80: A Chinese-English Mixlingual Database and A Speech Recognition Baseline", O-COCOSDA 2016 (best paper) | ||
+ | #Chenghui Zhao, Lantian Li, Dong Wang and April Pu, "Local Training for PLDA in Speaker Verification", O-COCOSDA 2016 | ||
+ | #Lantian Li, Dong Wang, Thomas Fang Zheng, "Max-Margin Metric Learning for Speaker Recognition", ISCSLP 2016 | ||
+ | #Lantian Li, Dong Wang, Thomas Fang Zheng, "Binary Speaker Embedding", ISCSLP 2016 | ||
+ | #Lantian Li, Dong Wang, Xiaodong Zhang, Thomas Fang Zheng, Panshi Jin, "System Combination for Short Utterance Speaker Recognition", APSIPA 2016 | ||
+ | #Zhiyuan Tang, Lantian Li, Dong Wang, "Multi-task Recurrent Model for Speech and Speaker Recognition", APSIPA 2016 | ||
+ | #Askar Rozi, Dong Wang, Lantian Li, Thomas Fang Zheng, "Feature Transformation For Speaker Verification Under Speaking Rate Mismatch Condition", APSIPA 2016 | ||
+ | #Aiting Liu, Chao Xing, Yang Feng, Dong Wang, "Learning Ordered Word Representations", APSIPA 2016 | ||
+ | #Dong Wang, Lantian Li, Difei Tang, Qing Chen, "AP16-OL7: A Multilingual Database for Oriental Languages and A Language Recognition Baseline", APSIPA 2016 | ||
+ | #Zhiyuan Tang, Lantian Li, Dong Wang, "Multi-task Recurrent Model for True Multilingual Speech Recognition", APSIPA 2016 | ||
+ | #Qixin Wang, Tianyi Luo, Dong Wang, "Can Machine Generate Traditional Chinese Poetry? A Feigenbaum Test", BICS 2016 | ||
+ | #Dong Wang, Qiang Zhou, Amir Hussain, "Deep and Sparse Learning in Speech and Language Processing: An Overview", BICS 2016 | ||
+ | #Qixin Wang, Tianyi Luo, Dong Wang, Chao Xing, "Chinese Song Iambics Generation with Neural Attention-based Model", IJCAI 2016 | ||
+ | #Zhiyuan Tang, Dong Wang, Zhiyong Zhang, "Recurrent Neural Network Training with Dark Knowledge Transfer", ICASSP 2016 | ||
+ | #Mian Wang, Dong Wang, "vMF-SNE: Embedding for Spherical Data", ICASSP 2016 | ||
+ | #Renyu Wang, Mingliang Gu, Lantian Li, Mingxing Xu, Thomas Fang Zheng, "Speaker Segmentation Using Deep Speaker Vectors for Fast Speaker Change Scenarios", ICASSP 2017 | ||
=Patent= | =Patent= | ||
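− | #郑方 李蓝天 邬晓钧 别凡虎 王军 Method and apparatus for speech replay detection [D-ear] [[媒体文件: | + | #郑方 李蓝天 邬晓钧 别凡虎 王军 Method and apparatus for speech replay detection 2016100073590 [D-ear] [[媒体文件:2016100073590.pdf|disclosure document]] |
− | #郑方 李蓝天 邬晓钧 王刚 刘乐 Identity authentication method and system based on voiceprint recognition, face recognition, and synchronous liveness detection [D-ear] [[媒体文件: | + | #郑方 李蓝天 邬晓钧 王刚 刘乐 Identity authentication method and system based on voiceprint recognition, face recognition, and synchronous liveness detection 2015108119085 [D-ear] [[媒体文件:2015108119085.pdf|disclosure document]] |
#王东 邢超 张之勇 赵梦原 A speech synthesis method for mixed languages [FreeNeb] [[媒体文件:一种中英文混合的语音合成方法.docx|disclosure document]] | #王东 邢超 张之勇 赵梦原 A speech synthesis method for mixed languages [FreeNeb] [[媒体文件:一种中英文混合的语音合成方法.docx|disclosure document]] | ||
#王东 张之勇 赵梦原 黄伟明 李国强, A training method for Japanese speech recognition systems [同方] [[媒体文件:一种日语语音识别系统训练方法.docx|disclosure document]] | #王东 张之勇 赵梦原 黄伟明 李国强, A training method for Japanese speech recognition systems [同方] [[媒体文件:一种日语语音识别系统训练方法.docx|disclosure document]] | ||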
Line 81: | Line 206: | ||
{| class="wikitable" | {| class="wikitable" | ||
! name !! type!! size !! dir !! description | ! name !! type!! size !! dir !! description | ||
+ | |- | ||
+ | |ASVspoof 2017||wav||348M||corpora/lilt/ASVspoof 2017|| ASVspoof 2017 data (from INTERSPEECH) collected by Lantian Li | ||
+ | |- | ||
+ | |CSLT_China300||wav||8.8G||corpora/lilt/CSLT_China300|| Data from 300 Chinese speakers for SRE, collected by Lantian Li | ||
+ | |- | ||
+ | |CSLT_Replay||wav||13G||corpora/lilt/CSLT_Replay|| CSLT replay spoofing data collected by Lantian Li | ||
+ | |- | ||
+ | |CSLT_Digit||wav||1.3G||corpora/lilt/CSLT_Digit|| Digit string data for SRE collected by Lantian Li | ||
+ | |- | ||
+ | |Idiap_avspoof||wav||21G||corpora/lilt/Idiap_avspoof|| Idiap Avspoof data collected by Lantian Li | ||
+ | |- | ||
+ | |RedDots||wav||1.2G||corpora/lilt/RedDots|| RedDots data for SRE collected by Lantian Li | ||
+ | |- | ||
+ | |SITW||wav||19G||corpora/lilt/SITW|| SITW data for SRE collected by Lantian Li | ||
+ | |- | ||
+ | |VCTK||wav||11G||corpora/lilt/VCTK|| VCTK data for ASR and SRE collected by Lantian Li | ||
+ | |- | ||
+ | |VoxForge||wav||11G||corpora/lilt/VoxForge|| VoxForge data for ASR and SRE collected by Lantian Li | ||
|- | |- | ||
|lyric||text||-||corpora/art/lyric|| song lyric data collected by Jiyuan Zhang | |lyric||text||-||corpora/art/lyric|| song lyric data collected by Jiyuan Zhang | ||
Line 95: | Line 238: | ||
{| class="wikitable" | {| class="wikitable" | ||
! Code Name!! Author !! Description | ! Code Name!! Author !! Description | ||
+ | |- | ||
+ | |THCHS30||Wang Dong, Zhang Xuewei|| THCHS30 recipe [https://github.com/kaldi-asr/kaldi link] | ||
+ | |- | ||
+ | |THUYG20||Wang Dong, Zhang Xuewei|| THUYG20 recipe [https://github.com/wangdong99/kaldi link] | ||
|- | |- | ||
|viewExp||Wang Yang|| The experiment platform of the Finance team [http://git.cslt.org/finance/viewexp/tree/master link] | |viewExp||Wang Yang|| The experiment platform of the Finance team [http://git.cslt.org/finance/viewexp/tree/master link] | ||
|- | |- | ||
|vvBeam||Wang Dong|| Beamforming using superdirective array [http://git.cslt.org/speech/beam link] | |vvBeam||Wang Dong|| Beamforming using superdirective array [http://git.cslt.org/speech/beam link] | ||
+ | |- | ||
+ | |vvEngine||Zhiyong Zhang|| ASR Engine [http://git.cslt.org/zhangzy/freepulsar link] | ||
+ | |- | ||
+ | |vvPoem||Jiyuan Zhang|| Vivi Poem generation [http://git.cslt.org/vivi/vvpoem link] | ||
+ | |- | ||
+ | |vvQA||Chao Xing|| QA system [http://cslt.riit.tsinghua.edu.cn/mediawiki/index.php/FreeNeb_ViviQA_Release link] | ||
|- | |- | ||
|} | |} | ||
− |
Latest revision as of 02:59, 8 January 2017 (Sunday)
Technical report
- TRP-20160039 Speaker Segmentation Using Deep Speaker Vectors for Fast Speaker Change Scenarios, Renyu Wang, Mingliang Gu, Lantian Li, Mingxing Xu, Thomas Fang Zheng
- TRP-20160038 Overview of Biometric Recognition Technology (in Chinese), Thomas Fang Zheng, Askar Rozi, Renyu Wang, Lantian Li
- TRP-20160037 Overview of Voiceprint Recognition Technology and Applications (in Chinese), Thomas Fang Zheng, Lantian Li, Hui Zhang, Askar Rozi
- TRP-20160036 Deep Q-trading, Yang Wang, Dong Wang, Shiyue Zhang, Yang Feng, Shiyao Li, and Qiang Zhou
- TRP-20160035 Moses Operation Manual (in Chinese), Yang Feng
- TRP-20160034 The Present and Future of Speech Recognition, Dong Wang
- TRP-20160033 Memoryless Document Vector, Dongxu Zhang, Dong Wang
- TRP-20160032 Can Machine Generate Traditional Chinese Poetry? A Turing Test, Qixin Wang, Tianyi Luo, Dong Wang
- TRP-20160031 OC16-CE80: A Chinese-English Mixlingual Database and A Speech Recognition Baseline, Dong Wang, Zhiyuan Tang, Difei Tang and Qing Chen
- TRP-20160030 Collaborative Joint Training with Multi-task Recurrent Model for Speech and Speaker Recognition, Zhiyuan Tang, Lantian Li, Dong Wang and Ravichander Vipperla
- TRP-20160029 Multi-task Recurrent Model for Speech and Speaker Recognition, Zhiyuan Tang, Lantian Li and Dong Wang
- TRP-20160028 Multi-task Recurrent Model for True Multilingual Speech Recognition, Zhiyuan Tang, Lantian Li and Dong Wang
- TRP-20160027 Collaborative Learning for Language and Speaker Recognition, Lantian Li, Zhiyuan Tang, Dong Wang, Yang Feng and Shiyue Zhang
- TRP-20160026 Weakly Supervised PLDA Training, Lantian Li, Dong Wang, Yixiang Chen and Chenghui Zhao
- TRP-20160025 Local Training for PLDA in Speaker Verification, Chenghui Zhao, Lantian Li, Dong Wang and April Pu
- TRP-20160024 Decision Making Based on Cohort Scores for Speaker Verification, Lantian Li, Renyu Wang, Gang Wang, Caixia Wang and Thomas Fang Zheng
- TRP-20160023 AP16-OL7: A Multilingual Database for Oriental Languages and A Language Recognition Baseline, Dong Wang, Lantian Li, Difei Tang and Qing Chen
- TRP-20160022 System Combination for Short Utterance Speaker Recognition, Lantian Li, Dong Wang and Thomas Fang Zheng
- TRP-20160021 Improving Short Utterance Speaker Recognition by Modeling Speech Unit Classes, Lantian Li, Dong Wang, Chenhao Zhang and Thomas Fang Zheng
- TRP-20160020 Feature Transformation For Speaker Verification Under Speaking Rate Mismatch Condition, Askar Rozi, Lantian Li, Dong Wang and Thomas Fang Zheng
- TRP-20160019 Language-aware PLDA for Multilingual Speaker Recognition, Askar Rozi, Dong Wang, Lantian Li and Thomas Fang Zheng
- TRP-20160018 Chinese Song Iambics Generation with Neural Attention-based Model, Qixin Wang, Tianyi Luo, Dong Wang
- TRP-20160017 How to Config Kaldi nnet3 (in Chinese), Zhiyuan Tang and Dong Wang
- TRP-20160016 How to deploy joint training in Kaldi (in Chinese), Hang Luo, Zhiyuan Tang and Dong Wang
- TRP-20160015 How to run ASR system for Kazak (in Chinese), Ying Shi, Zhiyuan Tang, Nurbolat and Dong Wang
- TRP-20160014 Exploring The Role of Deep Speaker Features for Speaker Verification, Lantian Li and Dong Wang
- TRP-20160013 Sparse Discriminative Analysis and Its Application in Distraction Classification, Dong Wang
- TRP-20160012 i-vector system in Kaldi (in Chinese), Yixiang Chen, Lantian Li and Dong Wang
- TRP-20160011 An Exploration of Several Replay Detection Methods Based on Speaker and Channel Correlation (in Chinese), Lantian Li, Yixiang Chen and Dong Wang
- TRP-20160010 Highly Restricted Keyword Selection Based on Sparse Analysis for Uyghur Text Categorization, Dong Wang, Askar Humdulla, Rayilam Parhat, Javier Tejedor
- TRP-20160009: RNNG Code User Guide, Shiyue Zhang and Yang Feng
- TRP-20160008: Different styles of poetry generation based on memory model, Jiyuan Zhang, Yang Feng and Dong Wang
- TRP-20160007: Distraction Detection Using Sparse Discriminative Analysis, Dong Wang and Guozhen Zhao
- TRP-20160006: Visualization Analysis for Recurrent Networks, Zhiyuan Tang, Ying Shi and Dong Wang
- TRP-20160005: Distributed Representation Learning for Knowledge Graphs with Entity Descriptions, Miao Fan, Qiang Zhou, Thomas Fang Zheng, Ralph Grishman
- TRP-20160004: A Review of Neural QA, Tianyi Luo and Dong Wang
- TRP-20160003: A study of Similar Word Model for Unfrequent Word Enhancement in Speech Recognition, Xi Ma, Dong Wang and Javier Tejedor
- TRP-20160002: Low-Frequency Words Embedding, Chao Xing, Yiqiao Pan, Dong Wang
- TRP-20160001: Max-margin metric learning for speaker recognition, Lantian Li, Chao Xing, Dong Wang
Author Statistics for TRP
rank | name | TRP number | first author number |
---|---|---|---|
1 | Dong Wang | 32 | 6 |
2 | Lantian Li | 19 | 8 |
3 | Zhiyuan Tang | 9 | 5 |
4 | Thomas Fang Zheng | 8 | 2 |
5 | Yang Feng | 5 | 1 |
6 | Tianyi Luo | 3 | 1 |
7 | Shiyue Zhang | 3 | 1 |
8 | Chao Xing | 2 | 1 |
9 | Ying Shi | 2 | 1 |
10 | Qixin Wang | 2 | 2 |
11 | Difei Tang | 2 | 0 |
12 | Askar Rozi | 5 | 2 |
13 | Qiang Zhou | 2 | 1 |
14 | Yixiang Chen | 2 | 0 |
15 | Chenghui Zhao | 2 | 1 |
16 | Javier Tejedor | 2 | 0 |
17 | Qing Chen | 2 | 0 |
18 | Nurbolat | 1 | 0 |
19 | Hang Luo | 1 | 1 |
20 | Renyu Wang | 1 | 0 |
21 | Chenhao Zhang | 1 | 0 |
22 | Askar Humdulla | 1 | 0 |
23 | Xi Ma | 1 | 1 |
24 | Guozhen Zhao | 1 | 0 |
25 | April Pu | 1 | 0 |
26 | Yiqiao Pan | 1 | 0 |
27 | Gang Wang | 1 | 0 |
28 | Jiyuan Zhang | 1 | 1 |
29 | Caixia Wang | 1 | 0 |
30 | Ravichander Vipperla | 1 | 0 |
31 | Shiyao Li | 1 | 0 |
32 | Rayilam Parhat | 1 | 0 |
33 | Ralph Grishman | 1 | 0 |
34 | Yang Wang | 1 | 1 |
35 | Dongxu Zhang | 1 | 1 |
Book
- Introduction to Modern Machine Learning Techniques (in Chinese)
Paper
Journal:
- Zhiyuan Tang, Lantian Li, Dong Wang, and Ravichander Vipperla, "Collaborative Joint Training with Multi-task Recurrent Model for Speech and Speaker Recognition", IEEE/ACM Transactions on Audio, Speech, and Language Processing. Preprint, 2016. (DOI: 10.1109/TASLP.2016.2639323)
- Linlin Wang, Jun Wang, Lantian Li, Thomas Fang Zheng, Frank K. Soong, "Improving Speaker Verification Performance against Long-Term Speaker Variability", Speech Communication, 79 (2016), 14-29, Mar. 2016.
- Lantian Li, Dong Wang, Chenhao Zhang, Thomas Fang Zheng, "Improving Short Utterance Speaker Recognition by Modeling Speech Unit Classes", In IEEE/ACM Transactions on Audio, Speech, and Language Processing (Volume:PP, Issue:99) DOI:10.1109/TASLP 2016.
- Thomas Fang Zheng, Rozi Askar, Renyu Wang, Lantian Li, "Overview of Biometric Recognition Technology", Journal of Information Security Research, 2(1): 12-26, Jan. 2016.
- Thomas Fang Zheng, Lantian Li, Hui Zhang, Rozi Askar, "Overview of Voiceprint Recognition Technology and Applications", Journal of Information Security Research, 2(1): 44-57, Jan. 2016.
- Xi Ma, Dong Wang, Javier Tejedor, "Similar Word Model for Unfrequent Word Enhancement in Speech Recognition", IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 24, no. 10, 2016.
Conference:
- Dongxu Zhang, Dong Wang, "Relation Classification: CNN or RNN?", NLPCC-ICCPOL 2016
- Dongxu Zhang, Tianyi Luo, Dong Wang, "Learning from LDA using Deep Neural Networks", NLPCC-ICCPOL 2016
- Askar Rozi, Dong Wang, Lantian Li, Thomas Fang Zheng, "Language-aware PLDA for Multilingual Speaker Recognition", O-COCOSDA 2016 (best student paper)
- Dong Wang, Zhiyuan Tang, Difei Tang, Qing Chen, "OC16-CE80: A Chinese-English Mixlingual Database and A Speech Recognition Baseline", O-COCOSDA 2016 (best paper)
- Chenghui Zhao, Lantian Li, Dong Wang and April Pu, "Local Training for PLDA in Speaker Verification", O-COCOSDA 2016
- Lantian Li, Dong Wang, Thomas Fang Zheng, "Max-Margin Metric Learning for Speaker Recognition", ISCSLP 2016
- Lantian Li, Dong Wang, Thomas Fang Zheng, "Binary Speaker Embedding", ISCSLP 2016
- Lantian Li, Dong Wang, Xiaodong Zhang, Thomas Fang Zheng, Panshi Jin, "System Combination for Short Utterance Speaker Recognition", APSIPA 2016
- Zhiyuan Tang, Lantian Li, Dong Wang, "Multi-task Recurrent Model for Speech and Speaker Recognition", APSIPA 2016
- Askar Rozi, Dong Wang, Lantian Li, Thomas Fang Zheng, "Feature Transformation For Speaker Verification Under Speaking Rate Mismatch Condition", APSIPA 2016
- Aiting Liu, Chao Xing, Yang Feng, Dong Wang, "Learning Ordered Word Representations", APSIPA 2016
- Dong Wang, Lantian Li, Difei Tang, Qing Chen, "AP16-OL7: A Multilingual Database for Oriental Languages and A Language Recognition Baseline", APSIPA 2016
- Zhiyuan Tang, Lantian Li, Dong Wang, "Multi-task Recurrent Model for True Multilingual Speech Recognition", APSIPA 2016
- Qixin Wang, Tianyi Luo, Dong Wang, "Can Machine Generate Traditional Chinese Poetry? A Feigenbaum Test", BICS 2016
- Dong Wang, Qiang Zhou, Amir Hussain, "Deep and Sparse Learning in Speech and Language Processing: An Overview", BICS 2016
- Qixin Wang, Tianyi Luo, Dong Wang, Chao Xing, "Chinese Song Iambics Generation with Neural Attention-based Model", IJCAI 2016
- Zhiyuan Tang, Dong Wang, Zhiyong Zhang, "Recurrent Neural Network Training with Dark Knowledge Transfer", ICASSP 2016
- Mian Wang, Dong Wang, "vMF-SNE: Embedding for Spherical Data", ICASSP 2016
- Renyu Wang, Mingliang Gu, Lantian Li, Mingxing Xu, Thomas Fang Zheng, "Speaker Segmentation Using Deep Speaker Vectors for Fast Speaker Change Scenarios", ICASSP 2017
Patent
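- 郑方 李蓝天 邬晓钧 别凡虎 王军, Method and apparatus for speech replay detection, 2016100073590 [D-ear] disclosure document
- 郑方 李蓝天 邬晓钧 王刚 刘乐, Identity authentication method and system based on voiceprint recognition, face recognition, and synchronous liveness detection, 2015108119085 [D-ear] disclosure document
- 王东 邢超 张之勇 赵梦原, A speech synthesis method for mixed languages [FreeNeb] disclosure document
- 王东 张之勇 赵梦原 黄伟明 李国强, A training method for Japanese speech recognition systems [同方] disclosure document
- 汪洋 王东 刘荣, Online reinforcement learning trading strategy [张露] disclosure document
- 王东 白紫薇 冯洋 杜新凯 游士学, LSTM-based technique for converting modern text into classical poetry [汇联] disclosure document
- 王东 张记袁 冯洋 杜新凯 游士学, A personalized music generation technique based on a common semantic space [汇联] disclosure document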
Talks
Date | Speaker | Title | Materials | On duty |
---|---|---|---|---|
2016/1/4 | Zhiyong Zhang | Parallel training, MPE, and natural gradient | slides | |
2016/1/18 | Dongxu Zhang | Memoryless Document Vector | slides | |
2016/3/14 | Zhiyuan Tang | Oral presentation for "vMF-SNE: Embedding for Spherical Data" | slides | |
2016/3/28 | Tianyi Luo | Review for Neural QA | slides | |
2016/4/11 | Rong Liu | Recommendation in Youku | slides | |
2016/5/09 | Miao Fan | Learning contextual embeddings of knowledge base with entity descriptions. | slides | |
2016/5/16 | Yang Wang | Research on conversation thread detection. | slides | |
2016/5/20 | Yang Wang & Maoning Wang | Research on portfolio selection. | slides1 slides2 | |
2016/5/20 | Zhiyuan Tang | ICASSP 2016 summary | slides | |
2016/5/23 | Dong Wang | graphical model and neural model | slides papers | |
2016/8/02 | Zhiyuan Tang | Visualizing, Measuring and Understanding Neural Networks: A Brief Survey | slides | |
2016/8/03 | Yang Wang | Neural networks and genetic programming for financial forecasting | slides | |
2016/11/05 | Yang Wang | Reinforcement Learning Models and Simulations | slides | |
2016/11/12 | Yang Wang | Generative Adversarial Nets | slides | |
2016/11/22 | Zhiyuan Tang | INTERSPEECH 2016 summary | slides | |
2016/11/30 | Dong Wang | Deep and sparse learning in speech and language: an overview | slides |
Database
name | type | size | dir | description |
---|---|---|---|---|
ASVspoof 2017 | wav | 348M | corpora/lilt/ASVspoof 2017 | ASVspoof 2017 data (from INTERSPEECH) collected by Lantian Li |
CSLT_China300 | wav | 8.8G | corpora/lilt/CSLT_China300 | Data from 300 Chinese speakers for SRE, collected by Lantian Li |
CSLT_Replay | wav | 13G | corpora/lilt/CSLT_Replay | CSLT replay spoofing data collected by Lantian Li |
CSLT_Digit | wav | 1.3G | corpora/lilt/CSLT_Digit | Digit string data for SRE collected by Lantian Li |
Idiap_avspoof | wav | 21G | corpora/lilt/Idiap_avspoof | Idiap Avspoof data collected by Lantian Li |
RedDots | wav | 1.2G | corpora/lilt/RedDots | RedDots data for SRE collected by Lantian Li |
SITW | wav | 19G | corpora/lilt/SITW | SITW data for SRE collected by Lantian Li |
VCTK | wav | 11G | corpora/lilt/VCTK | VCTK data for ASR and SRE collected by Lantian Li |
VoxForge | wav | 11G | corpora/lilt/VoxForge | VoxForge data for ASR and SRE collected by Lantian Li |
lyric | text | - | corpora/art/lyric | song lyric data collected by Jiyuan Zhang |
poem | text | - | corpora/art/poem | Traditional Chinese poem data collected by Qixin Wang and Jiyuan Zhang |
cnsong | text | - | corpora/art/song | Chinese Song sentences collected by Aiting Liu |
factordb | transaction | - | corpora/finance/factordb | Factor database collected by Wang Yang |
Code
Code Name | Author | Description |
---|---|---|
THCHS30 | Wang Dong, Zhang Xuewei | THCHS30 recipe link |
THUYG20 | Wang Dong, Zhang Xuewei | THUYG20 recipe link |
viewExp | Wang Yang | The experiment platform of the Finance team link |
vvBeam | Wang Dong | Beamforming using superdirective array link |
vvEngine | Zhiyong Zhang | ASR Engine link |
vvPoem | Jiyuan Zhang | Vivi Poem generation link |
vvQA | Chao Xing | QA system link |