2017年1月8日 (日) 02:59的最后版本

Technical report

TRP-20160039 Speaker Segmentation Using Deep Speaker Vectors for Fast Speaker Change Scenarios, Renyu Wang, Mingliang Gu, Lantian Li, Mingxing Xu, Thomas Fang Zheng
TRP-20160038 生物特征识别技术综述, Thomas Fang Zheng, Askar Rozi, Renyu Wang, Lantian Li
TRP-20160037 声纹识别技术及其应用现状, Thomas Fang Zheng, Lantian Li, Hui Zhang, Askar Rozi
TRP-20160036 Deep Q-trading, Yang Wang, Dong Wang, Shiyue Zhang,Yang Feng, Shiyao Li,and Qiang Zhou
TRP-20160035 Moses中文操作手册, 冯洋
TRP-20160034 The Present and Future of Speech Recognition, Dong Wang
TRP-20160033 Memoryless Document Vector, Dongxu Zhang, Dong Wang
TRP-20160032 Can Machine Generate Traditional Chinese Poetry? A Turing Test, Qixin Wang, Tianyi Luo, Dong Wang
TRP-20160031 OC16-CE80: A Chinese-English Mixlingual Database and A Speech Recognition Baseline, Dong Wang, Zhiyuan Tang, Difei Tang and Qing Chen
TRP-20160030 Collaborative Joint Training with Multi-task Recurrent Model for Speech and Speaker Recognition, Zhiyuan Tang, Lantian Li, Dong Wang and Ravichander Vipperla
TRP-20160029 Multi-task Recurrent Model for Speech and Speaker Recognition, Zhiyuan Tang, Lantian Li and Dong Wang
TRP-20160028 Multi-task Recurrent Model for True Multilingual Speech Recognition, Zhiyuan Tang, Lantian Li and Dong Wang
TRP-20160027 Collaborative Learning for Language and Speaker Recognition, Lantian Li, Zhiyuan Tang, Dong Wang, Yang Feng and Shiyue Zhang
TRP-20160026 Weakly Supervised PLDA Training, Lantian Li, Dong Wang, Yixiang Chen and Chenghui Zhao
TRP-20160025 Local Training for PLDA in Speaker Veriﬁcation, Chenghui Zhao, Lantian Li, Dong Wang and April Pu
TRP-20160024 Decision Making Based on Cohort Scores for Speaker Verification, Lantian Li, Renyu Wang, Gang Wang, Caixia Wang and Thomas Fang Zheng
TRP-20160023 AP16-OL7: A Multilingual Database for Oriental Languages and A Language Recognition Baseline, Dong Wang, Lantian Li, Difei Tang and Qing Chen
TRP-20160022 System Combination for Short Utterance Speaker Recognition, Lantian Li, Dong Wang and Thomas Fang Zheng
TRP-20160021 Improving Short Utterance Speaker Recognition by Modeling Speech Unit Classes, Lantian Li, Dong Wang, Chenhao Zhang and Thomas Fang Zheng
TRP-20160020 Feature Transformation For Speaker Verification Under Speaking Rate Mismatch Condition, Askar Rozi, Lantian Li, Dong Wang and Thomas Fang Zheng
TRP-20160019 Language-aware PLDA for Multilingual Speaker Recognition, Askar Rozi, Dong Wang, Lantian Li and Thomas Fang Zheng
TRP-20160018 Chinese Song Iambics Generation with Neural Attention-based Model, Qixin Wang, Tianyi Luo, Dong Wang
TRP-20160017 How to Config Kaldi nnet3 (in Chinese), Zhiyuan Tang and Dong Wang
TRP-20160016 How to deploy joint training in Kaldi (in Chinese), Hang Luo, Zhiyuan Tang and Dong Wang
TRP-20160015 How to run ASR system for Kazak (in Chinese), Ying Shi, Zhiyuan Tang，Nurbolat and Dong Wang
TRP-20160014 Exploring The Role of Deep Speaker Features for Speaker Verification, Lantian Li and Dong Wang
TRP-20160013 Sparse Discriminative Analysis and Its Application in Distraction Classification, Dong Wang
TRP-20160012 i-vector system in Kaldi (in Chinese) Yixang Chen, Lantian Li and Dong Wang
TRP-20160011 基于说话人信道相关的录音重放检测若干方法探究 Lantian Li, Yixiang Chen and Dong Wang
TRP-20160010 Highly Restricted Keyword Selection Based on Sparse Analysis for Uyghur Text Categorization, Dong Wang, Askar Humdulla, Rayilam Parhat, Javier Tejedor
TRP-20160009: RNNG Code User Guide, Shiyue Zhang and Yang Feng
TRP-20160008: Different styles of poetry generation based on memory model, Jiyuan Zhang,Yang Feng and Dong Wang
TRP-20160007: Distraction Detection Using Sparse Discriminative Analysis, Dong Wang and Guozhen Zhao
TRP-20160006: Visualization Analysis for Recurrent Networks, Zhiyuan Tang, Ying Shi and Dong Wang
TRP-20160005: Distributed Representation Learning for Knowledge Graphs with Entity Descriptions; Miao Fan, Qiang Zhou, Thomas Fang Zheng, Ralph Grishman
TRP-20160004: A Review of Neural QA, Tianyi Luo and Dong Wang
TRP-20160003: A study of Similar Word Model for Unfrequent Word Enhancement in Speech Recognition, Xi Ma, Dong Wang and Javier Tejedor
TRP-20160002: Low-Frequency Words Embedding, Chao Xing, Yiqiao Pan, Dong Wang
TRP-20160001: Max-margin metric learning for speaker recognition, Lantian Li, Chao Xing, Dong Wang

Author Statistics for TRP

rank	name	TRP number	first author number
1	Dong Wang	32	6
2	Lantian Li	19	8
3	Zhiyuan Tang	9	5
4	Thomas Fang Zheng	8	2
5	Yang Feng	5	1
6	Tianyi Luo	3	1
7	Shiyue Zhang	3	1
8	Chao Xing	2	1
9	Ying Shi	2	1
10	Qixin Wang	2	2
11	Difei Tang	2	0
12	Askar Rozi	5	2
13	Qiang Zhou	2	1
14	Yixiang Chen	2	0
15	Chenghui Zhao	2	1
16	Javier Tejedor	2	0
17	Qing Chen	2	0
18	Nurbolat	1	0
19	Hang Luo	1	1
20	Renyu Wang	1	0
21	Chenhao Zhang	1	0
22	Askar Humdulla	1	0
23	Xi Ma	1	1
24	Guozhen Zhao	1	0
25	April Pu	1	0
26	Yiqiao Pan	1	0
27	Gang Wang	1	0
28	Jiyuan Zhang	1	1
29	Caixia Wang	1	0
30	Ravichander Vipperla	1	0
31	Shiyao Li	1	0
32	Rayilam Parhat	1	0
33	Ralph Grishman	1	0
34	Yang Wang	1	1
35	Dongxu Zhang	1	1

Book

现代机器学习技术导论

Paper

Journal:

Zhiyuan Tang, Lantian Li, Dong Wang, and Ravichander Vipperla, "Collaborative Joint Training with Multi-task Recurrent Model for Speech and Speaker Recognition", IEEE/ACM Transactions on Audio, Speech, and Language Processing. Preprint, 2016. (DOI: 10.1109/TASLP.2016.2639323)
Linlin Wang, Jun Wang, Lantian Li, Thomas Fang Zheng, Frank K.Soong, "Improving Speaker Verfication Performance against Long-Term Speaker Variability", Speech Communication, 79 (2016), 14-29, Mar. 2016.
Lantian Li, Dong Wang, Chenhao Zhang, Thomas Fang Zheng, "Improving Short Utterance Speaker Recognition by Modeling Speech Unit Classes", In IEEE/ACM Transactions on Audio, Speech, and Language Processing (Volume:PP, Issue:99) DOI:10.1109/TASLP 2016.
Thomas Fang Zheng, Rozi Askar, Renyu Wang, Lantian Li, "Overview of Biometric Recognition Technology", Journal of Information Security Research, 2(1): 12-26, Jan. 2016.
Thomas Fang Zheng, Lantian Li, Hui Zhang, Rozi Askar, "Overview of Voiceprint Recognition Technology and Applications", Journal of Information Security Research, 2(1): 44-57, Jan. 2016.
Xi Ma, Dong Wang, Javier Tejedor "Similar Word Model for Unfrequent Word Enhancement in Speech Recognition," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol 24, no. 10, 2016.

Conference:

Dongxu Zhang, Dong Wang, "Relation Classification: CNN or RNN?", NLPCC-ICCPOL 2016
Dongxu Zhang, Tianyi Luo, Dong Wang, "Learning from LDA using Deep Neural Networks", NLPCC-ICCPOL 2016
Askar Rozi, Dong Wang, Lantian Li, Thomas Fang Zheng, " Language-aware PLDA for Multilingual Speaker Recognition", OCOCOSDA 2016 (best student paper)
Dong Wang, Zhiyuan Tang, Difei Tang, Qing Chen "OC16-CE80: A Chinese-English Mixlingual Database and A Speech Recognition Baseline", OCOCOSDA 2016 （best paper)
Chenghui Zhao, Lantian Li, Dong Wang and April Pu, " Local Training for PLDA in Speaker Verification", OCOCOSDA 2016
Lantian Li, Dong Wang, Thomoas Fang Zheng, "Max-Margin Metric Learning for Speaker Recognition", ISCSLP 2016
Lantian Li, Dong Wang, Thomas Fang Zheng, " Binary Speaker Embedding", ISCSLP 2016
Lantian Li, Dong Wang, Xiaodong Zhang, Thomas Fang Zheng, Panshi Jin, "System Combination for Short Utterance Speaker Recognition", APSIPA 2016
Zhiyuan Tang, Lantian Li, Dong Wang, "Multi-task Recurrent Model for Speech and Speaker Recognition", APSIPA 2016
Askar Rozi, Dong Wang, Lantian Li, Thomas Fang Zheng, "Feature Transformation For Speaker Verification Under Speaking Rate Mismatch Condition", APSIPA 2016
Aiting Liu, Chao Xing, Yang Feng, Dong Wang, "Learning Ordered Word Representations", APSIPA 2016
Dong Wang, Lantian Li, Difei Tang, Qing Chen, "AP16-OL7: A Multilingual Database for Oriental Languages and A Language Recognition Baseline", APSIPA 2016
Zhiyuan Tang, Lantian Li, Dong Wang, "Multi-task Recurrent Model for True Multilingual Speech Recognition", APSIPA 2016
Qinxin Wang, Tianyi Luo, Dong Wang, "Can Machine Generate Traditional Chinese Poetry? A Feigenbaum Test", BICS 2016
Dong Wang, Qiang Zhou, Amir Hussian, "Deep and Sparse Learning in Speech and Language Processing: An Overview", BICS 2016
Qixin Wang, Tianyi Luo, Dong Wang, Chao Xing, "Chinese Song Iambics Generation with Neural Attention-based Model", IJCAI 2016
Zhiyuan Tang, Dong Wang, Zhiyong Zhang, "Recurrent Neural Network Training with Dark Knowledge Transfer", ICASSP 2016
Mian Wang, Dong Wang, "VMF-SNE: EMBEDDING FOR SPHERICAL DATA", ICASSP 2016
Renyu Wang, Mingliang Gu, Lantian Li, Mingxing Xu, Thomas Fang Zheng, "Speaker Segmentation Using Deep Speaker Vectors for Fast Speaker Change Scenarios", ICASSP 2017

Patent

郑方李蓝天邬晓钧别凡虎王军语音重放检测方法和装置 2016100073590 [D-ear] 交底书
郑方李蓝天邬晓钧王刚刘乐基于声纹识别、人脸识别以及同步活体检测的身份认证方法及系统 2015108119085 [D-ear] 交底书
王东邢超张之勇赵梦原一种面向混合语言的语音合成方法 [FreeNeb] 交底书
王东张之勇赵梦原黄伟明李国强，一种日语语音识别系统训练方法 [同方] 交底书
汪洋，王东，刘荣在线强化学习交易策略 [张露] 文底书
王东，白紫薇，冯洋，杜新凯，游士学，基于LSTM模型的现代文到古诗的转换技术 [汇联] 交底书
王东，张记袁，冯洋，杜新凯，游士学，一种基于共同语义空间的个性化音乐生成技术 [汇联] 交底书

Talks

Date	Speaker	Title	Materials
2016/1/4	Zhiyong Zhang	Parallel training,MPE and natural gradient	slides
2016/1/18	Dongxu Zhang	Memoryless Document Vector	slides
2016/3/14	Zhiyuan Tang	Oral presentation for "vMF-SNE: Embedding for Spherical Data"	slides
2016/3/28	Tianyi Luo	Review for Neural QA	slides
2016/4/11	Rong Liu	Recommendation in Youku	slides
2016/5/09	Miao Fan	Learning contextual embeddings of knowledge base with entity descriptions.	slides
2016/5/16	Yang Wang	Research on conversation thread detection.	slides
2016/5/20	Yang Wang & Maoning Wang	Research on portfolio selection.	slides1 slides2
2016/5/20	Zhiyuan Tang	ICASSP 2016 summary	slides
2016/5/23	Dong Wang	graphical model and neural model	slides papers
2016/8/02	Zhiyuan Tang	Visualizing, Measuring and Understanding Neural Networks: A Brief Survey	slides
2016/8/03	Yang Wang	Neural networks and genetic programming for financial forecasting	slides
2016/11/05	Yang Wang	Reinforcement Learning Models and Simulations	slides
2016/11/12	Yang Wang	Generative Adversarial Nets	slides
2016/11/22	Zhiyuan Tang	INTERSPEECH 2016 summary	slides
2016/11/30	Dong Wang	Deep and sparse learning in speech and language: an overview	slides

Database

name	type	size	dir	description
ASVspoof 2017	wav	348M	corpora/lilt/ASVspoof 2017	ASVspoof 2017 data (from INTERSPEECH) collected by Lantian Li
CSLT_China300	wav	8.8G	corpora/lilt/CSLT_China300	300 chinese speakers data for SRE collected by Lantian Li
CSLT_Replay	wav	13G	corpora/lilt/CSLT_Replay	CSLT replay spoofing data collected by Lantian Li
CSLT_Digit	wav	1.3G	corpora/lilt/CSLT_Digit	Digit string data for SRE collected by Lantian Li
Idiap_avspoof	wav	21G	corpora/lilt/Idiap_avspoof	Idiap Avspoof data collected by Lantian Li
RedDots	wav	1.2G	corpora/lilt/RedDots	RedDots data for SRE collected by Lantian Li
SITW	wav	19G	corpora/lilt/SITW	SITW data for SRE collected by Lantian Li
VCTK	wav	11G	corpora/lilt/VCTK	VCTK data for ASR and SRE collected by Lantian Li
VoxForge	wav	11G	corpora/lilt/VoxForge	VoxForge data for ASR and SRE collected by Lantian Li
lyric	text	-	corpora/art/lyric	song lyric data collected by Jiyuan Zhang
poem	text	-	corpora/art/poem	Traditional Chinese poem data collected by Qixin Wang and Jiyuan Zhang
cnsong	text	-	corpora/art/song	Chinese Song sentences collected by Aiting Liu
factordb	transaction	-	corpora/finance/factordb	Factor database collected by Wangyang

Code

Code Name	Author	Description
THCHS30	Wang Dong, Zhang Xuewei	THCHS30 recipe link
THUYG20	Wang Dong, Zhang Xuewei	THUYG20 recipe link
viewExp	Wang Yang	The experiment platform of the Finance team link
vvBeam	Wang Dong	Beamforming using superdirective array link
vvEngine	Zhiyong Zhang	ASR Engine link
vvPoem	Jiyuan Zhang	Vivi Poem generatoin http://git.cslt.org/vivi/vvpoem link]
vvQA	Chao Xing	QA system link

@@ 第1行： / 第1行： @@
 =Technical report=
+#[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/5/53/TRP-20160039.pdf TRP-20160039 Speaker Segmentation Using Deep Speaker Vectors for Fast Speaker Change Scenarios, Renyu Wang, Mingliang Gu, Lantian Li, Mingxing Xu, Thomas Fang Zheng]
+#[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/0/06/TRP-20160038.pdf TRP-20160038 生物特征识别技术综述, Thomas Fang Zheng, Askar Rozi, Renyu Wang, Lantian Li]
+#[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/3/3d/TRP-20160037.pdf TRP-20160037 声纹识别技术及其应用现状, Thomas Fang Zheng, Lantian Li, Hui Zhang, Askar Rozi]
+#[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/5/5f/Dtq.pdf TRP-20160036 Deep Q-trading, Yang Wang, Dong Wang, Shiyue Zhang,Yang Feng, Shiyao Li,and Qiang Zhou]
+#[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/9/92/Moses%E6%93%8D%E4%BD%9C%E6%89%8B%E5%86%8C--%E5%86%AF%E6%B4%8B.pdf TRP-20160035  Moses中文操作手册, 冯洋]
+#[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/3/3d/CCF-ASR.pdf TRP-20160034  The Present and Future of Speech Recognition, Dong Wang]
+#[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/a/a2/Memory.pdf TRP-20160033  Memoryless Document Vector, Dongxu Zhang, Dong Wang]
+#[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/7/7a/Turing.pdf TRP-20160032 Can Machine Generate Traditional Chinese Poetry? A Turing Test, Qixin Wang, Tianyi Luo, Dong Wang]
+#[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/c/ce/TRP-20160031.pdf TRP-20160031 OC16-CE80: A Chinese-English Mixlingual Database and A Speech Recognition Baseline, Dong Wang, Zhiyuan Tang, Difei Tang and Qing Chen]
+#[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/9/9e/TRP-20160030.pdf TRP-20160030 Collaborative Joint Training with Multi-task Recurrent Model for Speech and Speaker Recognition, Zhiyuan Tang, Lantian Li, Dong Wang and Ravichander Vipperla]
+#[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/0/09/TRP-20160029.pdf TRP-20160029 Multi-task Recurrent Model for Speech and Speaker Recognition, Zhiyuan Tang, Lantian Li and Dong Wang]
+#[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/5/55/TRP-20160028.pdf TRP-20160028 Multi-task Recurrent Model for True Multilingual Speech Recognition, Zhiyuan Tang, Lantian Li and Dong Wang]
 #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/2/27/TRP-20160027.pdf TRP-20160027 Collaborative Learning for Language and Speaker Recognition, Lantian Li, Zhiyuan Tang, Dong Wang, Yang Feng and Shiyue Zhang]
 #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/9/9e/TRP-20160026.pdf TRP-20160026 Weakly Supervised PLDA Training, Lantian Li, Dong Wang, Yixiang Chen and Chenghui Zhao]
@@ 第10行： / 第22行： @@
 #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/7/76/TRP-20160020.pdf TRP-20160020 Feature Transformation For Speaker Verification Under Speaking Rate Mismatch Condition, Askar Rozi, Lantian Li, Dong Wang and Thomas Fang Zheng]
 #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/c/c9/TRP-20160019.pdf TRP-20160019 Language-aware PLDA for Multilingual Speaker Recognition, Askar Rozi, Dong Wang, Lantian Li and Thomas Fang Zheng]
-#[TRP-20160018]
+#[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/b/b5/Ijcai16.pdf TRP-20160018 Chinese Song Iambics Generation with Neural Attention-based Model, Qixin Wang, Tianyi Luo, Dong Wang]
 #[http://cslhttp://cslt.riit.tsinghua.edu.cn/mediawiki/index.php/%E7%89%B9%E6%AE%8A:%E4%B8%8A%E4%BC%A0%E6%96%87%E4%BB%B6t.riit.tsinghua.edu.cn/mediawiki/images/7/7e/Nnet3_config.pdf TRP-20160017 How to Config Kaldi nnet3 (in Chinese), Zhiyuan Tang and Dong Wang]
 #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/e/e6/Joint_training_config.pdf TRP-20160016 How to deploy joint training in Kaldi (in Chinese), Hang Luo, Zhiyuan Tang and Dong Wang]
@@ 第28行： / 第40行： @@
 #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/3/39/How_to_deal_with_low_frequency_words.pdf TRP-20160002: Low-Frequency Words Embedding, Chao Xing, Yiqiao Pan, Dong Wang]
 #[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/b/b6/Max-margin.pdf TRP-20160001: Max-margin metric learning for speaker recognition, Lantian Li, Chao Xing, Dong Wang]
+=Author Statistics for TRP=
+{|class="wikitable"
+! rank !! name !! TRP number !! first author number
+|-
+| 1 || Dong Wang || 32 || 6
+|-
+| 2 || Lantian Li || 19 || 8
+|-
+| 3 || Zhiyuan Tang || 9 || 5
+|-
+| 4 || Thomas Fang Zheng || 8 || 2
+|-
+| 5 || Yang Feng || 5 || 1
+|-
+| 6 || Tianyi Luo || 3 || 1
+|-
+| 7 || Shiyue Zhang || 3 || 1
+|-
+| 8 || Chao Xing || 2 || 1
+|-
+| 9 || Ying Shi || 2 || 1
+|-
+| 10 || Qixin Wang || 2 || 2
+|-
+| 11 || Difei Tang || 2 || 0
+|-
+| 12 || Askar Rozi || 5 || 2
+|-
+| 13 || Qiang Zhou || 2 || 1
+|-
+| 14 || Yixiang Chen || 2 || 0
+|-
+| 15 || Chenghui Zhao || 2 || 1
+|-
+| 16 || Javier Tejedor || 2 || 0
+|-
+| 17 || Qing Chen || 2 || 0
+|-
+| 18 || Nurbolat || 1 || 0
+|-
+| 19 || Hang Luo || 1 || 1
+|-
+| 20 || Renyu Wang || 1 || 0
+|-
+| 21 || Chenhao Zhang || 1 || 0
+|-
+| 22 || Askar Humdulla || 1 || 0
+|-
+| 23 || Xi Ma || 1 || 1
+|-
+| 24 || Guozhen Zhao || 1 || 0
+|-
+| 25 || April Pu || 1 || 0
+|-
+| 26 || Yiqiao Pan || 1 || 0
+|-
+| 27 || Gang Wang || 1 || 0
+|-
+| 28 || Jiyuan Zhang || 1 || 1
+|-
+| 29 || Caixia Wang || 1 || 0
+|-
+| 30 || Ravichander Vipperla || 1 || 0
+|-
+| 31 || Shiyao Li || 1 || 0
+|-
+| 32 || Rayilam Parhat || 1 || 0
+|-
+| 33 || Ralph Grishman || 1 || 0
+|-
+| 34 || Yang Wang || 1 || 1
+|-
+| 35 || Dongxu Zhang || 1 || 1
+|-
+|}
+=Book=
+#[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/6/61/Book.pdf 现代机器学习技术导论]
+=Paper=
+Journal:
+#Zhiyuan Tang, Lantian Li, Dong Wang, and Ravichander Vipperla, "Collaborative Joint Training with Multi-task Recurrent Model for Speech and Speaker Recognition", IEEE/ACM Transactions on Audio, Speech, and Language Processing. Preprint, 2016. (DOI: 10.1109/TASLP.2016.2639323)
+#Linlin Wang, Jun Wang, Lantian Li, Thomas Fang Zheng, Frank K.Soong, "Improving Speaker Verfication Performance against Long-Term Speaker Variability", Speech Communication, 79 (2016), 14-29, Mar. 2016.
+#Lantian Li, Dong Wang, Chenhao Zhang, Thomas Fang Zheng, "Improving Short Utterance Speaker Recognition by Modeling Speech Unit Classes", In IEEE/ACM Transactions on Audio, Speech, and Language Processing (Volume:PP, Issue:99) DOI:10.1109/TASLP 2016.
+#Thomas Fang Zheng, Rozi Askar, Renyu Wang, Lantian Li, "Overview of Biometric Recognition Technology", Journal of Information Security Research, 2(1): 12-26, Jan. 2016.
+#Thomas Fang Zheng, Lantian Li, Hui Zhang, Rozi Askar, "Overview of Voiceprint Recognition Technology and Applications", Journal of Information Security Research, 2(1): 44-57, Jan. 2016.
+#Xi Ma, Dong Wang, Javier Tejedor "Similar Word Model for Unfrequent Word Enhancement in Speech Recognition," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol 24, no. 10, 2016.
+Conference:
+#Dongxu Zhang, Dong Wang, "Relation Classification: CNN or RNN?", NLPCC-ICCPOL 2016
+#Dongxu Zhang, Tianyi Luo, Dong Wang, "Learning from LDA using Deep Neural Networks", NLPCC-ICCPOL 2016
+#Askar Rozi, Dong Wang, Lantian Li, Thomas Fang Zheng, " Language-aware PLDA for Multilingual Speaker Recognition", OCOCOSDA 2016 (best student paper)
+#Dong Wang, Zhiyuan Tang, Difei Tang, Qing Chen "OC16-CE80: A Chinese-English Mixlingual Database and A Speech Recognition Baseline", OCOCOSDA 2016 （best paper)
+#Chenghui Zhao, Lantian Li, Dong Wang and April Pu, " Local Training for PLDA in Speaker Verification", OCOCOSDA 2016
+#Lantian Li, Dong Wang, Thomoas Fang Zheng, "Max-Margin Metric Learning for Speaker Recognition", ISCSLP 2016
+#Lantian Li, Dong Wang, Thomas Fang Zheng, " Binary Speaker Embedding", ISCSLP 2016
+#Lantian Li, Dong Wang, Xiaodong Zhang, Thomas Fang Zheng, Panshi Jin, "System Combination for Short Utterance Speaker Recognition", APSIPA 2016
+#Zhiyuan Tang, Lantian Li, Dong Wang, "Multi-task Recurrent Model for Speech and Speaker Recognition", APSIPA 2016
+#Askar Rozi, Dong Wang, Lantian Li, Thomas Fang Zheng, "Feature Transformation For Speaker Verification Under Speaking Rate Mismatch Condition", APSIPA 2016
+#Aiting Liu, Chao Xing, Yang Feng, Dong Wang, "Learning Ordered Word Representations", APSIPA 2016
+#Dong Wang, Lantian Li, Difei Tang, Qing Chen, "AP16-OL7: A Multilingual Database for Oriental Languages and A Language Recognition Baseline", APSIPA 2016
+#Zhiyuan Tang, Lantian Li, Dong Wang, "Multi-task Recurrent Model for True Multilingual Speech Recognition", APSIPA 2016
+#Qinxin Wang, Tianyi Luo, Dong Wang, "Can Machine Generate Traditional Chinese Poetry? A Feigenbaum Test", BICS 2016
+#Dong Wang, Qiang Zhou, Amir Hussian, "Deep and Sparse Learning in Speech and Language Processing: An Overview", BICS 2016
+#Qixin Wang, Tianyi Luo, Dong Wang, Chao Xing, "Chinese Song Iambics Generation with Neural Attention-based Model", IJCAI 2016
+#Zhiyuan Tang, Dong Wang, Zhiyong Zhang, "Recurrent Neural Network Training with Dark Knowledge Transfer", ICASSP 2016
+#Mian Wang, Dong Wang, "VMF-SNE: EMBEDDING FOR SPHERICAL DATA", ICASSP 2016
+#Renyu Wang, Mingliang Gu, Lantian Li, Mingxing Xu, Thomas Fang Zheng, "Speaker Segmentation Using Deep Speaker Vectors for Fast Speaker Change Scenarios", ICASSP 2017
 =Patent=
-#郑方 李蓝天 邬晓钧 别凡虎 王军 语音重放检测方法和装置 [D-ear] [[媒体文件:一种中英文混合的语音合成方法.docx|交底书]]
+#郑方 李蓝天 邬晓钧 别凡虎 王军 语音重放检测方法和装置 2016100073590 [D-ear] [[媒体文件:2016100073590.pdf|交底书]]
-#郑方 李蓝天 邬晓钧 王刚 刘乐 基于声纹识别、人脸识别以及同步活体检测的身份认证方法及系统 [D-ear] [[媒体文件:一种中英文混合的语音合成方法.docx|交底书]]
+#郑方 李蓝天 邬晓钧 王刚 刘乐 基于声纹识别、人脸识别以及同步活体检测的身份认证方法及系统 2015108119085 [D-ear] [[媒体文件:2015108119085.pdf|交底书]]
 #王东 邢超 张之勇 赵梦原 一种面向混合语言的语音合成方法 [FreeNeb] [[媒体文件:一种中英文混合的语音合成方法.docx|交底书]]
 #王东 张之勇 赵梦原 黄伟明 李国强，一种日语语音识别系统训练方法  [同方] [[媒体文件:一种日语语音识别系统训练方法.docx|交底书]]
@@ 第81行： / 第206行： @@
 {| class="wikitable"
 ! name !! type!! size !! dir !! description
+|-
+|ASVspoof 2017||wav||348M||corpora/lilt/ASVspoof 2017|| ASVspoof 2017 data (from INTERSPEECH) collected by Lantian Li
+|-
+|CSLT_China300||wav||8.8G||corpora/lilt/CSLT_China300|| 300 chinese speakers data for SRE collected by Lantian Li
+|-
+|CSLT_Replay||wav||13G||corpora/lilt/CSLT_Replay|| CSLT replay spoofing data collected by Lantian Li
+|-
+|CSLT_Digit||wav||1.3G||corpora/lilt/CSLT_Digit|| Digit string data for SRE collected by Lantian Li
+|-
+|Idiap_avspoof||wav||21G||corpora/lilt/Idiap_avspoof|| Idiap Avspoof data collected by Lantian Li
+|-
+|RedDots||wav||1.2G||corpora/lilt/RedDots|| RedDots data for SRE collected by Lantian Li
+|-
+|SITW||wav||19G||corpora/lilt/SITW|| SITW data for SRE collected by Lantian Li
+|-
+|VCTK||wav||11G||corpora/lilt/VCTK|| VCTK data for ASR and SRE collected by Lantian Li
+|-
+|VoxForge||wav||11G||corpora/lilt/VoxForge|| VoxForge data for ASR and SRE collected by Lantian Li
 |-
 |lyric||text||-||corpora/art/lyric|| song lyric data collected by Jiyuan Zhang
@@ 第95行： / 第238行： @@
 {| class="wikitable"
 ! Code Name!! Author !! Description
+|-
+|THCHS30||Wang Dong, Zhang Xuewei|| THCHS30 recipe [https://github.com/kaldi-asr/kaldi link]
+|-
+|THUYG20||Wang Dong, Zhang Xuewei|| THUYG20 recipe [https://github.com/wangdong99/kaldi link]
 |-
 |viewExp||Wang Yang|| The experiment platform of the Finance team [http://git.cslt.org/finance/viewexp/tree/master link]
 |-
 |vvBeam||Wang Dong|| Beamforming using superdirective array [http://git.cslt.org/speech/beam link]
+|-
+|vvEngine||Zhiyong Zhang|| ASR Engine [http://git.cslt.org/zhangzy/freepulsar link]
+|-
+|vvPoem||Jiyuan Zhang|| Vivi Poem generatoin http://git.cslt.org/vivi/vvpoem link]
+|-
+|vvQA||Chao Xing|| QA system [http://cslt.riit.tsinghua.edu.cn/mediawiki/index.php/FreeNeb_ViviQA_Release link]
 |-
 |}
-=Model and System=

“Delivery-2016”版本间的差异

2017年1月8日 (日) 02:59的最后版本

目录

Technical report

Author Statistics for TRP

Book

Paper

Patent

Talks

Database

Code

导航菜单

个人工具

名字空间

变种

查看

操作

搜索

导航

工具