2017年1月7日 (六) 16:47的版本

Technical report

TRP-20160034 The Present and Future of Speech Recognition, Dong Wang
TRP-20160033 Memoryless Document Vector, Dongxu Zhang, Dong Wang
TRP-20160032 Can Machine Generate Traditional Chinese Poetry? A Turing Test, Qixin Wang, Tianyi Luo, Dong Wang
TRP-20160031 OC16-CE80: A Chinese-English Mixlingual Database and A Speech Recognition Baseline, Dong Wang, Zhiyuan Tang, Difei Tang and Qing Chen
TRP-20160030 Collaborative Joint Training with Multi-task Recurrent Model for Speech and Speaker Recognition, Zhiyuan Tang, Lantian Li, Dong Wang and Ravichander Vipperla
TRP-20160029 Multi-task Recurrent Model for Speech and Speaker Recognition, Zhiyuan Tang, Lantian Li and Dong Wang
TRP-20160028 Multi-task Recurrent Model for True Multilingual Speech Recognition, Zhiyuan Tang, Lantian Li and Dong Wang
TRP-20160027 Collaborative Learning for Language and Speaker Recognition, Lantian Li, Zhiyuan Tang, Dong Wang, Yang Feng and Shiyue Zhang
TRP-20160026 Weakly Supervised PLDA Training, Lantian Li, Dong Wang, Yixiang Chen and Chenghui Zhao
TRP-20160025 Local Training for PLDA in Speaker Veriﬁcation, Chenghui Zhao, Lantian Li, Dong Wang and April Pu
TRP-20160024 Decision Making Based on Cohort Scores for Speaker Verification, Lantian Li, Renyu Wang, Gang Wang, Caixia Wang and Thomas Fang Zheng
TRP-20160023 AP16-OL7: A Multilingual Database for Oriental Languages and A Language Recognition Baseline, Dong Wang, Lantian Li, Difei Tang and Qing Chen
TRP-20160022 System Combination for Short Utterance Speaker Recognition, Lantian Li, Dong Wang and Thomas Fang Zheng
TRP-20160021 Improving Short Utterance Speaker Recognition by Modeling Speech Unit Classes, Lantian Li, Dong Wang, Chenhao Zhang and Thomas Fang Zheng
TRP-20160020 Feature Transformation For Speaker Verification Under Speaking Rate Mismatch Condition, Askar Rozi, Lantian Li, Dong Wang and Thomas Fang Zheng
TRP-20160019 Language-aware PLDA for Multilingual Speaker Recognition, Askar Rozi, Dong Wang, Lantian Li and Thomas Fang Zheng
TRP-20160018 Chinese Song Iambics Generation with Neural Attention-based Model, Qixin Wang, Tianyi Luo, Dong Wang
TRP-20160017 How to Config Kaldi nnet3 (in Chinese), Zhiyuan Tang and Dong Wang
TRP-20160016 How to deploy joint training in Kaldi (in Chinese), Hang Luo, Zhiyuan Tang and Dong Wang
TRP-20160015 How to run ASR system for Kazak (in Chinese), Ying Shi, Zhiyuan Tang，Nurbolat and Dong Wang
TRP-20160014 Exploring The Role of Deep Speaker Features for Speaker Verification, Lantian Li and Dong Wang
TRP-20160013 Sparse Discriminative Analysis and Its Application in Distraction Classification, Dong Wang
TRP-20160012 i-vector system in Kaldi (in Chinese) Yixang Chen, Lantian Li and Dong Wang
TRP-20160011 基于说话人信道相关的录音重放检测若干方法探究 Lantian Li, Yixiang Chen and Dong Wang
TRP-20160010 Highly Restricted Keyword Selection Based on Sparse Analysis for Uyghur Text Categorization, Dong Wang, Askar Humdulla, Rayilam Parhat, Javier Tejedor
TRP-20160009: RNNG Code User Guide, Shiyue Zhang and Yang Feng
TRP-20160008: Different styles of poetry generation based on memory model, Jiyuan Zhang,Yang Feng and Dong Wang
TRP-20160007: Distraction Detection Using Sparse Discriminative Analysis, Dong Wang and Guozhen Zhao
TRP-20160006: Visualization Analysis for Recurrent Networks, Zhiyuan Tang, Ying Shi and Dong Wang
TRP-20160005: Distributed Representation Learning for Knowledge Graphs with Entity Descriptions; Miao Fan, Qiang Zhou, Thomas Fang Zheng, Ralph Grishman
TRP-20160004: A Review of Neural QA, Tianyi Luo and Dong Wang
TRP-20160003: A study of Similar Word Model for Unfrequent Word Enhancement in Speech Recognition, Xi Ma, Dong Wang and Javier Tejedor
TRP-20160002: Low-Frequency Words Embedding, Chao Xing, Yiqiao Pan, Dong Wang
TRP-20160001: Max-margin metric learning for speaker recognition, Lantian Li, Chao Xing, Dong Wang

Book

现代机器学习技术导论

Paper

Journal:

Zhiyuan Tang, Lantian Li, Dong Wang, and Ravichander Vipperla, "Collaborative Joint Training with Multi-task Recurrent Model for Speech and Speaker Recognition", IEEE/ACM Transactions on Audio, Speech, and Language Processing. Preprint, 2016. (DOI: 10.1109/TASLP.2016.2639323)
Linlin Wang, Jun Wang, Lantian Li, Thomas Fang Zheng, Frank K.Soong, "Improving Speaker Verfication Performance against Long-Term Speaker Variability", Speech Communication, 79 (2016), 14-29, Mar. 2016.
Lantian Li, Dong Wang, Chenhao Zhang, Thomas Fang Zheng, "Improving Short Utterance Speaker Recognition by Modeling Speech Unit Classes", In IEEE/ACM Transactions on Audio, Speech, and Language Processing (Volume:PP, Issue:99) DOI:10.1109/TASLP 2016.
Thomas Fang Zheng, Rozi Askar, Renyu Wang, Lantian Li, "Overview of Biometric Recognition Technology", Journal of Information Security Research, 2(1): 12-26, Jan. 2016.
Thomas Fang Zheng, Lantian Li, Hui Zhang, Rozi Askar, "Overview of Voiceprint Recognition Technology and Applications", Journal of Information Security Research, 2(1): 44-57, Jan. 2016.
Xi Ma, Dong Wang, Javier Tejedor "Similar Word Model for Unfrequent Word Enhancement in Speech Recognition," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol 24, no. 10, 2016.
Meng Sun, Xiongwei Zhang, Hugo Van hamme, and Thomas Fang Zheng, "Unseen noise estimation using separable deep auto encoder for speech enhancement", IEEE/ACM Transactions on Audio, Speech, and Language Processing, pp. 93-104, Vol. 24, No. 1, Jan. 2016 (DOI 10.1109/TASLP.2015.2498101)
殷实，张之勇，王东，郑方，李银国, 基于深度神经网络的语音端点检测, 清华学报，出版中。
Liang Weiqian, Zheng Fang, Chen Chaoyang, Chen Gaojun, "GSPAP based sub-band adaptive feedback cancellation algorithm", Tsinghua Xuebao (in Chinese)
Liang Weiqian, Zheng Fang, Zheng Jiachun, Piao Zhigang, "Sub-band based adaptive noise reduction algorithm for improved speech intelligibility", Tsinghua Xuebao (in Chinese)
Askar Rouze, Shi Yin, Zhiyong Zhang, Dong Wang, Askar Humdulla, Fang Zheng, "THUYG THUYG-20: A free Uyghur Speech Database", Tsinghua Xuebao (in Chinese)
Jun Wang, Lantian Li, Dong Wang, Thomas Fang Zheng, Research on Generalization Property of Time-varying Fbank-weighted MFCC for I-vector Based Speaker Verification, Tsinghua Xuebao (in Chinese)
Fanhu Bie, Dong Wang, Thomas Fang Zheng, Research on Truncated Speech in Speaker Verification, Tsinghua Xuebao (in Chinese)
Rong Liu, Dong Wang, Chao Xing, Document Classification Based on Word Vectors, Tsinghua Xuebao (in Chinese)

Conference:

Dongxu Zhang, Dong Wang, "Relation Classification: CNN or RNN?", NLPCC-ICCPOL 2016
Dongxu Zhang, Tianyi Luo, Dong Wang, "Learning from LDA using Deep Neural Networks", NLPCC-ICCPOL 2016
Askar Rozi, Dong Wang, Lantian Li, Thomas Fang Zheng, " Language-aware PLDA for Multilingual Speaker Recognition", OCOCOSDA 2016 (best student paper)
Dong Wang, Zhiyuan Tang, Difei Tang, Qing Chen "OC16-CE80: A Chinese-English Mixlingual Database and A Speech Recognition Baseline", OCOCOSDA 2016 （best paper)
Chenghui Zhao, Lantian Li, Dong Wang and April Pu, " Local Training for PLDA in Speaker Verification", OCOCOSDA 2016
Lantian Li, Dong Wang, Thomoas Fang Zheng, "Max-Margin Metric Learning for Speaker Recognition", ISCSLP 2016
Lantian Li, Dong Wang, Thomas Fang Zheng, " Binary Speaker Embedding", ISCSLP 2016
Lantian Li, Dong Wang, Xiaodong Zhang, Thomas Fang Zheng, Panshi Jin, "System Combination for Short Utterance Speaker Recognition", APSIPA 2016
Zhiyuan Tang, Lantian Li, Dong Wang, "Multi-task Recurrent Model for Speech and Speaker Recognition", APSIPA 2016
Askar Rozi, Dong Wang, Lantian Li, Thomas Fang Zheng, "Feature Transformation For Speaker Verification Under Speaking Rate Mismatch Condition", APSIPA 2016
Aiting Liu, Chao Xing, Yang Feng, Dong Wang, "Learning Ordered Word Representations", APSIPA 2016
Dong Wang, Lantian Li, Difei Tang, Qing Chen, "AP16-OL7: A Multilingual Database for Oriental Languages and A Language Recognition Baseline", APSIPA 2016
Zhiyuan Tang, Lantian Li, Dong Wang, "Multi-task Recurrent Model for True Multilingual Speech Recognition", APSIPA 2016
Qinxin Wang, Tianyi Luo, Dong Wang, "Can Machine Generate Traditional Chinese Poetry? A Feigenbaum Test", BICS 2016
Dong Wang, Qiang Zhou, Amir Hussian, "Deep and Sparse Learning in Speech and Language Processing: An Overview", BICS 2016
Qixin Wang, Tianyi Luo, Dong Wang, Chao Xing, "Chinese Song Iambics Generation with Neural Attention-based Model", IJCAI 2016
Zhiyuan Tang, Dong Wang, Zhiyong Zhang, "Recurrent Neural Network Training with Dark Knowledge Transfer", ICASSP 2016
Mian Wang, Dong Wang, "VMF-SNE: EMBEDDING FOR SPHERICAL DATA", ICASSP 2016

Patent

郑方李蓝天邬晓钧别凡虎王军语音重放检测方法和装置 2016100073590 [D-ear] 交底书
郑方李蓝天邬晓钧王刚刘乐基于声纹识别、人脸识别以及同步活体检测的身份认证方法及系统 2015108119085 [D-ear] 交底书
王东邢超张之勇赵梦原一种面向混合语言的语音合成方法 [FreeNeb] 交底书
王东张之勇赵梦原黄伟明李国强，一种日语语音识别系统训练方法 [同方] 交底书
汪洋，王东，刘荣在线强化学习交易策略 [张露] 文底书
王东，白紫薇，冯洋，杜新凯，游士学，基于LSTM模型的现代文到古诗的转换技术 [汇联] 交底书
王东，张记袁，冯洋，杜新凯，游士学，一种基于共同语义空间的个性化音乐生成技术 [汇联] 交底书

Talks

Date	Speaker	Title	Materials
2016/1/4	Zhiyong Zhang	Parallel training,MPE and natural gradient	slides
2016/1/18	Dongxu Zhang	Memoryless Document Vector	slides
2016/3/14	Zhiyuan Tang	Oral presentation for "vMF-SNE: Embedding for Spherical Data"	slides
2016/3/28	Tianyi Luo	Review for Neural QA	slides
2016/4/11	Rong Liu	Recommendation in Youku	slides
2016/5/09	Miao Fan	Learning contextual embeddings of knowledge base with entity descriptions.	slides
2016/5/16	Yang Wang	Research on conversation thread detection.	slides
2016/5/20	Yang Wang & Maoning Wang	Research on portfolio selection.	slides1 slides2
2016/5/20	Zhiyuan Tang	ICASSP 2016 summary	slides
2016/5/23	Dong Wang	graphical model and neural model	slides papers
2016/8/02	Zhiyuan Tang	Visualizing, Measuring and Understanding Neural Networks: A Brief Survey	slides
2016/8/03	Yang Wang	Neural networks and genetic programming for financial forecasting	slides
2016/11/05	Yang Wang	Reinforcement Learning Models and Simulations	slides
2016/11/12	Yang Wang	Generative Adversarial Nets	slides
2016/11/22	Zhiyuan Tang	INTERSPEECH 2016 summary	slides
2016/11/30	Dong Wang	Deep and sparse learning in speech and language: an overview	slides

Database

name	type	size	dir	description
ASVspoof 2017	wav	-	corpora/lilt/ASVspoof 2017	ASVspoof 2017 data (from INTERSPEECH) collected by Lantian Li
CSLT_China300	wav	-	corpora/lilt/CSLT_China300	300 chinese speakers data for SRE collected by Lantian Li
CSLT_Replay	wav	-	corpora/lilt/CSLT_Replay	CSLT replay spoofing data collected by Lantian Li
CSLT_Digit	wav	-	corpora/lilt/CSLT_Digit	Digit string data for SRE collected by Lantian Li
Idiap_avspoof	wav	-	corpora/lilt/Idiap_avspoof	Idiap Avspoof data collected by Lantian Li
RedDots	wav	-	corpora/lilt/RedDots	RedDots data for SRE collected by Lantian Li
SITW	wav	-	corpora/lilt/SITW	SITW data for SRE collected by Lantian Li
VCTK	wav	-	corpora/lilt/VCTK	VCTK data for ASR and SRE collected by Lantian Li
VoxForge	wav	-	corpora/lilt/VoxForge	VoxForge data for ASR and SRE collected by Lantian Li
lyric	text	-	corpora/art/lyric	song lyric data collected by Jiyuan Zhang
poem	text	-	corpora/art/poem	Traditional Chinese poem data collected by Qixin Wang and Jiyuan Zhang
cnsong	text	-	corpora/art/song	Chinese Song sentences collected by Aiting Liu
factordb	transaction	-	corpora/finance/factordb	Factor database collected by Wangyang

Code

Code Name	Author	Description
THCHS30	Wang Dong, Zhang Xuewei	THCHS30 recipe link
THUYG20	Wang Dong, Zhang Xuewei	THUYG20 recipe link
viewExp	Wang Yang	The experiment platform of the Finance team link
vvBeam	Wang Dong	Beamforming using superdirective array link

@@ 第166行： / 第166行： @@
 {| class="wikitable"
 ! Code Name!! Author !! Description
+|-
+|THCHS30||Wang Dong, Zhang Xuewei|| THCHS30 recipe [https://github.com/kaldi-asr/kaldi link]
+|-
+|THUYG20||Wang Dong, Zhang Xuewei|| THUYG20 recipe [https://github.com/wangdong99/kaldi link]
 |-
 |viewExp||Wang Yang|| The experiment platform of the Finance team [http://git.cslt.org/finance/viewexp/tree/master link]
@@ 第172行： / 第176行： @@
 |-
 |}
-=Model and System=

“Delivery-2016”版本间的差异

2017年1月7日 (六) 16:47的版本

目录

Technical report

Book

Paper

Patent

Talks

Database

Code

导航菜单

个人工具

名字空间

变种

查看

操作

搜索

导航

工具