“Delivery-2016”版本间的差异
来自cslt Wiki
(→Database) |
|||
第137行: | 第137行: | ||
! name !! type!! size !! dir !! description | ! name !! type!! size !! dir !! description | ||
|- | |- | ||
− | |ASVspoof 2017||wav|| | + | |ASVspoof 2017||wav||348M||corpora/lilt/ASVspoof 2017|| ASVspoof 2017 data (from INTERSPEECH) collected by Lantian Li |
|- | |- | ||
− | |CSLT_China300||wav|| | + | |CSLT_China300||wav||8.8G||corpora/lilt/CSLT_China300|| 300 chinese speakers data for SRE collected by Lantian Li |
|- | |- | ||
− | |CSLT_Replay||wav|| | + | |CSLT_Replay||wav||13G||corpora/lilt/CSLT_Replay|| CSLT replay spoofing data collected by Lantian Li |
|- | |- | ||
− | |CSLT_Digit||wav|| | + | |CSLT_Digit||wav||1.3G||corpora/lilt/CSLT_Digit|| Digit string data for SRE collected by Lantian Li |
|- | |- | ||
− | |Idiap_avspoof||wav|| | + | |Idiap_avspoof||wav||21G||corpora/lilt/Idiap_avspoof|| Idiap Avspoof data collected by Lantian Li |
|- | |- | ||
− | |RedDots||wav|| | + | |RedDots||wav||1.2G||corpora/lilt/RedDots|| RedDots data for SRE collected by Lantian Li |
|- | |- | ||
− | |SITW||wav|| | + | |SITW||wav||19G||corpora/lilt/SITW|| SITW data for SRE collected by Lantian Li |
|- | |- | ||
− | |VCTK||wav|| | + | |VCTK||wav||11G||corpora/lilt/VCTK|| VCTK data for ASR and SRE collected by Lantian Li |
|- | |- | ||
− | |VoxForge||wav|| | + | |VoxForge||wav||11G||corpora/lilt/VoxForge|| VoxForge data for ASR and SRE collected by Lantian Li |
|- | |- | ||
|lyric||text||-||corpora/art/lyric|| song lyric data collected by Jiyuan Zhang | |lyric||text||-||corpora/art/lyric|| song lyric data collected by Jiyuan Zhang |
2017年1月8日 (日) 00:26的版本
Technical report
- TRP-20160036 Deep Q-trading, Yang Wang, Dong Wang, Shiyue Zhang,Yang Feng, Shiyao Li,and Qiang Zhou
- TRP-20160035 Moses中文操作手册, 冯洋
- TRP-20160034 The Present and Future of Speech Recognition, Dong Wang
- TRP-20160033 Memoryless Document Vector, Dongxu Zhang, Dong Wang
- TRP-20160032 Can Machine Generate Traditional Chinese Poetry? A Turing Test, Qixin Wang, Tianyi Luo, Dong Wang
- TRP-20160031 OC16-CE80: A Chinese-English Mixlingual Database and A Speech Recognition Baseline, Dong Wang, Zhiyuan Tang, Difei Tang and Qing Chen
- TRP-20160030 Collaborative Joint Training with Multi-task Recurrent Model for Speech and Speaker Recognition, Zhiyuan Tang, Lantian Li, Dong Wang and Ravichander Vipperla
- TRP-20160029 Multi-task Recurrent Model for Speech and Speaker Recognition, Zhiyuan Tang, Lantian Li and Dong Wang
- TRP-20160028 Multi-task Recurrent Model for True Multilingual Speech Recognition, Zhiyuan Tang, Lantian Li and Dong Wang
- TRP-20160027 Collaborative Learning for Language and Speaker Recognition, Lantian Li, Zhiyuan Tang, Dong Wang, Yang Feng and Shiyue Zhang
- TRP-20160026 Weakly Supervised PLDA Training, Lantian Li, Dong Wang, Yixiang Chen and Chenghui Zhao
- TRP-20160025 Local Training for PLDA in Speaker Verification, Chenghui Zhao, Lantian Li, Dong Wang and April Pu
- TRP-20160024 Decision Making Based on Cohort Scores for Speaker Verification, Lantian Li, Renyu Wang, Gang Wang, Caixia Wang and Thomas Fang Zheng
- TRP-20160023 AP16-OL7: A Multilingual Database for Oriental Languages and A Language Recognition Baseline, Dong Wang, Lantian Li, Difei Tang and Qing Chen
- TRP-20160022 System Combination for Short Utterance Speaker Recognition, Lantian Li, Dong Wang and Thomas Fang Zheng
- TRP-20160021 Improving Short Utterance Speaker Recognition by Modeling Speech Unit Classes, Lantian Li, Dong Wang, Chenhao Zhang and Thomas Fang Zheng
- TRP-20160020 Feature Transformation For Speaker Verification Under Speaking Rate Mismatch Condition, Askar Rozi, Lantian Li, Dong Wang and Thomas Fang Zheng
- TRP-20160019 Language-aware PLDA for Multilingual Speaker Recognition, Askar Rozi, Dong Wang, Lantian Li and Thomas Fang Zheng
- TRP-20160018 Chinese Song Iambics Generation with Neural Attention-based Model, Qixin Wang, Tianyi Luo, Dong Wang
- TRP-20160017 How to Config Kaldi nnet3 (in Chinese), Zhiyuan Tang and Dong Wang
- TRP-20160016 How to deploy joint training in Kaldi (in Chinese), Hang Luo, Zhiyuan Tang and Dong Wang
- TRP-20160015 How to run ASR system for Kazak (in Chinese), Ying Shi, Zhiyuan Tang,Nurbolat and Dong Wang
- TRP-20160014 Exploring The Role of Deep Speaker Features for Speaker Verification, Lantian Li and Dong Wang
- TRP-20160013 Sparse Discriminative Analysis and Its Application in Distraction Classification, Dong Wang
- TRP-20160012 i-vector system in Kaldi (in Chinese) Yixang Chen, Lantian Li and Dong Wang
- TRP-20160011 基于说话人信道相关的录音重放检测若干方法探究 Lantian Li, Yixiang Chen and Dong Wang
- TRP-20160010 Highly Restricted Keyword Selection Based on Sparse Analysis for Uyghur Text Categorization, Dong Wang, Askar Humdulla, Rayilam Parhat, Javier Tejedor
- TRP-20160009: RNNG Code User Guide, Shiyue Zhang and Yang Feng
- TRP-20160008: Different styles of poetry generation based on memory model, Jiyuan Zhang,Yang Feng and Dong Wang
- TRP-20160007: Distraction Detection Using Sparse Discriminative Analysis, Dong Wang and Guozhen Zhao
- TRP-20160006: Visualization Analysis for Recurrent Networks, Zhiyuan Tang, Ying Shi and Dong Wang
- TRP-20160005: Distributed Representation Learning for Knowledge Graphs with Entity Descriptions; Miao Fan, Qiang Zhou, Thomas Fang Zheng, Ralph Grishman
- TRP-20160004: A Review of Neural QA, Tianyi Luo and Dong Wang
- TRP-20160003: A study of Similar Word Model for Unfrequent Word Enhancement in Speech Recognition, Xi Ma, Dong Wang and Javier Tejedor
- TRP-20160002: Low-Frequency Words Embedding, Chao Xing, Yiqiao Pan, Dong Wang
- TRP-20160001: Max-margin metric learning for speaker recognition, Lantian Li, Chao Xing, Dong Wang
Book
Paper
Journal:
- Zhiyuan Tang, Lantian Li, Dong Wang, and Ravichander Vipperla, "Collaborative Joint Training with Multi-task Recurrent Model for Speech and Speaker Recognition", IEEE/ACM Transactions on Audio, Speech, and Language Processing. Preprint, 2016. (DOI: 10.1109/TASLP.2016.2639323)
- Linlin Wang, Jun Wang, Lantian Li, Thomas Fang Zheng, Frank K.Soong, "Improving Speaker Verfication Performance against Long-Term Speaker Variability", Speech Communication, 79 (2016), 14-29, Mar. 2016.
- Lantian Li, Dong Wang, Chenhao Zhang, Thomas Fang Zheng, "Improving Short Utterance Speaker Recognition by Modeling Speech Unit Classes", In IEEE/ACM Transactions on Audio, Speech, and Language Processing (Volume:PP, Issue:99) DOI:10.1109/TASLP 2016.
- Thomas Fang Zheng, Rozi Askar, Renyu Wang, Lantian Li, "Overview of Biometric Recognition Technology", Journal of Information Security Research, 2(1): 12-26, Jan. 2016.
- Thomas Fang Zheng, Lantian Li, Hui Zhang, Rozi Askar, "Overview of Voiceprint Recognition Technology and Applications", Journal of Information Security Research, 2(1): 44-57, Jan. 2016.
- Xi Ma, Dong Wang, Javier Tejedor "Similar Word Model for Unfrequent Word Enhancement in Speech Recognition," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol 24, no. 10, 2016.
- Meng Sun, Xiongwei Zhang, Hugo Van hamme, and Thomas Fang Zheng, "Unseen noise estimation using separable deep auto encoder for speech enhancement", IEEE/ACM Transactions on Audio, Speech, and Language Processing, pp. 93-104, Vol. 24, No. 1, Jan. 2016 (DOI 10.1109/TASLP.2015.2498101)
- 殷实, 张之勇, 王东, 郑方, 李银国, 基于深度神经网络的语音端点检测, 清华学报,出版中。
- Liang Weiqian, Zheng Fang, Chen Chaoyang, Chen Gaojun, "GSPAP based sub-band adaptive feedback cancellation algorithm", Tsinghua Xuebao (in Chinese)
- Liang Weiqian, Zheng Fang, Zheng Jiachun, Piao Zhigang, "Sub-band based adaptive noise reduction algorithm for improved speech intelligibility", Tsinghua Xuebao (in Chinese)
- Askar Rouze, Shi Yin, Zhiyong Zhang, Dong Wang, Askar Humdulla, Fang Zheng, "THUYG THUYG-20: A free Uyghur Speech Database", Tsinghua Xuebao (in Chinese)
- Jun Wang, Lantian Li, Dong Wang, Thomas Fang Zheng, Research on Generalization Property of Time-varying Fbank-weighted MFCC for I-vector Based Speaker Verification, Tsinghua Xuebao (in Chinese)
- Fanhu Bie, Dong Wang, Thomas Fang Zheng, Research on Truncated Speech in Speaker Verification, Tsinghua Xuebao (in Chinese)
- Rong Liu, Dong Wang, Chao Xing, Document Classification Based on Word Vectors, Tsinghua Xuebao (in Chinese)
Conference:
- Dongxu Zhang, Dong Wang, "Relation Classification: CNN or RNN?", NLPCC-ICCPOL 2016
- Dongxu Zhang, Tianyi Luo, Dong Wang, "Learning from LDA using Deep Neural Networks", NLPCC-ICCPOL 2016
- Askar Rozi, Dong Wang, Lantian Li, Thomas Fang Zheng, " Language-aware PLDA for Multilingual Speaker Recognition", OCOCOSDA 2016 (best student paper)
- Dong Wang, Zhiyuan Tang, Difei Tang, Qing Chen "OC16-CE80: A Chinese-English Mixlingual Database and A Speech Recognition Baseline", OCOCOSDA 2016 (best paper)
- Chenghui Zhao, Lantian Li, Dong Wang and April Pu, " Local Training for PLDA in Speaker Verification", OCOCOSDA 2016
- Lantian Li, Dong Wang, Thomoas Fang Zheng, "Max-Margin Metric Learning for Speaker Recognition", ISCSLP 2016
- Lantian Li, Dong Wang, Thomas Fang Zheng, " Binary Speaker Embedding", ISCSLP 2016
- Lantian Li, Dong Wang, Xiaodong Zhang, Thomas Fang Zheng, Panshi Jin, "System Combination for Short Utterance Speaker Recognition", APSIPA 2016
- Zhiyuan Tang, Lantian Li, Dong Wang, "Multi-task Recurrent Model for Speech and Speaker Recognition", APSIPA 2016
- Askar Rozi, Dong Wang, Lantian Li, Thomas Fang Zheng, "Feature Transformation For Speaker Verification Under Speaking Rate Mismatch Condition", APSIPA 2016
- Aiting Liu, Chao Xing, Yang Feng, Dong Wang, "Learning Ordered Word Representations", APSIPA 2016
- Dong Wang, Lantian Li, Difei Tang, Qing Chen, "AP16-OL7: A Multilingual Database for Oriental Languages and A Language Recognition Baseline", APSIPA 2016
- Zhiyuan Tang, Lantian Li, Dong Wang, "Multi-task Recurrent Model for True Multilingual Speech Recognition", APSIPA 2016
- Qinxin Wang, Tianyi Luo, Dong Wang, "Can Machine Generate Traditional Chinese Poetry? A Feigenbaum Test", BICS 2016
- Dong Wang, Qiang Zhou, Amir Hussian, "Deep and Sparse Learning in Speech and Language Processing: An Overview", BICS 2016
- Qixin Wang, Tianyi Luo, Dong Wang, Chao Xing, "Chinese Song Iambics Generation with Neural Attention-based Model", IJCAI 2016
- Zhiyuan Tang, Dong Wang, Zhiyong Zhang, "Recurrent Neural Network Training with Dark Knowledge Transfer", ICASSP 2016
- Mian Wang, Dong Wang, "VMF-SNE: EMBEDDING FOR SPHERICAL DATA", ICASSP 2016
Patent
- 郑方 李蓝天 邬晓钧 别凡虎 王军 语音重放检测方法和装置 2016100073590 [D-ear] 交底书
- 郑方 李蓝天 邬晓钧 王刚 刘乐 基于声纹识别、人脸识别以及同步活体检测的身份认证方法及系统 2015108119085 [D-ear] 交底书
- 王东 邢超 张之勇 赵梦原 一种面向混合语言的语音合成方法 [FreeNeb] 交底书
- 王东 张之勇 赵梦原 黄伟明 李国强,一种日语语音识别系统训练方法 [同方] 交底书
- 汪洋,王东,刘荣 在线强化学习交易策略 [张露] 文底书
- 王东,白紫薇,冯洋,杜新凯,游士学,基于LSTM模型的现代文到古诗的转换技术 [汇联] 交底书
- 王东,张记袁,冯洋,杜新凯,游士学, 一种基于共同语义空间的个性化音乐生成技术 [汇联] 交底书
Talks
Date | Speaker | Title | Materials | On duty |
---|---|---|---|---|
2016/1/4 | Zhiyong Zhang | Parallel training,MPE and natural gradient | slides | |
2016/1/18 | Dongxu Zhang | Memoryless Document Vector | slides | |
2016/3/14 | Zhiyuan Tang | Oral presentation for "vMF-SNE: Embedding for Spherical Data" | slides | |
2016/3/28 | Tianyi Luo | Review for Neural QA | slides | |
2016/4/11 | Rong Liu | Recommendation in Youku | slides | |
2016/5/09 | Miao Fan | Learning contextual embeddings of knowledge base with entity descriptions. | slides | |
2016/5/16 | Yang Wang | Research on conversation thread detection. | slides | |
2016/5/20 | Yang Wang & Maoning Wang | Research on portfolio selection. | slides1 slides2 | |
2016/5/20 | Zhiyuan Tang | ICASSP 2016 summary | slides | |
2016/5/23 | Dong Wang | graphical model and neural model | slides papers | |
2016/8/02 | Zhiyuan Tang | Visualizing, Measuring and Understanding Neural Networks: A Brief Survey | slides | |
2016/8/03 | Yang Wang | Neural networks and genetic programming for financial forecasting | slides | |
2016/11/05 | Yang Wang | Reinforcement Learning Models and Simulations | slides | |
2016/11/12 | Yang Wang | Generative Adversarial Nets | slides | |
2016/11/22 | Zhiyuan Tang | INTERSPEECH 2016 summary | slides | |
2016/11/30 | Dong Wang | Deep and sparse learning in speech and language: an overview | slides |
Database
name | type | size | dir | description |
---|---|---|---|---|
ASVspoof 2017 | wav | 348M | corpora/lilt/ASVspoof 2017 | ASVspoof 2017 data (from INTERSPEECH) collected by Lantian Li |
CSLT_China300 | wav | 8.8G | corpora/lilt/CSLT_China300 | 300 chinese speakers data for SRE collected by Lantian Li |
CSLT_Replay | wav | 13G | corpora/lilt/CSLT_Replay | CSLT replay spoofing data collected by Lantian Li |
CSLT_Digit | wav | 1.3G | corpora/lilt/CSLT_Digit | Digit string data for SRE collected by Lantian Li |
Idiap_avspoof | wav | 21G | corpora/lilt/Idiap_avspoof | Idiap Avspoof data collected by Lantian Li |
RedDots | wav | 1.2G | corpora/lilt/RedDots | RedDots data for SRE collected by Lantian Li |
SITW | wav | 19G | corpora/lilt/SITW | SITW data for SRE collected by Lantian Li |
VCTK | wav | 11G | corpora/lilt/VCTK | VCTK data for ASR and SRE collected by Lantian Li |
VoxForge | wav | 11G | corpora/lilt/VoxForge | VoxForge data for ASR and SRE collected by Lantian Li |
lyric | text | - | corpora/art/lyric | song lyric data collected by Jiyuan Zhang |
poem | text | - | corpora/art/poem | Traditional Chinese poem data collected by Qixin Wang and Jiyuan Zhang |
cnsong | text | - | corpora/art/song | Chinese Song sentences collected by Aiting Liu |
factordb | transaction | - | corpora/finance/factordb | Factor database collected by Wangyang |
Code
Code Name | Author | Description |
---|---|---|
THCHS30 | Wang Dong, Zhang Xuewei | THCHS30 recipe link |
THUYG20 | Wang Dong, Zhang Xuewei | THUYG20 recipe link |
viewExp | Wang Yang | The experiment platform of the Finance team link |
vvBeam | Wang Dong | Beamforming using superdirective array link |
vvEngine | Zhiyong Zhang | ASR Engine link |
vvPoem | Jiyuan Zhang | Vivi Poem generatoin http://git.cslt.org/vivi/vvpoem link] |
vvQA | Chao Xing | QA system link |