“Weekly reading”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
 
(17位用户的144个中间修订版本未显示)
第1行: 第1行:
*Location: FIT-1-304
 
  
 +
'''清华大学语音语言中心内部学习会
  
{| class="wikitable"
+
'''时间: 每周五晚19:30'''
! Date !! Speaker!! Title !! Materials !! On duty
+
|-
+
| 2012/08/27  ||Dong Wang  || Heterogeneous Convolutive Non-negative Sparse Coding ||[[媒体文件:Heterogeneous_convolutive_non-negative_sparse_coding.pdf|slides]] [http://homepages.inf.ed.ac.uk/v1dwang2/public/pdf/inerspeech2012-hetero.pdf paper] ||
+
|-
+
|2012/09/03  ||NO Meeting|| || ||
+
|-
+
|2012/09/10  || NO Meeting|| || ||
+
|-
+
|2012/09/17  ||WALEED ABDULLA||Auditory Based Feature Vectors for Speech Recognition ||[[媒体文件:AuditoryBasedFeatureVectors.pdf|slides]]||范淼
+
|-
+
| rowspan="2"|2012/09/24  ||刘超|| N-gram FST indexing for Spoken Term Detection || [[媒体文件:120924-N_gram_FST_indexing_for_Spoken_Term_Detection-LC-0.pdf|slides]] ||尹聪
+
|-
+
|范淼||Micro-blogging, Wikipedia, Folksonomy, What's Next? ||[[媒体文件:120924-Micro-blogging, Wikipedia, Folksonomy, What's Next-FM--01-FM-.pdf|slides]] ||
+
|-
+
| 2012/10/08 ||NO Meeting|| || ||
+
|-
+
| 2012/10/15  ||NO Meeting|| || ||
+
|-
+
|2012/10/22||Wu Xiaojun||speaker recognition in CSLT ||[[媒体文件:VPR_in_CSLT.pdf|slides]]||卡尔
+
|-
+
| rowspan="1"|2012/10/29  ||王军||An overview of Automatic Speaker Diarization Systems || [[媒体文件:121027-Speaker Diarization-WJ.pdf|slides]] ||别凡虎
+
|-
+
| rowspan="1"|2012/11/05  ||别凡虎||Experiments on Emotional Speaker Recognition||[[媒体文件:121104-Experiments_on_Emotional_Speaker_Recognition-BFH.pdf|slides]] ||刘超
+
|-
+
| rowspan="1"|2012/11/12  ||唐国瑜||Statistical Word Sense Improves Document Clustering ||[[媒体文件:121112_Statistical_Word_Sense_Improves_Document_Clustering_TGY.pdf‎ |slides]]||邱晗
+
|-
+
| rowspan="1"|2012/11/19  ||张陈昊||TDSR with Long-term Features Based on Functional Data Analysis||[[媒体文件:121118-ISCSLP-FDA_SR-ZCH.pdf|slides]] ||王俊俊
+
|-
+
| rowspan="1"|2012/11/26  ||王琳琳||Time-Varying Speaker Recognition: An Introduction||[[媒体文件:121126-Time_Varying_Speaker_Recognition_I-Wll.pdf‎|slides]] ||龚宬
+
|-
+
| rowspan="1"|2012/12/03  ||No meeting|| || ||
+
|-
+
| rowspan="1"|2012/12/10  ||No meeting|| || ||
+
|-
+
| rowspan="1"|2012/12/17  ||No meeting|| || ||
+
  
|-
+
'''地点: 1区303'''
| rowspan="1"|2012/01/07  || || || ||
+
|-
+
|2012/01/07  ||王军||基于DF-MAP的说话人模型训练方法||[[媒体文件:130107-基于DFMAP的说话人模型训练方法-WJ.pdf|slides]] ||唐国瑜
+
|-
+
| rowspan="1"|2012/01/14  ||王东|| Computing in CSLT ||[[媒体文件:Computing_in_CSLT.pdf|slides]] ||王琳琳
+
|-
+
  
 +
 +
{| class="wikitable"
 +
! Date !! Speaker!! Title !! Materials
 
|-
 
|-
| rowspan="1"|2013/03/04  ||王军||Sequential Adaptive Learning for Speaker Verification ||[[媒体文件:130301-Sequential adaptive learning for speaker verification-WJ.pdf|slides]] ||别凡虎
+
|  ||  || PPT模板 ||[[媒体文件:Weeklyreading_template.rar]]
|-
+
| rowspan="1"|2013/03/11  || Du Jinle|| VAD stuff || ||
+
|-
+
| rowspan="1"|2013/03/18  || || || ||
+
|-
+
| rowspan="1"|2013/03/25  || || || ||
+
|-
+
| rowspan="1"|2013/04/01  || || || ||
+
|-
+
| rowspan="1"|2013/04/08  || 张陈昊|| A Fishervoice based Feature Fusion Method for SUSR ||[[媒体文件:130408-FisherVoice-ZCH.pdf|slides]] ||谢仲达
+
|-
+
| rowspan="1"|2013/04/15  ||龚宬|| An Exploration on Influence Factors of VAD's Performance in Speaker Recognition ||[[媒体文件:130415-An_Exploration_on_Influence_Factors_of_VAD-GC.pdf|slides]] ||
+
|-
+
| rowspan="1"|2013/04/22  ||王俊俊 || Understanding the Query: THCIB and THUIS at NTCIR-10 Intent Task ||[[媒体文件:130422-Understanding_the_Query-WJJ.pdf|slides‎]] ||
+
|-
+
| rowspan="1"|2013/04/29  || || || ||
+
|-
+
| rowspan="1"|2013/05/06  ||别凡虎 ||MLLR on Emotional Speaker Recognition ||[[媒体文件:130506-MLLR on Emotional Speaker Recognition-BFH.pdf|slides]] ||
+
|-
+
| rowspan="1"|2013/05/13  ||刘超 || The Use of Deep Neural Network for Speech Recognition || [[媒体文件:130513-the_use_of_dnn_for_asr-lc.pdf|slides]] ||
+
|-
+
| rowspan="1"|2013/05/20  || || || ||
+
|-
+
| rowspan="1"|2013/05/27  ||王琳琳|| 说话人识别中的时变鲁棒性问题研究 || [[媒体文件:130527-TVSV-Wll.pdf|slides]] ||
+
|-
+
| rowspan="1"|2013/06/03  ||王俊俊|| 汉语搜索结果聚类系统研究与实现 || [[媒体文件:130601-毕业答辩-02-WJJ.pdf|slides]] ||
+
|-
+
| rowspan="1"|2013/06/10  || || || ||
+
|-
+
| rowspan="1"|2013/06/17  ||范淼 || Relation Extraction ||[[媒体文件:130617-relation_extraction-fm.pdf|slides]] ||
+
|-
+
| rowspan="1"|2013/06/24  ||唐国瑜 || Incorporating Statistical Word Senses in Topic Model  ||[[媒体文件:130624_Incorporating Statistical Word Senses in Topic Model_TGY.pdf|slides]] ||
+
|-
+
| rowspan="1"|2013/07/01  || || || ||
+
|-
+
| rowspan="1"|2013/07/08  ||  || || ||
+
|-
+
| rowspan="1"|2013/07/15  || || || ||
+
|-
+
| rowspan="1"|2013/09/09  ||王东 || Research Frontier in Speech Technology||[[媒体文件:Research Frontier in Speech Technology.pdf|slides]] ||
+
|-
+
| rowspan="1"|2013/09/16  || || || ||
+
|-
+
| rowspan="1"|2013/09/23  || || || ||
+
|-
+
| rowspan="1"|2013/09/30  || || || ||
+
|-
+
| rowspan="1"|2013/10/07  || || || ||
+
|-
+
| rowspan="1"|2013/10/14  || || || ||
+
|-
+
| rowspan="1"|2013/10/21  ||范淼 ||Transduction Classification with Matrix Completion (中文报告)||[[媒体文件: Transduction_Classifiction_with_Matrix_Completion.pdf‎|slides]] [http://pages.cs.wisc.edu/~jerryzhu/pub/mc4ssl_FINAL.pdf paper]|| 李蓝天
+
|-
+
| rowspan="1"|2013/10/28  || || || ||
+
|-
+
| rowspan="1"|2013/11/04  || 王军 || 基于i-vector的intersession补偿及打分方法(综述) || [[媒体文件:131104-ivecto下intersession补偿及打分方法--01-WJ-.pdf‎|slides]]||
+
|-
+
| rowspan="1"|2013/11/11  ||张陈昊 ||PLDA介绍及PLDA在说话人识别中的应用 ||[[媒体文件:PLDA.pdf|slides]] || 唐国瑜
+
|-
+
| rowspan="1"|2013/11/18  ||别凡虎 ||i-vector理论介绍(讨论)||[[媒体文件:131118-i-vector_and_GMM-UBM-BFH.pdf|slides]]‎  ||王军
+
|-
+
| rowspan="1"|2013/11/25  ||刘超 || Pruning Neural Networks By Optimal Brain Damage(综述)||[[媒体文件:131125-OBD-LC-01.pdf|slides]] ||范淼
+
|-
+
| rowspan="1"|2013/12/02  ||范淼 ||Distant Supervision for Relation Extraction with Matrix Completion (英文报告)||[[媒体文件:131202-DRMC-FM-01.pdf|slides]] || 李蓝天
+
|-
+
| rowspan="1"|2013/12/09  || Dong Wang|| Introduction to the HMM-based speech synthesis||[http://hts.sp.nitech.ac.jp/archives/2.2/HTS_Slides.zip slides] ||
+
|-
+
| rowspan="1"|2013/12/16  ||张陈昊 ||语音研究中的基元介绍 ||[[媒体文件:131215-Phonology-ZCH.pdf|slides]]  ||
+
|-
+
| rowspan="1"|2013/12/23  || Dong Wang|| Introduction to the HMM-based speech synthesis (2)||[http://hts.sp.nitech.ac.jp/archives/2.2/HTS_Slides.zip slides] ||
+
|-
+
| rowspan="1"|2013/12/23  || || || ||
+
|-
+
| rowspan="1"|2013/12/30  ||刘荣 || continuous space language model||[[媒体文件:Cslm-cslt.pdf|slides]]  ||刘超
+
|-
+
| rowspan="1"|2014/01/06  || || || ||
+
|-
+
| rowspan="1"|2014/01/13  || || || ||
+
|-
+
| rowspan="1"|2014/01/20  || || || ||
+
|-
+
| rowspan="1"|2014/02/24  || || || ||
+
|-
+
| rowspan="1"|2014/03/03  || || || ||
+
|-
+
| rowspan="1"|2014/03/10  ||范淼|| Distant Supervision for Information Extraction (英文报告)|| || 李蓝天
+
|-
+
| rowspan="1"|2014/03/17  ||唐国瑜 || Topic Models Incorporating Statistical Word Senses || [[媒体文件:TMISWS_For_CICLing2014.pdf|slides]]||
+
|-
+
| rowspan="1"|2014/03/24  ||孟祥涛 || Noisy training for Deep Neural Networks|| ||
+
|-
+
| rowspan="1"|2014/03/31  ||范淼|| Translating Embeddings for Modeling Multi-relational Data (中文报告) || [https://www.hds.utc.fr/everest/lib/exe/fetch.php?id=en%3Atranse&cache=cache&media=en:cr_paper_nips13.pdf paper]||李蓝天
+
|-
+
| rowspan="1"|2014/04/07  || || || ||
+
|-
+
| rowspan="1"|2014/04/14  || Wang Jun|| I-vector and PLDA in depth ||[[媒体文件:131104-ivector-microsoft-wj.pdf|slides]]  ||
+
|-
+
| rowspan="1"|2014/04/21  || 邱晗||汉语事件句式规范化处理 ||[[媒体文件:140421-汉语事件句式规范化-QH.pdf‎|slides]] ||
+
|-
+
| rowspan="1"|2014/04/28  || 唐国瑜|| Some papers in CICLing2014 ||[[媒体文件:Some_papers_in_CICling2014.pdf|slides]]  ||刘超
+
|-
+
| rowspan="1"|2014/05/05  || || || ||
+
|-
+
| rowspan="1"|2014/05/12  || 卡尔|| paper introduction || [[媒体文件:Acoustic Factor Analysis.pdf|slides]] || 邱晗
+
|-
+
| rowspan="2"|2014/05/19  || 邱晗|| 汉语事件句式CCG推导树重构 ||[[媒体文件:140519-CCG_reConstruction.pdf‎|slides]]‎|| 卡尔
+
|-
+
|Liu Chao|| master proposal: sparse and deep neural networks || [[媒体文件:140519-proposal-LC-01.pdf|slides]] ||
+
|-
+
| rowspan="1"| || Liu Chao|| 2nd master proposal: sparse and deep neural networks|| ||
+
|-
+
| rowspan="1"|2014/06/16  || 别凡虎 || Truncated Wave based VPR and Some Recent Work || [[媒体文件:140614-Truncated_Speech_based_VPR.pdf‎|slides]]‎ || 别凡虎
+
|-
+
| rowspan="1"|2014/06/23  || 别凡虎 || Block-wise training for I-vector || [[媒体文件:140623-Block-wise training for I-vector.pdf‎|slides]]‎ || 别凡虎
+
|-
+
| rowspan="1"| 2014/07/07||王军 ||Discriminative Scoring for Speaker Recognition Based on I-vectors || [[媒体文件:140707-work_report.pdf|slides]]|| 王军
+
|-
+
| rowspan="1"| 2014/09/01|| || || ||
+
|-
+
| rowspan="1"|2014/09/09 ||别凡虎 ||Reseach on Truncated Wave based VPR||[[媒体文件:140909-Truncated Speech based VPR.pdf|slides]] || 别凡虎
+
|-
+
| rowspan="1"| 2014/09/15|| || || ||
+
|-
+
| rowspan="1"|2014/09/22  || Miao Fan|| Large-scale Entity Relation Extraction based on Low-dimensional Representations (中文报告,博士开题)
+
||[[媒体文件:基于低维表示的大规模实体关系挖掘技术.pdf‎|slides]] || Lan TianLi
+
|-
+
| rowspan="1"| 2014/09/29 || || || ||
+
|-
+
| rowspan="1"|2014/10/13  || Miao Fan|| The Frontier of Knowledge Embedding (英文报告)|| [[媒体文件:The_Frontier_of_Knowledge_Embedding.pdf‎|slides]]|| Lan TianLi
+
|-
+
| rowspan="1"|2014/10/20  || || || ||
+
|-
+
| rowspan="1"|2014/10/27  || Li Yi || Phonemes, Features, and Syllables: Converting Onset and Rime Inventories to Consonants and Vowels||[[媒体文件:Lanzhou Phonemes, Features, and Syllables- fianl.pdf|paper]] [[媒体文件:Syllables and phonemes - 20141027.pdf|slides]]||
+
|-
+
| rowspan="1"|2014/11/3   || 米吉提|| Automatic Speech Recognition of Agglutinative Language based on Lexicon Optimization||[[媒体文件:Mijit-slides-清华大学-2014-11-3.pdf|slides]] ||
+
|-
+
| rowspan="1"|2014/11/10 || || || ||
+
|-
+
| rowspan="1"|2014/11/17  ||Dong Wang || Highly restricted keyword spotting for Uyghur using sparse analysis|| [[媒体文件:Highly Restricted Keyword Selection Based on Sparse Analysis.pdf|slides]]||
+
|-
+
| rowspan="1"|2014/11/24  || || || ||
+
|-
+
| rowspan="1"|2014/12/1  ||ZhongDa Xie ||Incorporating Fine-Grained Ontological Relations in Medical Document Ranking || [[媒体文件:Fine-grained_relations.pdf|slides]]|| Lantian Li
+
|-
+
| rowspan="1"|2014/12/8  || || || ||
+
|-
+
| rowspan="1"|2014/12/15  || 唐国瑜 || 跨语言话题分析关键技术研究 ||[[媒体文件:141205-答辩-TGY.pdf|slides]] ||
+
|-
+
| rowspan="1"|2014/12/22  || || || ||
+
|-
+
| rowspan="1"|2014/12/29  || Askar || Language Mismatch in Speaker Recognition System||[[媒体文件:141229--askar.pdf|slides]] ||
+
|-
+
| rowspan="1"|2015/1/5  ||Lantian Li || Deep Neural Networks for Speaker Recognition || [[媒体文件:150104_Deep_Neural_Networks_for_Speaker_Recognition_LLT.pdf|slides]]||
+
|-
+
| rowspan="1"|2015/1/12  || || || ||
+
|-
+
| rowspan="1"|2015/1/19  || Dong Wang || Machine Learning Paradigms for Speech Recognition||[[媒体文件:Machine Learning Paradigms for Speech Recognition.pdf|slides]]  [http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6423821 paper] ||
+
|-
+
| rowspan="1"|2015/1/26  || Chen Guorong || Information Transmission and Distribution on Web ||[[媒体文件:An_introduction_of_complex_network1.pdf|slides]] ||
+
|-
+
| rowspan="1" |2015/3/9 || Dong Wang || Joint Deep Learning || [[媒体文件:Joint Deep Learning.pdf|slides]] ||
+
|-
+
| rowspan="1"|2015/3/16  || Dongxu Zhang || Knowledge learning from text data and knowledge bases || [[媒体文件:Joint Deep Learning.pdf|slides]] ||
+
|-
+
| rowspan="1"|2015/4/13  || Xuewei Zhang || Lasso-based Reverberation Suppression In Automatic Speech Recognition || [[媒体文件:Lasso-based Reverberation Suppression In Automatic Speech Recognition.pdf|slides]] ||
+
|-
+
| rowspan="1"|2015/5/11  || Dong Wang ||ASR and SID Research Frontier ||[[媒体文件:ASR and SID Research Frontier.pdf|slides]] ||
+
|-
+
| rowspan="1"|2015/11/23  || Zhiyuan Tang|| CTC learning|| [[媒体文件:CTC.pdf|slides]] ||
+
|-
+
| rowspan="1"|2015/11/30  || Mengyuan Zhao|| CNN-based music removal|| [[媒体文件:Music Removal by Convolutional Denoising.pdf | slides]] ||
+
|-
+
| rowspan="1"|2015/12/3  || Zhiyuan Tang|| Networks of Memory|| [[媒体文件:Memory_net.pdf|slides]] ||
+
|-
+
| rowspan="1"|2015/12/7  || Yiqiao Pan|| Document Classification with Spherical Word Vectors||[[媒体文件:Document Classification with Spherical Word Vectors.pdf|slides]] ||
+
|-
+
| rowspan="1"|2015/12/14  || Dong Wang || Transfer Learning for Speech and Language Processing ||[[媒体文件:Transfer_Learning_for_Speech_and_Language_Processing.pdf|slides]] ||
+
|-
+
| rowspan="1"|2015/12/21  || Qixin Wang || Attention for poem generation ||[[媒体文件:Ijcai 2016.pptx|slides]] ||
+
|-
+
| rowspan="1"|2015/12/28  || Lantian Li || Max-margin metric learning for speaker recognition || [[媒体文件:Max-margin-Metric-Learning.pdf|slides]]||
+
|-
+
| rowspan="1"|2016/1/4  || Zhiyong Zhang || Parallel training,MPE and natural gradient||[[媒体文件:20160104_张之勇_Large-scale Parallel Training in Speech Recognition.pdf|slides]]|| 
+
|-
+
| rowspan="1"|2016/1/18  || Dongxu Zhang || Memoryless Document Vector ||[[媒体文件:Memoryless_document_vector.pdf|slides]]||
+
|-
+
| rowspan="1"|2016/3/14  || Zhiyuan Tang|| Oral presentation for "vMF-SNE: Embedding for Spherical Data"|| [[媒体文件:embedding.pdf|slides]] || 
+
|-
+
| rowspan="1"|2016/3/28  || Tianyi Luo || Review for Neural QA || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/2/29/CSLT_Weekly_Report--20160328.pdf slides] || 
+
|-
+
| rowspan="1"|2016/4/11  || Rong Liu || Recommendation in Youku || [http://cslt.riit.tsinghua.edu.cn/mediawiki/index.php/%E6%96%87%E4%BB%B6:Cslt%E5%AE%9E%E9%AA%8C%E5%AE%A4%E4%BA%A4%E6%B5%81.pptx slides] || 
+
|-
+
| rowspan="1"|2016/5/09 || Miao Fan || Learning contextual embeddings of knowledge base with entity descriptions.|| [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/9/9c/Techreport_CSLT_2016_M.F..pdf slides]  ||
+
|-
+
| rowspan="1"|2016/5/16 || Yang Wang || Research on conversation thread detection. || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/b/bb/%E6%B1%AA%E6%B4%8B-%E6%AF%95%E8%AE%BE-CSLT.pdf slides]  ||
+
|-
+
| rowspan="1"|2016/5/20 || Yang Wang &  Maoning Wang || Research on portfolio selection. || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/8/89/%E6%B1%AA%E6%B4%8B-%E9%87%91%E8%9E%8D%E7%AC%AC%E4%B8%80%E6%AC%A1%E5%88%86%E4%BA%AB.pdf slides1]  [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/b/bb/%E6%B1%87%E6%8A%A5_%E8%B5%84%E4%BA%A7%E7%BB%84%E5%90%88%E4%B8%AD%E5%87%A0%E4%B8%AA%E8%AF%84%E4%BB%B7%E6%8C%87%E6%A0%87%E7%9A%84%E8%A7%A3%E9%87%8A.pdf slides2]||
+
|-
+
| rowspan="1"|2016/5/20  || Zhiyuan Tang || ICASSP 2016 summary || [[媒体文件:Note icassp16.pdf|slides]] ||
+
|-
+
| rowspan="1"|2016/5/23 || Dong Wang || graphical model and neural model || [[媒体文件:Graphic Model and Neural Model.pdf|slides]] [[媒体文件:Generative-Pdf.rar|papers]] ||
+
|-
+
| rowspan="1"|2016/8/02 || Zhiyuan Tang || Visualizing, Measuring and Understanding Neural Networks: A Brief Survey|| [[媒体文件:Nn analysis.pdf|slides]] ||
+
|-
+
| rowspan="1"|2016/8/03 || Yang Wang || Neural networks and genetic programming for financial forecasting || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/7/79/GeneticNN.pdf slides] ||
+
|-
+
| rowspan="1"|2016/11/05 || Yang Wang || Reinforcement Learning Models and Simulations || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/c/ca/RRL_and_sim.pdf slides] ||
+
|-
+
| rowspan="1"|2016/11/08 || April Pu || SOFTWARE DEVELIPMENT METHODOLOGIES || [http://wangd.cslt.org/talks/pdf/april_software.pptx slides] ||
+
|-
+
| rowspan="1"|2016/11/12 || Yang Wang || Generative Adversarial Nets || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/c/c9/Generative_adversarial_network.pdf slides] ||
+
|-
+
| rowspan="1"|2016/11/22 || Zhiyuan Tang || INTERSPEECH 2016 summary || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/6/65/Interspeech16_review.pdf slides] ||
+
|-
+
| rowspan="1"|2016/11/30 || Dong Wang || Deep and sparse learning in speech and language: an overview || [http://wangd.cslt.org/talks/pdf/bics2016.pptx slides] ||
+
|-
+
| rowspan="1"|2017/2/17 || Yang Wang || Review understanding deep learning requires rethinking generalization || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/3/3b/Review_understanding_deep_learning_requires_rethinking_generalization.pdf slides] ||
+
|-
+
| rowspan="1"|2017/6/5 || Dong Wang || Deep speech factorization || [http://wangd.cslt.org/talks/pdf/Deep-Speech-Factorization.pdf slides] ||
+
|-
+
| rowspan="1"|2017/6/8 || Shiyue Zhang || Convolutional Sequence to Sequence Learning  || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/f/f3/Conv_seq2seq.pptx slides] ||
+
|-
+
| rowspan="1"|2017/6/12 || Shiyue Zhang || Memory-augmented Neural Machine Translation || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/3/36/Memory-augmented_Neural_Machine_Translation_.pptx slides] ||
+
|-
+
| rowspan="1"|2017/6/21 || Shiyue Zhang || Attention Is All You Need  || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/6/68/Attention_is_all_you_need.pptx slides] ||
+
 
|-
 
|-
| rowspan="1"|2017/6/26 || Jiyuan Zhang || Chinese poem generation using neural model  || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/5/50/Flexible_and_Creative_Chinese_Poetry_Generation_Using_Neural_Memory_.pptx slides] ||
+
| 2021/04/01  ||Haoran Sun    || Zeus code regularization ||[[媒体文件:代码规范.pdf]]
 
|-
 
|-
| rowspan="1"|2017/6/21 || Miao Zhang || Speaker recognition on cough,laugh and wei  ||  
+
| 2021/05/20  ||Chen Chen    || Overview of speech enhancement|| [[媒体文件:Speech_enhancement.pdf]]
[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/f/f6/Zm_cough.pdf slides]
+
||
+
 
|-
 
|-
| rowspan="1"|2017/7/10 || Aodong Li || Enhanced Neural Machine Translation by Learning from Draft  || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/c/ca/Learning_from_draft.pptx slides] ||
+
| 2021/05/27  ||Di Wang      || Secret of 'hard trials' || [[媒体文件:Secret_of_hard_trials.pdf]]
 
|-
 
|-
| rowspan="1"|2017/7/17 || Lantian Li || Study on Speaker Recognition  || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/e/ec/170716-Study_on_SRE.pdf slides] ||
+
| 2021/06/10  ||Jingxin Shen  ||Expriments about thermal to RGB face synthesis with cycleGan and pix2pix || [[媒体文件:Expriments about thermal to RGB face synthesis with cycleGan and pix2pix.pdf]]
 
|-
 
|-
| rowspan="1"|2018/12/6 || Xiuqi Jiang || Meta-Learning and Zero-Shot Learning  || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/1/18/181205_Meta-Learning_and_Zero-Shot_Learning_JXQ.pdf slides] ||
+
| 2021/06/17  ||Yang Zhang    || NIPS2020: Long-Tailed Classification by Keeping the Good and Removing the Bad Momentum Causal Effect || [[媒体文件:long-tail.pdf]]
 
|-
 
|-
| rowspan="1"|2018/12/12 || Dan He || Tensor factorization neural net  || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/3/3d/Tensor_factorization_neural_net.pdf slides] ||
+
| 2021/07/08  ||Tiankai Zhi  || Some experiments on stargan ||[[媒体文件:Some experiments on stargan.pdf]]
 
|-
 
|-
| rowspan="1"|2018/12/26 || Dong Wang || Towards deep statistical speaker representation  || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/4/48/V.pdf slides] ||
+
| 2021/07/15  ||Jiao Han      || MG experiments based on ASV system || [[媒体文件:MG experiments based on ASV system..pptx]]  
 
|-
 
|-
| rowspan="1"|2019/01/04 || Dong Wang || Speech in NIPS 2017/2018  || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/c/c8/Speech_in_NIPS_2017.pdf slides] ||
+
| 2021/07/22  ||Zixi Yan & Sirui Li || Unsupervised Speech Recognition || [[媒体文件:Unsupervised_Speech_Recognition.pdf]]
 
|-
 
|-
| rowspan="1"|2019/07/17 || Dong Wang || Deep Feature Learning and Normalization for Speaker Recognition  || [http://wangd.cslt.org/talks/pdf/india.pdf slides] ||
+
| 2021/07/29  ||Pengqi Li    || A Simulation Study on 􏰛􏰜 Ro􏰛bust MAML || [[媒体文件:A Simulation Study on 􏰛􏰜 Ro􏰛bust MAML.pdf]]
 
|-
 
|-
| rowspan="1"|2019/08/19 || Sitong Cheng & Pengyuan Zhang || Periodic Report of Celebrity Video Data Collection.  || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/0/08/C-STAR.pdf slides] ||
+
| 2021/08/12  ||Qingyang Zhu || Noise-aware method for Speech Enhancement || [[媒体文件:Noise-aware method for Speech Enhancement.pdf]]
 
|-
 
|-
| rowspan="1"|2019/08/19 || Dong Wang|| Continuous Learning for Neural Nets || [[媒体文件:Continuous Learning for Neural Nets.pdf|slides]]||
+
| 2021/08/12  ||Weida Liang  || Unsupervised Audio-Visual Synthesis via Exemplar Autoencoders  || [[媒体文件:Bi-weekly_report_Liangwd.pdf]]
 
|-
 
|-
| rowspan="1"|2019/09/11 || Dong Wang || Language Recognition in ICASSP 2019  || [http://wangd.cslt.org/talks/pdf/LRE-ICASSP-2019.pdf slides] ||
+
| 2021/08/19  ||Di Wang     || Inter Dataset Variability Compensation ||   [[媒体文件:Inter_dataset_variability_compensation.pdf]]
 
|-
 
|-
| rowspan="1"|2019/09/11 || Sitong Cheng || Language Recognition in Interspeech 2019  || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/a/a9/Language_Recognition_in_Interspeech_2019.pdf slides] ||
+
| 2021/09/02  ||Tiankai Zhi  || One Shot VC || [[媒体文件:One_shot_VC.pdf]]
 
|-
 
|-
| rowspan="1"|2019/10/14 || Haoran Sun || Dimension Reduction  || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/7/7b/DimensionReduction.pdf slides] ||
+
| 2021/09/09  ||Jingxin Shen || Thermal Speaking || [[媒体文件:Thermal_Speaking_2021.pdf]]
 
|-
 
|-
| rowspan="1"|2019/10/27 || Dong Wang || Back to Matrix  || [[媒体文件:Back to Matrix.pdf|slides]] ||
+
| 2021/09/23  ||Sirui Li & Zixi Yan || Wav2vec-U Experimental Report || [[媒体文件:Wav2vec-U_experimental_report.pdf ]]
 
|-
 
|-
| rowspan="1"|2019/11/11 || Dong Wang || Helmholtz Machine & The ML criterion  || [[媒体文件:Helmholtz Machine & The ML criterion.pdf|slides]] ||
+
| 2021/10/20  ||Renmiao Chen || Is Someone Speaking? || [[媒体文件:Is_Someone_Speaking_Exploring_Long-term_Temporal_Features.pdf ]]
 
|-
 
|-
| rowspan="1"|2019/12/02 || Jiawen Kang || Gan Laten Space Manipulation & Flow Application Papers  || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/c/ca/GAN_Lantent_Space_manunipulation_%26_Flow_Application.pdf slides] ||
+
| 2021/10/28  ||Chen Chen    || WenetSpeech Introduction || [[媒体文件:WenetSpeech_Dataset_Introduction.pdf ‎]]
 
|-
 
|-
| rowspan="1"|2019/12/09 || Dong Wang || Style transfer and information factorization || [[媒体文件:Style Transfer with Generative Models.pdf|slides]] ||
+
| 2021/11/10  ||Weida Liang  || Cycle-loss Exemplar Autoencoder || [[媒体文件:Cycle-loss_Exemplar_Autoencoder.pdf ]]
 
|-
 
|-
| rowspan="1"|2019/12/16 || Zhiyuan Tang || Conditional Generative Flow  || [[媒体文件:Conditional GLow.pdf|slides]] ||
+
| 2021/11/17  ||吾买尔江      || Modulation Spectrum || [[媒体文件:Modulation_Spectrum.pdf ]]
 
|-
 
|-
| rowspan="1"|2019/12/23 || Lantian Li || Deep Generative Model in Speaker Recognition || [[媒体文件:Deep Generative Model in Speaker Recognition.pdf|slides]] ||
+
| 2021/11/24  ||Chen Chen    || S-DCCRN || [[媒体文件:S-DCCRN_pdf.pdf ]]
 
|-
 
|-
| rowspan="1"|2019/12/30 || Wenqiang Du || Cross-bandwidth Train || [[媒体文件:Cross-bandwidth_Train.pdf|slides]] ||
+
| 2021/12/01  ||Pengqi Li    || GuidedMix: An on-the-fly data augmentation approach for robust speaker recognition system || [[媒体文件:201201-GuidedMix-LPQ.pdf ]]
 
|-
 
|-
| rowspan="1"|2019/01/06 || Yunqi Cai || Do Deep Generative Models Know What They Don't Know ?|| [[媒体文件:2020.1.6_group_meeting.pdf|slides]] ||
+
| 2021/12/08  ||Renmiao Chen || Multimodal preson verification || [[媒体文件:Multimodal_preson_verification.pdf]]
 
|-
 
|-
| rowspan="1"|2019/01/10 || Haoran Sun || Flow++: Improving Flow-Based Generative Models with Variational Dequantization and Architecture Design || [[媒体文件:Flow++.pdf|slides]] ||
+
| 2021/12/15  ||Ruihai Hou  || Crossmodal clustered contrastive learning: Grounding of spoken language to gesture || [[媒体文件:Crossmodal_clustered_contrasti.pdf]]
 
|-
 
|-
| rowspan="1"|2020/01/13 || Ying Shi || Deep Generative Model Energy Based Model || [[媒体文件:Deep_Generative_Model.pdf|slides]] ||
+
| 2021/12/29  ||Zixi Yan    || Capsules Network || [[媒体文件:Capsules_Network.pdf]]
 
|-
 
|-
| rowspan="1"|2020/02/10 || Dong Wang || Deep Generative Models for Discriminative Tasks || [[媒体文件:Re-Thinking for Discriminative and Generative Models.pdf|slides]]||
+
| 2022/01/05  ||Sirui Li    || Self-Supervised Learning for speech recognition with Intermediate layer supervision || [[媒体文件:SSL with Intermediate layer supervision.pdf]]
 
|-
 
|-
| rowspan="1"|2020/02/17 || Zhiyuan Tang || Unsupervised Learning of Disentangled Representations  || [[媒体文件:20200217 Unsupervised disentanglement.pdf|slides]] ||
+
| 2022/01/12  ||Weida Liang  || FragmentVC || [[媒体文件:FragmentVC.pdf]]
 
|-
 
|-
| rowspan="1"|2020/02/24 || Lantian Li || Weakly- & Self-Supervised Learning || [[媒体文件:Weakly-_%26_Self-Supervised_Learning.pdf|slides]] ||
+
| 2022/01/19  ||Haoyu Jiang  || Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video || [[媒体文件:Multi-modality_Associative_Bridging_through_Memory.pdf]]
 
|-
 
|-
| rowspan="1"|2020/03/02 || Yunqi Cai || Deep Normalization for Speaker Vectors|| [[媒体文件:Deep_Normalization_for_Speaker_Vectors_.pdf|slides]]||
+
| 2022/02/14  ||             || Interspeech 2021 Review || [[媒体文件:Interspeech_paper_review_min.pdf]]
 
|-
 
|-
| rowspan="1"|2020/03/09 || Ying Shi || Speech Enhancement base on Double Flow || [[媒体文件:Speech_Enhancement_base_on_Double_Flow.pdf|slides]]||
+
| 2022/02/16  ||Chen Chen    || Audio Visual HuBERT || [[媒体文件:AVHuBERT.pdf]]
 
|-
 
|-
| rowspan="1"|2020/03/16 || Dong Wang || Bayesian scoring and uncertainty manipulation || [[媒体文件:Uncertainty Propagation.pdf|slides]]||
+
| 2022/03/04  ||Pengqi Li    || Study of Visualization || [[媒体文件:Visualization.pdf]]
 
|-
 
|-
| rowspan="1"|2020/03/23 || Zhiyuan Tang || Classifier involves Energy Based Model  || [[媒体文件:200323 energy model.pdf|slides]] ||
+
| 2022/03/11  ||Renmiao Chen || Can audio-visual integration strengthen robustness under multimodal attacks? || [[媒体文件:Audio-Visual_Robustness_Under_Multimodal_Attacks.pdf]]
 
|-
 
|-
| rowspan="1"|2020/03/30 || Lantian Li ||  Bayesian scoring in speaker verification || Temporarily held for security ||  
+
| 2022/03/11  ||吾买尔江      || Signal Separation || [[媒体文件:Signal_Separation.pdf]]
 
|-
 
|-
| rowspan="1"|2020/04/06 || Yunqi Cai || Posterior Collapse|| [[媒体文件:Posterior_Collapse.pdf|slides]]||
+
| 2022/03/18  ||Chen Chen    || Overview on Lip Reading and Audio-visual Speech Recognition || [[媒体文件:LipReadingAndAVSR.pdf]]
 
|-
 
|-
| rowspan="1"|2020/04/13 || Lantian Li || NDA in ASV || Temporarily held for security [cvss 761] ||
+
| 2022/04/01  ||Ruihai Hou  || Scalable Identity-Oriented Speech Retrieval || [[媒体文件:Scalable_Identity-Oriented_Speech_Retrieval.pdf]]
 
|-
 
|-
| rowspan="1"|2020/04/20 || Ying Shi || Speech_Enhancement_base_on_Flow ||[[媒体文件:Speech_Enhancement_base_on_Flow.pdf|slides]] ||
+
| 2022/04/08  ||Zixi Yan    || Wav2vec related papers share || [[媒体文件:Wav2vec_related_papers.pdf]]
 
|-
 
|-
| rowspan="1"|2020/05/11 || Dong Wang  || Real DNF || [[媒体文件:Real_DNF.pdf|Slide]] ||
+
| 2022/04/22  ||Sirui Li    || Speech-Based Language Modelling || [[媒体文件:Speech-Based Language Modelling.pdf]]
 
|-
 
|-
| rowspan="1"|2020/05/26 || Sitong Cheng || ASR-Free Pronunciation Assessment || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/9/9a/ASR-Free_Pronunciation_Assessment.pdf slides] ||
+
| 2022/04/29  ||Haoyu Jiang  || Models of Speaker Recognition || [[媒体文件:Models_of_Speaker_Recognition.pdf]]
 
|-
 
|-
| rowspan="1"|2020/05/26 || Jiawen Kang ||  RobustMAML || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/8/8e/RobustMAML.pdf slides] ||
+
| 2022/05/13  ||Chen Chen    || Audio-visual Representation Learning || [[媒体文件:Audio_visual_representation_learning.pdf]]
 
|-
 
|-
| rowspan="1"|2020/05/26 || Jiawen Kang ||  Domain adaptation review || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/6/6d/Presentation-Meta-learning.pdf slides] ||  
+
| 2022/05/20  ||Haoran Sun  ||  ||  
 
|-
 
|-
| rowspan="1"|2020/05/26 || Jiawen Kang || SOTA models for VPR || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/d/d2/SOTA_models_for_VPR.pdf slides] ||
+
| 2022/05/27  ||Pengqi Li    || The important ”feature” for speaker recognition || [[媒体文件:The important ”feature” for speaker recognition.pdf]]
 
|-
 
|-
| rowspan="1"|2020/06/01 || Dong Wang || How MAML succeeded?  || [https://arxiv.org/pdf/1909.09157.pdf][https://pdfs.semanticscholar.org/e6e9/c9d50b11ced939faf42f1c65bf9360eefd73.pdf][https://arxiv.org/pdf/1706.05806.pdf] ||
+
| 2022/06/10  ||Zixi Yan    || Paper Share || [[媒体文件:Paper_share_yzx0610.pdf]]
 
|-
 
|-
| rowspan="1"|2020/06/09 || Zhiyuan Tang  || Flow Wheels || [[媒体文件:20200408 flow wheels.pdf|slides]] ||
+
| 2022/06/24  ||Renmiao Chen || Transformer in multimodal || [[媒体文件:Transformer_in_multimodal.pdf]]
 
|-
 
|-
| rowspan="1"|2020/06/15 || Lantian Li  ||  Uncertainty Modeling and Inference || [[媒体文件:200615-Uncertainty.pdf|slides]]  ||
+
|             ||             || ICASSP 2022 review || [[媒体文件:ICASSP2022_review.pdf]]  [[媒体文件:ICASSP-2022-readinglist.pdf]]
 
|-
 
|-
| rowspan="1"|2020/06/22 || Lantian Li  || Gaussians in High Dimension || [[媒体文件:High-dimensioaln-Gaussian.pdf|slides]] ||
+
| 2022/07/04  ||Chen Chen    || Video to Speech papers || [[媒体文件:VTS_cc.pdf]]
 
|-
 
|-
| rowspan="1"|2020/06/22 || Dong Wang  || Self training for SE and ASR || [[媒体文件:Self-Training.pdf|slides]] ||
+
| 2022/07/08  ||Ruihai Hou  || ICASSP 2022 review (part) || [[媒体文件:Weeklyreading_hrh.pdf]]
 
|-
 
|-
| rowspan="1"|2020/06/29 || Ying Shi  || Speech enhancement & separation || [[媒体文件:Speech-Separation-and-Enhancement.pdf|slides]] ||
+
| 2022/07/15  ||Sirui Li    || Towards End-to-end Unsupervised Speech Recognition || [[媒体文件:Towards_End_to_end_Unsupervised_Speech_Recognition.pdf]]
 
|-
 
|-
| rowspan="1"|2020/07/06 || Haolin Chen  || Self-supervised Learning in Speech Processing || [[媒体文件:Self-Supervised.pptx|slides]] ||
+
| 2022/07/22  ||Wan Lin      || AutoED: Text-independent unsupervised speaker recognition Model|| [[媒体文件:AutoED_spk_reg.pdf]]
 
|-
 
|-
| rowspan="1"|2020/07/13 || Zhiyuan Tang || Exploding inverse in INN || [[媒体文件:20200713 dig into flow.pdf|slides]] ||
+
| 2022/07/29  ||Haoyu Jiang || ArcFace_iQIYI-VID || [[媒体文件:ArcFace_iQIYI-VID.pdf]]
 
|-
 
|-
| rowspan="1"|2020/07/20 || Lantian Li  || Principle Solution for Enroll-Test Mismatch || [[媒体文件:200720-mismatch.pdf|slides]] ||
+
| 2022/08/05  ||Chen Chen    || Recent advance in VTS task || [[媒体文件:RecentVTS.pdf]]
 
|-
 
|-
| rowspan="1"|2020/08/17 || Dong Wang || Decoupled scoring || [[媒体文件:Decoupled.pdf|slides]] ||
+
| 2022/08/12  ||Tianhao Wang || Extremal Perturbations || [[媒体文件:Extremal_perturbations.pdf]]
 
|-
 
|-
| rowspan="1"|2020/08/24 || Zhiyuan Tang || G & D Acoustic model || [[媒体文件:20200824 flow asr.pdf | slides]]   ||
+
| 2022/08/19  ||Renmiao Chen || The correlation of face and vioce || [[媒体文件:The_correlation_of_face_and_vioce_CRM.pdf]]
 
|-
 
|-
| rowspan="1"|2020/09/01 || Lantian Li || Decoupled NL ||    ||  
+
| 2022/09/02  ||Zixi Yan    || Non-Contrastive Self-supervised Learning || [[媒体文件:Non_contrastive_Self_supervised_Learning.pdf]]
 
|-
 
|-
| rowspan="1"|2020/09/07 || Yunqi Cai ||Deep generative model based Anomaly detection||[[媒体文件:Anomaly_detection.pdf | slides]]||
+
| 2022/09/09  ||Sirui Li    || Low Resource Speech Recognition || [[媒体文件:Low_Resource_Speech_Recognition_lsr_0909.pdf]]
 
|-
 
|-
| rowspan="1"|2020/09/14 || Dong Wang || How we factorize speech? || [[媒体文件:Factorization.pdf|slides]]     ||
+
| 2022/09/16  ||Xipin Wei    || Controllable Multi-style Music Generation Model based on simple Contrastive Learning || [[媒体文件:Controllable_Multi_style_Music_Generation_Model_based_on_simple_Contrastive_learning.pdf]]
 
|-
 
|-
| rowspan="1"|2020/10/05 || Dong Wang || Remarks on DNF || [[媒体文件:Remakrs on DNF.pptx|slides]]     ||
+
| 2022/09/23  ||Haoyu Jiang  || Audio Visual Learning || [[媒体文件:Audio_Visual_Learning.pdf]]
 
|-
 
|-
| rowspan="1"|2020/10/12 || Dong Wang || Paper Reading: Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations || [[媒体文件:Challenge-disentanglement.pptx|slides]] [http://proceedings.mlr.press/v97/locatello19a/locatello19a.pdf paper link]      ||
+
| 2022/09/30  ||Chen Chen    || Speech Quality Assessment || [[媒体文件:220930_cchen_SpeechQualityAssessment.pdf]]
 
|-
 
|-
| rowspan="1"|2020/10/19 || Haoran Sun || Informational Speech Factorization by Factorial Discriminative Normalization Flow || [[媒体文件:Informational_Speech_Factorization.pdf|slides]]     ||
+
| 2022/10/07  ||Wan Lin      || Cross-Domain Speaker Recognition || [[媒体文件:Cross_Domain_Speaker_Recognition.pdf]]
 
|-
 
|-
| rowspan="1"|2020/10/27 || Jiao Han || Experimental report mainly based on DNF models || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/e/e9/Experimental_report_mainly_based_on_DNF_models.pdf slides]   ||
+
| 2022/10/14  ||Tianhao Wang || How do deep speaker models treat silence and noises || [[媒体文件:20221014_wth.pdf]]
 
|-
 
|-
| rowspan="1"|2020/11/02 || Lantian Li || INTERSPEECH 2020 (SRE) || [[媒体文件:201102-INTERSPEECH_2020-SRE-LLT.pdf|slides]]     ||
+
| 2022/10/31  ||Pengqi Li   || Visualization of a specific filter in CNN || [[媒体文件:Visualization of a specific filter in CNN.pdf]]
 
|-
 
|-
| rowspan="1"|2020/11/09 || Yunqi Cai || Deep normalization_V1 || [[媒体文件:Deep_norm_trilogy_v1.pdf|slides]] [http://caiyq.cslt.org/doc/deepnorm_v1.mp4 video]    ||
+
| 2022/11/04  ||Zhenyu Zhou  || Acoustic-aware Training for Multi-genre Speaker Recognition || [[媒体文件:20221104_acoustic_training.pdf]]
 
|-
 
|-
| rowspan="1"|2020/11/16 || Yunqi Cai || Deep normalization_V2 || [http://caiyq.cslt.org/doc/deep-norm-trilogy_v2.pptx slides] [http://caiyq.cslt.org/doc/deepnorm_v2.mp4 video]     ||
+
| 2022/11/07  ||Chen Chen & Renmiao Chen || Experience and perceptions of collecting Audio-Visual dataset || [[媒体文件:20221107_cc_crm.pdf]]
 
|-
 
|-
| rowspan="1"|2020/11/17 || Di Wang || Statistics decomposition for NL Scoring || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/9/97/Statistics_decomposition_for_NL_Scoring.pdf slides]   ||
+
| 2022/12/23  ||Renmiao Chen || IS22 and Perceiver IO|| [[媒体文件:221223CRM.pdf]]
 
|-
 
|-
| rowspan="1"|2020/11/23 || Yunqi Cai || Deep normalization_V3 || [http://caiyq.cslt.org/doc/deep-norm-trilogy_v3.pptx slides] [http://caiyq.cslt.org/doc/deepnorm_v3.mp4 video]   ||
+
| 2022/12/23 ||Dong Wang    || NIPS2022 || [[媒体文件:NIPS2022.pdf]]
 
|-
 
|-
| rowspan="1"|2020/12/08 || Yunqi Cai || From materials science to perceptual intelligence || [http://caiyq.cslt.org/doc/perceptual_intelligence.pptx slides] [http://caiyq.cslt.org/doc/**.mp4 video]   ||
+
| 2022/12/30  ||Chen Chen    || Perceptual in Generative Audio Models || [[媒体文件:221230_cc.pdf]]
 
|-
 
|-
| rowspan="1"|2020/12/08 || Dong Wang || From noise injection to Bayes PLDA || [[媒体文件:Bayes-plda.ppt|slides]]   ||
+
|             ||             || IS22_review || [[媒体文件:IS22_review_all.pdf]]
 
|-
 
|-
| rowspan="1"|2020/12/21 || Lantian Li || Speech in NIPS 2019/2020 || [[媒体文件:Speech in NIPS 19&20.pdf|slides]]   ||
+
| 2023/02/10  ||Jiaying Wang || Ordered binary speaker embedding || [[媒体文件:230210wjy.pdf]]
 
|-
 
|-
| rowspan="1"|2020/12/28 || Pengqi Li || Domain generalization via robust optimization || [[媒体文件:201228-Device_Generalization.pdf|slides]]   ||
+
| 2023/02/17  ||Xipin Wei    || MSAT: A Multi-Scale Attentive Transformer for Multi-Instrument Symbolic Music Generation || [[媒体文件:MSAT_wxp.pdf]]
 
|-
 
|-
| rowspan="1"|2021/01/07 || Dong Wang || What we believe || [[媒体文件:What we believe.pdf|slides]]   ||
+
| 2023/03/10  ||Zhenyu Zhou  || consistence_loss&BCE_loss || [[媒体文件:consistence_loss&BCE_loss.pdf]]
 
|-
 
|-
| rowspan="1"|2021/01/14 || Dong Wang || Reparametric trick || [[媒体文件:Reparametric.pdf|slides]]   ||
+
| 2023/03/17  ||Tianhao Wang || Score calibration in speaker verification || [[媒体文件:Score_calibration_in_speaker_verification.pdf]]
 
|-
 
|-
| rowspan="1"|2021/02/01 || Dong Wang || Data augmentation as regularization || [[媒体文件:Data-augmentation.pdf|slides]]   ||
+
| 2023/03/31  ||Wan Lin      || Understand contrast and non-contrast in self-supervised learning || [[媒体文件:Understand contrast and non-contrast in self-supervised learning.pdf]]
 
|-
 
|-
| rowspan="1"|2021/02/22 || Lantian Li || Ensemble and Distillation || [[媒体文件:2012.09816.pdf|paper]] [[媒体文件:Ensemble_And_Distillation.pdf|slides]]  ||
+
| 2023/04/14  ||Pengqi Li   || Towards Attribution Methods in Deep Speaker Recognition || [[媒体文件:Towards_Attribution_Methods_in_Deep_Speaker_Recognition_230414_lpq.pdf]]
 
|-
 
|-
| rowspan="1"|2021/03/08 || Dong Wang || HIERARCHICAL GENERATIVE MODELING FOR CONTROLLABLE SPEECH SYNTHESIS || [https://arxiv.org/pdf/1810.07217.pdf paper] [[媒体文件:HIERARCHICALGENERATIVEMODELING FORCONTROLLABLESPEECHSYNTHESIS.pdf|slides]]   ||
+
| 2023/04/21  ||Chen Chen    || Masked Prediction Task Based Self-supervised Multimodal Learning || [[媒体文件:Masked_prediction_task_based_self-supervised_multimodal_learning.pdf]]
 
|-
 
|-
| rowspan="1"|2021/03/15 || Dong Wang || 第三代人工智能 || [http://scis.scichina.com/cn/2020/SSI-2020-0204.pdf  paper] [[媒体文件:第三代人工智能.pdf|slides]]   ||
+
| 2023/04/28  ||Xiaolou Li  || Incomplete Multimodal Method Exploration || [[媒体文件:Incomplete_Multimodal_Method_Exploration.pdf]]
 
|-
 
|-
| rowspan="1"|2021/03/22 || Chao Xing || Complexity neural net in speech enhancement || [http://web.cse.ohio-state.edu/~wang.77/papers/WWW.taslp20.pdf paper1][https://openreview.net/pdf?id=SkeRTsAcYm paper2] [https://arxiv.org/pdf/2008.00264.pdf paper3] ||
+
| 2023/05/04  ||Renmiao Chen || Applications of Diffusion Model || [[媒体文件:230505_Applications_of_Diffusion_Model_CRM.pdf]]
 
|-
 
|-
| rowspan="1"|2021/03/29 || Ying Shi || Some methods about speech enhancement || [[媒体文件:SPEECH ENHANCMENGT.pdf|slides]]   ||
+
| 2023/05/19  ||Jiaying Wang || DSH based method||[[媒体文件:230519_DSH_based_paper.pptx]]
 
|-
 
|-
| rowspan="1"|2021/04/05 || Jiyuan Zhang || 推理 & 知识推理调研 || [[媒体文件:知识推理相关调研.pdf|slides]]   ||
+
| 2023/05/26  ||Zhenyu Zhou  || representation learning approach for domain adaptation || [[媒体文件:Representation_learning_approach_for_domain_adaptation.pptx]]
 
|-
 
|-
| rowspan="1"|2021/04/12 || Zicheng Qiu || Some work on minorlingual speech recognition||  ||
+
| 2023/06/02  ||Pengqi Li    ||  ||
 
|-
 
|-
| rowspan="1"|2021/04/19 || Shiyue Zhang || Text summarization||  ||
+
| 2023/06/30  ||Tianhao Wang || Robust Speaker Verification ICASSP2023 || [[媒体文件:20230630_Robust_Speaker_Verification_ICASSP2023.pdf]]
 
|-
 
|-
| rowspan="1"|2021/04/26 || Dong Wang || Paper reading: Metadata normalization || [[媒体文件:Meta normalization.pdf|slides]] [https://arxiv.org/pdf/2104.09052.pdf paper]  ||
+
| 2023/10/13  ||Xiaolou Li  || ||
 
|-
 
|-
| rowspan="1"|2021/05/10 || Lantian Li || Explainable ML || [[媒体文件:Explainable_ML.pdf|slides]] ||  
+
| 2023/10/20  ||Zehua Liu    || ||
 
|-
 
|-
| rowspan="1"|2021/05/17 || Jie Li ||  || Tea cake Re-identification ||
+
| 2023/10/27  ||Junhui Chen  ||  ||
 
|-
 
|-
 
|}
 
|}
  
*[[媒体文件:代码规范.pdf  | 代码规范说明]]
 
  
  
 
[[Old readings|Past Events]]
 
[[Old readings|Past Events]]

2023年10月10日 (二) 02:22的最后版本

清华大学语音语言中心内部学习会

时间: 每周五晚19:30

地点: 1区303


Date Speaker Title Materials
PPT模板 媒体文件:Weeklyreading_template.rar
2021/04/01 Haoran Sun Zeus code regularization 媒体文件:代码规范.pdf
2021/05/20 Chen Chen Overview of speech enhancement 媒体文件:Speech_enhancement.pdf
2021/05/27 Di Wang Secret of 'hard trials' 媒体文件:Secret_of_hard_trials.pdf
2021/06/10 Jingxin Shen Expriments about thermal to RGB face synthesis with cycleGan and pix2pix 媒体文件:Expriments about thermal to RGB face synthesis with cycleGan and pix2pix.pdf
2021/06/17 Yang Zhang NIPS2020: Long-Tailed Classification by Keeping the Good and Removing the Bad Momentum Causal Effect 媒体文件:long-tail.pdf
2021/07/08 Tiankai Zhi Some experiments on stargan 媒体文件:Some experiments on stargan.pdf
2021/07/15 Jiao Han MG experiments based on ASV system 媒体文件:MG experiments based on ASV system..pptx
2021/07/22 Zixi Yan & Sirui Li Unsupervised Speech Recognition 媒体文件:Unsupervised_Speech_Recognition.pdf
2021/07/29 Pengqi Li A Simulation Study on 􏰛􏰜 Ro􏰛bust MAML 媒体文件:A Simulation Study on 􏰛􏰜 Ro􏰛bust MAML.pdf
2021/08/12 Qingyang Zhu Noise-aware method for Speech Enhancement 媒体文件:Noise-aware method for Speech Enhancement.pdf
2021/08/12 Weida Liang Unsupervised Audio-Visual Synthesis via Exemplar Autoencoders 媒体文件:Bi-weekly_report_Liangwd.pdf
2021/08/19 Di Wang Inter Dataset Variability Compensation 媒体文件:Inter_dataset_variability_compensation.pdf
2021/09/02 Tiankai Zhi One Shot VC 媒体文件:One_shot_VC.pdf
2021/09/09 Jingxin Shen Thermal Speaking 媒体文件:Thermal_Speaking_2021.pdf
2021/09/23 Sirui Li & Zixi Yan Wav2vec-U Experimental Report 媒体文件:Wav2vec-U_experimental_report.pdf ‎
2021/10/20 Renmiao Chen Is Someone Speaking? 媒体文件:Is_Someone_Speaking_Exploring_Long-term_Temporal_Features.pdf ‎
2021/10/28 Chen Chen WenetSpeech Introduction 媒体文件:WenetSpeech_Dataset_Introduction.pdf ‎
2021/11/10 Weida Liang Cycle-loss Exemplar Autoencoder 媒体文件:Cycle-loss_Exemplar_Autoencoder.pdf ‎
2021/11/17 吾买尔江 Modulation Spectrum 媒体文件:Modulation_Spectrum.pdf ‎
2021/11/24 Chen Chen S-DCCRN 媒体文件:S-DCCRN_pdf.pdf ‎
2021/12/01 Pengqi Li GuidedMix: An on-the-fly data augmentation approach for robust speaker recognition system 媒体文件:201201-GuidedMix-LPQ.pdf ‎
2021/12/08 Renmiao Chen Multimodal preson verification 媒体文件:Multimodal_preson_verification.pdf
2021/12/15 Ruihai Hou Crossmodal clustered contrastive learning: Grounding of spoken language to gesture 媒体文件:Crossmodal_clustered_contrasti.pdf
2021/12/29 Zixi Yan Capsules Network 媒体文件:Capsules_Network.pdf
2022/01/05 Sirui Li Self-Supervised Learning for speech recognition with Intermediate layer supervision 媒体文件:SSL with Intermediate layer supervision.pdf
2022/01/12 Weida Liang FragmentVC 媒体文件:FragmentVC.pdf
2022/01/19 Haoyu Jiang Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video 媒体文件:Multi-modality_Associative_Bridging_through_Memory.pdf
2022/02/14 Interspeech 2021 Review 媒体文件:Interspeech_paper_review_min.pdf
2022/02/16 Chen Chen Audio Visual HuBERT 媒体文件:AVHuBERT.pdf
2022/03/04 Pengqi Li Study of Visualization 媒体文件:Visualization.pdf
2022/03/11 Renmiao Chen Can audio-visual integration strengthen robustness under multimodal attacks? 媒体文件:Audio-Visual_Robustness_Under_Multimodal_Attacks.pdf
2022/03/11 吾买尔江 Signal Separation 媒体文件:Signal_Separation.pdf
2022/03/18 Chen Chen Overview on Lip Reading and Audio-visual Speech Recognition 媒体文件:LipReadingAndAVSR.pdf
2022/04/01 Ruihai Hou Scalable Identity-Oriented Speech Retrieval 媒体文件:Scalable_Identity-Oriented_Speech_Retrieval.pdf
2022/04/08 Zixi Yan Wav2vec related papers share 媒体文件:Wav2vec_related_papers.pdf
2022/04/22 Sirui Li Speech-Based Language Modelling 媒体文件:Speech-Based Language Modelling.pdf
2022/04/29 Haoyu Jiang Models of Speaker Recognition 媒体文件:Models_of_Speaker_Recognition.pdf
2022/05/13 Chen Chen Audio-visual Representation Learning 媒体文件:Audio_visual_representation_learning.pdf
2022/05/20 Haoran Sun
2022/05/27 Pengqi Li The important ”feature” for speaker recognition 媒体文件:The important ”feature” for speaker recognition.pdf
2022/06/10 Zixi Yan Paper Share 媒体文件:Paper_share_yzx0610.pdf
2022/06/24 Renmiao Chen Transformer in multimodal 媒体文件:Transformer_in_multimodal.pdf
ICASSP 2022 review 媒体文件:ICASSP2022_review.pdf 媒体文件:ICASSP-2022-readinglist.pdf
2022/07/04 Chen Chen Video to Speech papers 媒体文件:VTS_cc.pdf
2022/07/08 Ruihai Hou ICASSP 2022 review (part) 媒体文件:Weeklyreading_hrh.pdf
2022/07/15 Sirui Li Towards End-to-end Unsupervised Speech Recognition 媒体文件:Towards_End_to_end_Unsupervised_Speech_Recognition.pdf
2022/07/22 Wan Lin AutoED: Text-independent unsupervised speaker recognition Model 媒体文件:AutoED_spk_reg.pdf
2022/07/29 Haoyu Jiang ArcFace_iQIYI-VID 媒体文件:ArcFace_iQIYI-VID.pdf
2022/08/05 Chen Chen Recent advance in VTS task 媒体文件:RecentVTS.pdf
2022/08/12 Tianhao Wang Extremal Perturbations 媒体文件:Extremal_perturbations.pdf
2022/08/19 Renmiao Chen The correlation of face and vioce 媒体文件:The_correlation_of_face_and_vioce_CRM.pdf
2022/09/02 Zixi Yan Non-Contrastive Self-supervised Learning 媒体文件:Non_contrastive_Self_supervised_Learning.pdf
2022/09/09 Sirui Li Low Resource Speech Recognition 媒体文件:Low_Resource_Speech_Recognition_lsr_0909.pdf
2022/09/16 Xipin Wei Controllable Multi-style Music Generation Model based on simple Contrastive Learning 媒体文件:Controllable_Multi_style_Music_Generation_Model_based_on_simple_Contrastive_learning.pdf
2022/09/23 Haoyu Jiang Audio Visual Learning 媒体文件:Audio_Visual_Learning.pdf
2022/09/30 Chen Chen Speech Quality Assessment 媒体文件:220930_cchen_SpeechQualityAssessment.pdf
2022/10/07 Wan Lin Cross-Domain Speaker Recognition 媒体文件:Cross_Domain_Speaker_Recognition.pdf
2022/10/14 Tianhao Wang How do deep speaker models treat silence and noises 媒体文件:20221014_wth.pdf
2022/10/31 Pengqi Li Visualization of a specific filter in CNN 媒体文件:Visualization of a specific filter in CNN.pdf
2022/11/04 Zhenyu Zhou Acoustic-aware Training for Multi-genre Speaker Recognition 媒体文件:20221104_acoustic_training.pdf
2022/11/07 Chen Chen & Renmiao Chen Experience and perceptions of collecting Audio-Visual dataset 媒体文件:20221107_cc_crm.pdf
2022/12/23 Renmiao Chen IS22 and Perceiver IO 媒体文件:221223CRM.pdf
2022/12/23 Dong Wang NIPS2022 媒体文件:NIPS2022.pdf
2022/12/30 Chen Chen Perceptual in Generative Audio Models 媒体文件:221230_cc.pdf
IS22_review 媒体文件:IS22_review_all.pdf
2023/02/10 Jiaying Wang Ordered binary speaker embedding 媒体文件:230210wjy.pdf
2023/02/17 Xipin Wei MSAT: A Multi-Scale Attentive Transformer for Multi-Instrument Symbolic Music Generation 媒体文件:MSAT_wxp.pdf
2023/03/10 Zhenyu Zhou consistence_loss&BCE_loss 媒体文件:consistence_loss&BCE_loss.pdf
2023/03/17 Tianhao Wang Score calibration in speaker verification 媒体文件:Score_calibration_in_speaker_verification.pdf
2023/03/31 Wan Lin Understand contrast and non-contrast in self-supervised learning 媒体文件:Understand contrast and non-contrast in self-supervised learning.pdf
2023/04/14 Pengqi Li Towards Attribution Methods in Deep Speaker Recognition 媒体文件:Towards_Attribution_Methods_in_Deep_Speaker_Recognition_230414_lpq.pdf
2023/04/21 Chen Chen Masked Prediction Task Based Self-supervised Multimodal Learning 媒体文件:Masked_prediction_task_based_self-supervised_multimodal_learning.pdf
2023/04/28 Xiaolou Li Incomplete Multimodal Method Exploration 媒体文件:Incomplete_Multimodal_Method_Exploration.pdf
2023/05/04 Renmiao Chen Applications of Diffusion Model 媒体文件:230505_Applications_of_Diffusion_Model_CRM.pdf
2023/05/19 Jiaying Wang DSH based method 媒体文件:230519_DSH_based_paper.pptx
2023/05/26 Zhenyu Zhou representation learning approach for domain adaptation 媒体文件:Representation_learning_approach_for_domain_adaptation.pptx
2023/06/02 Pengqi Li
2023/06/30 Tianhao Wang Robust Speaker Verification ICASSP2023 媒体文件:20230630_Robust_Speaker_Verification_ICASSP2023.pdf
2023/10/13 Xiaolou Li
2023/10/20 Zehua Liu
2023/10/27 Junhui Chen


Past Events