“Weekly reading”版本间的差异

2023年5月8日 (一) 06:37的版本

清华大学语音语言中心内部学习会

时间：每周五晚19:30

地点： 1区303

Date	Speaker	Title	Materials
		PPT模板	媒体文件:Weeklyreading_template.rar
2021/04/01	Haoran Sun	Zeus code regularization	媒体文件:代码规范.pdf
2021/05/20	Chen Chen	Overview of speech enhancement	媒体文件:Speech_enhancement.pdf
2021/05/27	Di Wang	Secret of 'hard trials'	媒体文件:Secret_of_hard_trials.pdf
2021/06/10	Jingxin Shen	Expriments about thermal to RGB face synthesis with cycleGan and pix2pix	媒体文件:Expriments about thermal to RGB face synthesis with cycleGan and pix2pix.pdf
2021/06/17	Yang Zhang	NIPS2020: Long-Tailed Classification by Keeping the Good and Removing the Bad Momentum Causal Effect	媒体文件:long-tail.pdf
2021/07/08	Tiankai Zhi	Some experiments on stargan	媒体文件:Some experiments on stargan.pdf
2021/07/15	Jiao Han	MG experiments based on ASV system	媒体文件:MG experiments based on ASV system..pptx
2021/07/22	Zixi Yan & Sirui Li	Unsupervised Speech Recognition	媒体文件:Unsupervised_Speech_Recognition.pdf
2021/07/29	Pengqi Li	A Simulation Study on 􏰛􏰜 Ro􏰛bust MAML	媒体文件:A Simulation Study on 􏰛􏰜 Ro􏰛bust MAML.pdf
2021/08/12	Qingyang Zhu	Noise-aware method for Speech Enhancement	媒体文件:Noise-aware method for Speech Enhancement.pdf
2021/08/12	Weida Liang	Unsupervised Audio-Visual Synthesis via Exemplar Autoencoders	媒体文件:Bi-weekly_report_Liangwd.pdf
2021/08/19	Di Wang	Inter Dataset Variability Compensation	媒体文件:Inter_dataset_variability_compensation.pdf
2021/09/02	Tiankai Zhi	One Shot VC	媒体文件:One_shot_VC.pdf
2021/09/09	Jingxin Shen	Thermal Speaking	媒体文件:Thermal_Speaking_2021.pdf
2021/09/23	Sirui Li & Zixi Yan	Wav2vec-U Experimental Report	媒体文件:Wav2vec-U_experimental_report.pdf ‎
2021/10/20	Renmiao Chen	Is Someone Speaking?	媒体文件:Is_Someone_Speaking_Exploring_Long-term_Temporal_Features.pdf ‎
2021/10/28	Chen Chen	WenetSpeech Introduction	媒体文件:WenetSpeech_Dataset_Introduction.pdf ‎
2021/11/10	Weida Liang	Cycle-loss Exemplar Autoencoder	媒体文件:Cycle-loss_Exemplar_Autoencoder.pdf ‎
2021/11/17	吾买尔江	Modulation Spectrum	媒体文件:Modulation_Spectrum.pdf ‎
2021/11/24	Chen Chen	S-DCCRN	媒体文件:S-DCCRN_pdf.pdf ‎
2021/12/01	Pengqi Li	GuidedMix: An on-the-fly data augmentation approach for robust speaker recognition system	媒体文件:201201-GuidedMix-LPQ.pdf ‎
2021/12/08	Renmiao Chen	Multimodal preson verification	媒体文件:Multimodal_preson_verification.pdf
2021/12/15	Ruihai Hou	Crossmodal clustered contrastive learning: Grounding of spoken language to gesture	媒体文件:Crossmodal_clustered_contrasti.pdf
2021/12/29	Zixi Yan	Capsules Network	媒体文件:Capsules_Network.pdf
2022/01/05	Sirui Li	Self-Supervised Learning for speech recognition with Intermediate layer supervision	媒体文件:SSL with Intermediate layer supervision.pdf
2022/01/12	Weida Liang	FragmentVC	媒体文件:FragmentVC.pdf
2022/01/19	Haoyu Jiang	Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video	媒体文件:Multi-modality_Associative_Bridging_through_Memory.pdf
2022/02/14		Interspeech 2021 Review	媒体文件:Interspeech_paper_review_min.pdf
2022/02/16	Chen Chen	Audio Visual HuBERT	媒体文件:AVHuBERT.pdf
2022/03/04	Pengqi Li	Study of Visualization	媒体文件:Visualization.pdf
2022/03/11	Renmiao Chen	Can audio-visual integration strengthen robustness under multimodal attacks?	媒体文件:Audio-Visual_Robustness_Under_Multimodal_Attacks.pdf
2022/03/11	吾买尔江	Signal Separation	媒体文件:Signal_Separation.pdf
2022/03/18	Chen Chen	Overview on Lip Reading and Audio-visual Speech Recognition	媒体文件:LipReadingAndAVSR.pdf
2022/04/01	Ruihai Hou	Scalable Identity-Oriented Speech Retrieval	媒体文件:Scalable_Identity-Oriented_Speech_Retrieval.pdf
2022/04/08	Zixi Yan	Wav2vec related papers share	媒体文件:Wav2vec_related_papers.pdf
2022/04/22	Sirui Li	Speech-Based Language Modelling	媒体文件:Speech-Based Language Modelling.pdf
2022/04/29	Haoyu Jiang	Models of Speaker Recognition	媒体文件:Models_of_Speaker_Recognition.pdf
2022/05/13	Chen Chen	Audio-visual Representation Learning	媒体文件:Audio_visual_representation_learning.pdf
2022/05/20	Haoran Sun
2022/05/27	Pengqi Li	The important ”feature” for speaker recognition	媒体文件:The important ”feature” for speaker recognition.pdf
2022/06/10	Zixi Yan	Paper Share	媒体文件:Paper_share_yzx0610.pdf
2022/06/24	Renmiao Chen	Transformer in multimodal	媒体文件:Transformer_in_multimodal.pdf
		ICASSP 2022 review	媒体文件:ICASSP2022_review.pdf 媒体文件:ICASSP-2022-readinglist.pdf
2022/07/04	Chen Chen	Video to Speech papers	媒体文件:VTS_cc.pdf
2022/07/08	Ruihai Hou	ICASSP 2022 review (part)	媒体文件:Weeklyreading_hrh.pdf
2022/07/15	Sirui Li	Towards End-to-end Unsupervised Speech Recognition	媒体文件:Towards_End_to_end_Unsupervised_Speech_Recognition.pdf
2022/07/22	Wan Lin	AutoED: Text-independent unsupervised speaker recognition Model	媒体文件:AutoED_spk_reg.pdf
2022/07/29	Haoyu Jiang	ArcFace_iQIYI-VID	媒体文件:ArcFace_iQIYI-VID.pdf
2022/08/05	Chen Chen	Recent advance in VTS task	媒体文件:RecentVTS.pdf
2022/08/12	Tianhao Wang	Extremal Perturbations	媒体文件:Extremal_perturbations.pdf
2022/08/19	Renmiao Chen	The correlation of face and vioce	媒体文件:The_correlation_of_face_and_vioce_CRM.pdf
2022/09/02	Zixi Yan	Non-Contrastive Self-supervised Learning	媒体文件:Non_contrastive_Self_supervised_Learning.pdf
2022/09/09	Sirui Li	Low Resource Speech Recognition	媒体文件:Low_Resource_Speech_Recognition_lsr_0909.pdf
2022/09/16	Xipin Wei	Controllable Multi-style Music Generation Model based on simple Contrastive Learning	媒体文件:Controllable_Multi_style_Music_Generation_Model_based_on_simple_Contrastive_learning.pdf
2022/09/23	Haoyu Jiang	Audio Visual Learning	媒体文件:Audio_Visual_Learning.pdf
2022/09/30	Chen Chen	Speech Quality Assessment	媒体文件:220930_cchen_SpeechQualityAssessment.pdf
2022/10/07	Wan Lin	Cross-Domain Speaker Recognition	媒体文件:Cross_Domain_Speaker_Recognition.pdf
2022/10/14	Tianhao Wang	How do deep speaker models treat silence and noises	媒体文件:20221014_wth.pdf
2022/10/31	Pengqi Li	Visualization of a specific filter in CNN	媒体文件:Visualization of a specific filter in CNN.pdf
2022/11/04	Zhenyu Zhou	Acoustic-aware Training for Multi-genre Speaker Recognition	媒体文件:20221104_acoustic_training.pdf
2022/11/07	Chen Chen & Renmiao Chen	Experience and perceptions of collecting Audio-Visual dataset	媒体文件:20221107_cc_crm.pdf
2022/12/23	Renmiao Chen	IS22 and Perceiver IO	媒体文件:221223CRM.pdf
2022/12/23	Dong Wang	NIPS2022	媒体文件:NIPS2022.pdf
2022/12/30	Chen Chen	Perceptual in Generative Audio Models	媒体文件:221230_cc.pdf
		IS22_review	媒体文件:IS22_review_all.pdf
2023/02/10	Jiaying Wang	Ordered binary speaker embedding	媒体文件:230210wjy.pdf
2023/02/17	Xipin Wei	MSAT: A Multi-Scale Attentive Transformer for Multi-Instrument Symbolic Music Generation	媒体文件:MSAT_wxp.pdf
2023/03/10	Zhenyu Zhou	consistence_loss&BCE_loss	媒体文件:consistence_loss&BCE_loss.pdf
2023/03/17	Tianhao Wang	Score calibration in speaker verification	媒体文件:Score_calibration_in_speaker_verification.pdf
2023/03/31	Wan Lin	Understand contrast and non-contrast in self-supervised learning	媒体文件:Understand contrast and non-contrast in self-supervised learning.pdf
2023/04/14	Pengqi Li	Towards Attribution Methods in Deep Speaker Recognition	媒体文件:Towards_Attribution_Methods_in_Deep_Speaker_Recognition_230414_lpq.pdf
2023/04/21	Chen Chen	Masked Prediction Task Based Self-supervised Multimodal Learning	媒体文件:Masked_prediction_task_based_self-supervised_multimodal_learning.pdf
2022/04/28	Xiaolou Li	Incomplete Multimodal Method Exploration	媒体文件:Incomplete_Multimodal_Method_Exploration.pdf
2022/05/04	Renmiao Chen	Applications of Diffusion Model	媒体文件:230505_Applications_of_Diffusion_Model_CRM.pdf
2022/05/12	Jiaying Wang
2022/05/19	Zhenyu Zhou
2022/05/26	Tianhao Wang
2022/06/02	Pengqi Li
2022/06/09	Wan Lin

Past Events

“Weekly reading”版本间的差异

2023年5月8日 (一) 06:37的版本

导航菜单

个人工具

名字空间

变种

查看

操作

搜索

导航

工具

@@ 第1行： / 第1行： @@
-*Location: FIT-1-304
+'''清华大学语音语言中心内部学习会
-{| class="wikitable"
+'''时间： 每周五晚19:30'''
-! Date !! Speaker!! Title !! Materials !! On duty
-|-
-| 2012/08/27  ||Dong Wang  || Heterogeneous Convolutive Non-negative Sparse Coding ||[[媒体文件:Heterogeneous_convolutive_non-negative_sparse_coding.pdf|slides]] [http://homepages.inf.ed.ac.uk/v1dwang2/public/pdf/inerspeech2012-hetero.pdf paper] ||
-|-
-|2012/09/03  ||NO Meeting|| || ||
-|-
-|2012/09/10  || NO Meeting|| || ||
-|-
-|2012/09/17  ||WALEED ABDULLA||Auditory Based Feature Vectors for Speech Recognition ||[[媒体文件:AuditoryBasedFeatureVectors.pdf|slides]]||范淼
-|-
-| rowspan="2"|2012/09/24  ||刘超|| N-gram FST indexing for Spoken Term Detection || [[媒体文件:120924-N_gram_FST_indexing_for_Spoken_Term_Detection-LC-0.pdf|slides]] ||尹聪
-|-
-|范淼||Micro-blogging, Wikipedia, Folksonomy, What's Next? ||[[媒体文件:120924-Micro-blogging, Wikipedia, Folksonomy, What's Next-FM--01-FM-.pdf|slides]] ||
-|-
-| 2012/10/08 ||NO Meeting|| || ||
-|-
-| 2012/10/15  ||NO Meeting|| || ||
-|-
-|2012/10/22||Wu Xiaojun||speaker recognition in CSLT ||[[媒体文件:VPR_in_CSLT.pdf|slides]]||卡尔
-|-
-| rowspan="1"|2012/10/29  ||王军||An overview of Automatic Speaker Diarization Systems || [[媒体文件:121027-Speaker Diarization-WJ.pdf|slides]] ||别凡虎
-|-
-| rowspan="1"|2012/11/05  ||别凡虎||Experiments on Emotional Speaker Recognition||[[媒体文件:121104-Experiments_on_Emotional_Speaker_Recognition-BFH.pdf|slides]] ||刘超
-|-
-| rowspan="1"|2012/11/12  ||唐国瑜||Statistical Word Sense Improves Document Clustering ||[[媒体文件:121112_Statistical_Word_Sense_Improves_Document_Clustering_TGY.pdf‎ |slides]]||邱晗
-|-
-| rowspan="1"|2012/11/19  ||张陈昊||TDSR with Long-term Features Based on Functional Data Analysis||[[媒体文件:121118-ISCSLP-FDA_SR-ZCH.pdf|slides]] ||王俊俊
-|-
-| rowspan="1"|2012/11/26  ||王琳琳||Time-Varying Speaker Recognition: An Introduction||[[媒体文件:121126-Time_Varying_Speaker_Recognition_I-Wll.pdf‎|slides]] ||龚宬
-|-
-| rowspan="1"|2012/12/03  ||No meeting|| || ||
-|-
-| rowspan="1"|2012/12/10  ||No meeting|| || ||
-|-
-| rowspan="1"|2012/12/17  ||No meeting|| || ||
-|-
+'''地点： 1区303'''
-| rowspan="1"|2012/01/07  || || || ||
-|-
-|2012/01/07  ||王军||基于DF-MAP的说话人模型训练方法||[[媒体文件:130107-基于DFMAP的说话人模型训练方法-WJ.pdf|slides]] ||唐国瑜
-|-
-| rowspan="1"|2012/01/14  ||王东|| Computing in CSLT ||[[媒体文件:Computing_in_CSLT.pdf|slides]] ||王琳琳
-|-
+{| class="wikitable"
+! Date !! Speaker!! Title !! Materials
 |-
-| rowspan="1"|2013/03/04  ||王军||Sequential Adaptive Learning for Speaker Verification ||[[媒体文件:130301-Sequential adaptive learning for speaker verification-WJ.pdf|slides]] ||别凡虎
+|   ||  || PPT模板 ||[[媒体文件:Weeklyreading_template.rar]]
 |-
-| rowspan="1"|2013/03/11  || Du Jinle|| VAD stuff || ||
+| 2021/04/01  ||Haoran Sun    || Zeus code regularization ||[[媒体文件:代码规范.pdf]]
 |-
-| rowspan="1"|2013/03/18  || || || ||
+| 2021/05/20  ||Chen Chen     || Overview of speech enhancement|| [[媒体文件:Speech_enhancement.pdf]]
 |-
-| rowspan="1"|2013/03/25  || || || ||
+| 2021/05/27  ||Di Wang       || Secret of 'hard trials' || [[媒体文件:Secret_of_hard_trials.pdf]]
 |-
-| rowspan="1"|2013/04/01  || || || ||
+| 2021/06/10  ||Jingxin Shen  ||Expriments about thermal to RGB face synthesis with cycleGan and pix2pix || [[媒体文件:Expriments about thermal to RGB face synthesis with cycleGan and pix2pix.pdf]]
 |-
-| rowspan="1"|2013/04/08  || 张陈昊|| A Fishervoice based Feature Fusion Method for SUSR ||[[媒体文件:130408-FisherVoice-ZCH.pdf|slides]] ||谢仲达
+| 2021/06/17  ||Yang Zhang    || NIPS2020: Long-Tailed Classification by Keeping the Good and Removing the Bad Momentum Causal Effect || [[媒体文件:long-tail.pdf]]
 |-
-| rowspan="1"|2013/04/15  ||龚宬|| An Exploration on Influence Factors of VAD's Performance in Speaker Recognition ||[[媒体文件:130415-An_Exploration_on_Influence_Factors_of_VAD-GC.pdf|slides]] ||
+| 2021/07/08  ||Tiankai Zhi   || Some experiments on stargan ||[[媒体文件:Some experiments on stargan.pdf]]
 |-
-| rowspan="1"|2013/04/22  ||王俊俊 || Understanding the Query: THCIB and THUIS at NTCIR-10 Intent Task ||[[媒体文件:130422-Understanding_the_Query-WJJ.pdf|slides‎]] ||
+| 2021/07/15  ||Jiao Han      || MG experiments based on ASV system || [[媒体文件:MG experiments based on ASV system..pptx]]
 |-
-| rowspan="1"|2013/04/29  || || || ||
+| 2021/07/22  ||Zixi Yan & Sirui Li || Unsupervised Speech Recognition || [[媒体文件:Unsupervised_Speech_Recognition.pdf]]
 |-
-| rowspan="1"|2013/05/06  ||别凡虎 ||MLLR on Emotional Speaker Recognition ||[[媒体文件:130506-MLLR on Emotional Speaker Recognition-BFH.pdf|slides]] ||
+| 2021/07/29  ||Pengqi Li    || A Simulation Study on 􏰛􏰜 Ro􏰛bust MAML || [[媒体文件:A Simulation Study on 􏰛􏰜 Ro􏰛bust MAML.pdf]]
 |-
-| rowspan="1"|2013/05/13  ||刘超 || The Use of Deep Neural Network for Speech Recognition || [[媒体文件:130513-the_use_of_dnn_for_asr-lc.pdf|slides]] ||
+| 2021/08/12  ||Qingyang Zhu || Noise-aware method for Speech Enhancement || [[媒体文件:Noise-aware method for Speech Enhancement.pdf]]
 |-
-| rowspan="1"|2013/05/20  || || || ||
+| 2021/08/12  ||Weida Liang  ||  Unsupervised Audio-Visual Synthesis via Exemplar Autoencoders  ||  [[媒体文件:Bi-weekly_report_Liangwd.pdf]]
 |-
-| rowspan="1"|2013/05/27  ||王琳琳|| 说话人识别中的时变鲁棒性问题研究 || [[媒体文件:130527-TVSV-Wll.pdf|slides]] ||
+| 2021/08/19  ||Di Wang      || Inter Dataset Variability Compensation ||   [[媒体文件:Inter_dataset_variability_compensation.pdf]]
 |-
-| rowspan="1"|2013/06/03  ||王俊俊|| 汉语搜索结果聚类系统研究与实现 || [[媒体文件:130601-毕业答辩-02-WJJ.pdf|slides]] ||
+| 2021/09/02  ||Tiankai Zhi  || One Shot VC || [[媒体文件:One_shot_VC.pdf]]
 |-
-| rowspan="1"|2013/06/10  || || || ||
+| 2021/09/09  ||Jingxin Shen || Thermal Speaking || [[媒体文件:Thermal_Speaking_2021.pdf]]
 |-
-| rowspan="1"|2013/06/17  ||范淼 || Relation Extraction ||[[媒体文件:130617-relation_extraction-fm.pdf|slides]] ||
+| 2021/09/23  ||Sirui Li & Zixi Yan || Wav2vec-U Experimental Report || [[媒体文件:Wav2vec-U_experimental_report.pdf ‎]]
 |-
-| rowspan="1"|2013/06/24  ||唐国瑜 || Incorporating Statistical Word Senses in Topic Model  ||[[媒体文件:130624_Incorporating Statistical Word Senses in Topic Model_TGY.pdf|slides]] ||
+| 2021/10/20  ||Renmiao Chen || Is Someone Speaking? || [[媒体文件:Is_Someone_Speaking_Exploring_Long-term_Temporal_Features.pdf ‎]]
 |-
-| rowspan="1"|2013/07/01  || || || ||
+| 2021/10/28  ||Chen Chen    || WenetSpeech Introduction || [[媒体文件:WenetSpeech_Dataset_Introduction.pdf ‎]]
 |-
-| rowspan="1"|2013/07/08  ||  || || ||
+| 2021/11/10  ||Weida Liang  || Cycle-loss Exemplar Autoencoder || [[媒体文件:Cycle-loss_Exemplar_Autoencoder.pdf ‎]]
 |-
-| rowspan="1"|2013/07/15  || || || ||
+| 2021/11/17  ||吾买尔江      || Modulation Spectrum || [[媒体文件:Modulation_Spectrum.pdf ‎]]
 |-
-| rowspan="1"|2013/09/09  ||王东 || Research Frontier in Speech Technology||[[媒体文件:Research Frontier in Speech Technology.pdf|slides]] ||
+| 2021/11/24  ||Chen Chen    || S-DCCRN || [[媒体文件:S-DCCRN_pdf.pdf ‎]]
 |-
-| rowspan="1"|2013/09/16  || || || ||
+| 2021/12/01  ||Pengqi Li    || GuidedMix: An on-the-fly data augmentation approach for robust speaker recognition system || [[媒体文件:201201-GuidedMix-LPQ.pdf ‎]]
 |-
-| rowspan="1"|2013/09/23  || || || ||
+| 2021/12/08  ||Renmiao Chen || Multimodal preson verification ||  [[媒体文件:Multimodal_preson_verification.pdf]]
 |-
-| rowspan="1"|2013/09/30  || || || ||
+| 2021/12/15  ||Ruihai Hou   || Crossmodal clustered contrastive learning: Grounding of spoken language to gesture || [[媒体文件:Crossmodal_clustered_contrasti.pdf]]
 |-
-| rowspan="1"|2013/10/07  || || || ||
+| 2021/12/29  ||Zixi Yan     || Capsules Network || [[媒体文件:Capsules_Network.pdf]]
 |-
-| rowspan="1"|2013/10/14  || || || ||
+| 2022/01/05  ||Sirui Li     || Self-Supervised Learning for speech recognition with Intermediate layer supervision || [[媒体文件:SSL with Intermediate layer supervision.pdf]]
 |-
-| rowspan="1"|2013/10/21  ||范淼 ||Transduction Classification with Matrix Completion （中文报告）||[[媒体文件: Transduction_Classifiction_with_Matrix_Completion.pdf‎|slides]] [http://pages.cs.wisc.edu/~jerryzhu/pub/mc4ssl_FINAL.pdf paper]|| 李蓝天
+| 2022/01/12  ||Weida Liang  || FragmentVC || [[媒体文件:FragmentVC.pdf]]
 |-
-| rowspan="1"|2013/10/28  || || || ||
+| 2022/01/19  ||Haoyu Jiang  || Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video || [[媒体文件:Multi-modality_Associative_Bridging_through_Memory.pdf]]
 |-
-| rowspan="1"|2013/11/04  || 王军 || 基于i-vector的intersession补偿及打分方法(综述) || [[媒体文件:131104-ivecto下intersession补偿及打分方法--01-WJ-.pdf‎|slides]]||
+| 2022/02/14  ||             || Interspeech 2021 Review || [[媒体文件:Interspeech_paper_review_min.pdf]]
 |-
-| rowspan="1"|2013/11/11  ||张陈昊 ||PLDA介绍及PLDA在说话人识别中的应用 ||[[媒体文件:PLDA.pdf|slides]] || 唐国瑜
+| 2022/02/16  ||Chen Chen    || Audio Visual HuBERT || [[媒体文件:AVHuBERT.pdf]]
 |-
-| rowspan="1"|2013/11/18  ||别凡虎 ||i-vector理论介绍（讨论）||[[媒体文件:131118-i-vector_and_GMM-UBM-BFH.pdf|slides]]‎  ||王军
+| 2022/03/04  ||Pengqi Li    || Study of Visualization || [[媒体文件:Visualization.pdf]]
 |-
-| rowspan="1"|2013/11/25  ||刘超 || Pruning Neural Networks By Optimal Brain Damage(综述)||[[媒体文件:131125-OBD-LC-01.pdf|slides]] ||范淼
+| 2022/03/11  ||Renmiao Chen || Can audio-visual integration strengthen robustness under multimodal attacks? || [[媒体文件:Audio-Visual_Robustness_Under_Multimodal_Attacks.pdf]]
 |-
-| rowspan="1"|2013/12/02  ||范淼 ||Distant Supervision for Relation Extraction with Matrix Completion （英文报告）||[[媒体文件:131202-DRMC-FM-01.pdf|slides]] || 李蓝天
+| 2022/03/11  ||吾买尔江      || Signal Separation || [[媒体文件:Signal_Separation.pdf]]
 |-
-| rowspan="1"|2013/12/09  || Dong Wang|| Introduction to the HMM-based speech synthesis||[http://hts.sp.nitech.ac.jp/archives/2.2/HTS_Slides.zip slides] ||
+| 2022/03/18  ||Chen Chen    || Overview on Lip Reading and Audio-visual Speech Recognition || [[媒体文件:LipReadingAndAVSR.pdf]]
 |-
-| rowspan="1"|2013/12/16  ||张陈昊 ||语音研究中的基元介绍 ||[[媒体文件:131215-Phonology-ZCH.pdf|slides]]  ||
+| 2022/04/01  ||Ruihai Hou   || Scalable Identity-Oriented Speech Retrieval || [[媒体文件:Scalable_Identity-Oriented_Speech_Retrieval.pdf]]
 |-
-| rowspan="1"|2013/12/23  || Dong Wang|| Introduction to the HMM-based speech synthesis (2)||[http://hts.sp.nitech.ac.jp/archives/2.2/HTS_Slides.zip slides] ||
+| 2022/04/08  ||Zixi Yan     || Wav2vec related papers share || [[媒体文件:Wav2vec_related_papers.pdf]]
 |-
-| rowspan="1"|2013/12/23  || || || ||
+| 2022/04/22  ||Sirui Li     || Speech-Based Language Modelling || [[媒体文件:Speech-Based Language Modelling.pdf]]
 |-
-| rowspan="1"|2013/12/30  ||刘荣 || continuous space language model||[[媒体文件:Cslm-cslt.pdf|slides]]  ||刘超
+| 2022/04/29  ||Haoyu Jiang  || Models of Speaker Recognition || [[媒体文件:Models_of_Speaker_Recognition.pdf]]
 |-
-| rowspan="1"|2014/01/06  || || || ||
+| 2022/05/13  ||Chen Chen    || Audio-visual Representation Learning  || [[媒体文件:Audio_visual_representation_learning.pdf]]
 |-
-| rowspan="1"|2014/01/13  || || || ||
+| 2022/05/20  ||Haoran Sun   ||  ||
 |-
-| rowspan="1"|2014/01/20  || || || ||
+| 2022/05/27  ||Pengqi Li    || The important ”feature” for speaker recognition || [[媒体文件:The important ”feature” for speaker recognition.pdf]]
 |-
-| rowspan="1"|2014/02/24  || || || ||
+| 2022/06/10  ||Zixi Yan     || Paper Share || [[媒体文件:Paper_share_yzx0610.pdf]]
 |-
-| rowspan="1"|2014/03/03  || || || ||
+| 2022/06/24  ||Renmiao Chen || Transformer in multimodal || [[媒体文件:Transformer_in_multimodal.pdf]]
 |-
-| rowspan="1"|2014/03/10  ||范淼|| Distant Supervision for Information Extraction (英文报告)|| || 李蓝天
+|             ||             || ICASSP 2022 review || [[媒体文件:ICASSP2022_review.pdf]]  [[媒体文件:ICASSP-2022-readinglist.pdf]]
 |-
-| rowspan="1"|2014/03/17  ||唐国瑜 || Topic Models Incorporating Statistical Word Senses || [[媒体文件:TMISWS_For_CICLing2014.pdf|slides]]||
+| 2022/07/04  ||Chen Chen    || Video to Speech papers || [[媒体文件:VTS_cc.pdf]]
 |-
-| rowspan="1"|2014/03/24  ||孟祥涛 || Noisy training for Deep Neural Networks|| ||
+| 2022/07/08  ||Ruihai Hou   || ICASSP 2022 review (part) || [[媒体文件:Weeklyreading_hrh.pdf]]
 |-
-| rowspan="1"|2014/03/31  ||范淼|| Translating Embeddings for Modeling Multi-relational Data （中文报告） || [https://www.hds.utc.fr/everest/lib/exe/fetch.php?id=en%3Atranse&cache=cache&media=en:cr_paper_nips13.pdf paper]||李蓝天
+| 2022/07/15  ||Sirui Li     || Towards End-to-end Unsupervised Speech Recognition || [[媒体文件:Towards_End_to_end_Unsupervised_Speech_Recognition.pdf]]
 |-
-| rowspan="1"|2014/04/07  || || || ||
+| 2022/07/22  ||Wan Lin      || AutoED: Text-independent unsupervised speaker recognition Model|| [[媒体文件:AutoED_spk_reg.pdf]]
 |-
-| rowspan="1"|2014/04/14  || Wang Jun|| I-vector and PLDA in depth ||[[媒体文件:131104-ivector-microsoft-wj.pdf|slides]]  ||
+| 2022/07/29  ||Haoyu Jiang  || ArcFace_iQIYI-VID || [[媒体文件:ArcFace_iQIYI-VID.pdf]]
 |-
-| rowspan="1"|2014/04/21  || 邱晗||汉语事件句式规范化处理 ||[[媒体文件:140421-汉语事件句式规范化-QH.pdf‎|slides]] ||
+| 2022/08/05  ||Chen Chen    || Recent advance in VTS task || [[媒体文件:RecentVTS.pdf]]
 |-
-| rowspan="1"|2014/04/28  || 唐国瑜|| Some papers in　CICLing2014 ||[[媒体文件:Some_papers_in_CICling2014.pdf|slides]]  ||刘超
+| 2022/08/12  ||Tianhao Wang || Extremal Perturbations || [[媒体文件:Extremal_perturbations.pdf]]
 |-
-| rowspan="1"|2014/05/05  || || || ||
+| 2022/08/19  ||Renmiao Chen || The correlation of face and vioce || [[媒体文件:The_correlation_of_face_and_vioce_CRM.pdf]]
 |-
-| rowspan="1"|2014/05/12  || 卡尔|| paper introduction || [[媒体文件:Acoustic Factor Analysis.pdf|slides]] || 邱晗
+| 2022/09/02  ||Zixi Yan     || Non-Contrastive Self-supervised Learning || [[媒体文件:Non_contrastive_Self_supervised_Learning.pdf]]
 |-
-| rowspan="2"|2014/05/19  || 邱晗|| 汉语事件句式CCG推导树重构 ||[[媒体文件:140519-CCG_reConstruction.pdf‎|slides]]‎|| 卡尔
+| 2022/09/09  ||Sirui Li     || Low Resource Speech Recognition || [[媒体文件:Low_Resource_Speech_Recognition_lsr_0909.pdf]]
 |-
-|Liu Chao|| master proposal: sparse and deep neural networks || [[媒体文件:140519-proposal-LC-01.pdf|slides]] ||
+| 2022/09/16  ||Xipin Wei    || Controllable Multi-style Music Generation Model based on simple Contrastive Learning || [[媒体文件:Controllable_Multi_style_Music_Generation_Model_based_on_simple_Contrastive_learning.pdf]]
 |-
-| rowspan="1"| || Liu Chao|| 2nd master proposal: sparse and deep neural networks|| ||
+| 2022/09/23  ||Haoyu Jiang  || Audio Visual Learning || [[媒体文件:Audio_Visual_Learning.pdf]]
 |-
-| rowspan="1"|2014/06/16  || 别凡虎 || Truncated Wave based VPR and Some Recent Work || [[媒体文件:140614-Truncated_Speech_based_VPR.pdf‎|slides]]‎ || 别凡虎
+| 2022/09/30  ||Chen Chen    || Speech Quality Assessment || [[媒体文件:220930_cchen_SpeechQualityAssessment.pdf]]
 |-
-| rowspan="1"|2014/06/23  || 别凡虎 || Block-wise training for I-vector || [[媒体文件:140623-Block-wise training for I-vector.pdf‎|slides]]‎ || 别凡虎
+| 2022/10/07  ||Wan Lin      || Cross-Domain Speaker Recognition || [[媒体文件:Cross_Domain_Speaker_Recognition.pdf]]
 |-
-| rowspan="1"| 2014/07/07||王军 ||Discriminative Scoring for Speaker Recognition Based on I-vectors || [[媒体文件:140707-work_report.pdf|slides]]|| 王军
+| 2022/10/14  ||Tianhao Wang || How do deep speaker models treat silence and noises || [[媒体文件:20221014_wth.pdf]]
 |-
-| rowspan="1"| 2014/09/01|| || || ||
+| 2022/10/31  ||Pengqi Li    || Visualization of a specific filter in CNN || [[媒体文件:Visualization of a specific filter in CNN.pdf]]
 |-
-| rowspan="1"|2014/09/09 ||别凡虎 ||Reseach on Truncated Wave based VPR||[[媒体文件:140909-Truncated Speech based VPR.pdf|slides]] || 别凡虎
+| 2022/11/04  ||Zhenyu Zhou  || Acoustic-aware Training for Multi-genre Speaker Recognition || [[媒体文件:20221104_acoustic_training.pdf]]
 |-
-| rowspan="1"| 2014/09/15|| || || ||
+| 2022/11/07  ||Chen Chen & Renmiao Chen || Experience and perceptions of collecting Audio-Visual dataset || [[媒体文件:20221107_cc_crm.pdf]]
 |-
-| rowspan="1"|2014/09/22  || Miao Fan|| Large-scale Entity Relation Extraction based on Low-dimensional Representations (中文报告，博士开题)
+| 2022/12/23  ||Renmiao Chen || IS22 and Perceiver IO|| [[媒体文件:221223CRM.pdf]]
-||[[媒体文件:基于低维表示的大规模实体关系挖掘技术.pdf‎|slides]] || Lan TianLi
 |-
-| rowspan="1"| 2014/09/29 || || || ||
+| 2022/12/23  ||Dong Wang    || NIPS2022 || [[媒体文件:NIPS2022.pdf]]
 |-
-| rowspan="1"|2014/10/13  || Miao Fan|| The Frontier of Knowledge Embedding （英文报告）|| [[媒体文件:The_Frontier_of_Knowledge_Embedding.pdf‎|slides]]|| Lan TianLi
+| 2022/12/30  ||Chen Chen    || Perceptual in Generative Audio Models || [[媒体文件:221230_cc.pdf]]
 |-
-| rowspan="1"|2014/10/20  || || || ||
+|             ||             || IS22_review || [[媒体文件:IS22_review_all.pdf]]
 |-
-| rowspan="1"|2014/10/27  || Li Yi || Phonemes, Features, and Syllables: Converting Onset and Rime Inventories to Consonants and Vowels||[[媒体文件:Lanzhou Phonemes, Features, and Syllables- fianl.pdf|paper]] [[媒体文件:Syllables and phonemes - 20141027.pdf|slides]]||
+| 2023/02/10  ||Jiaying Wang || Ordered binary speaker embedding || [[媒体文件:230210wjy.pdf]]
 |-
-| rowspan="1"|2014/11/3   || 米吉提|| Automatic Speech Recognition of Agglutinative Language based on Lexicon Optimization||[[媒体文件:Mijit-slides-清华大学-2014-11-3.pdf|slides]] ||
+| 2023/02/17  ||Xipin Wei    || MSAT: A Multi-Scale Attentive Transformer for Multi-Instrument Symbolic Music Generation || [[媒体文件:MSAT_wxp.pdf]]
 |-
-| rowspan="1"|2014/11/10  || || || ||
+| 2023/03/10  ||Zhenyu Zhou  || consistence_loss&BCE_loss ||  [[媒体文件:consistence_loss&BCE_loss.pdf]]
 |-
-| rowspan="1"|2014/11/17  ||Dong Wang || Highly restricted keyword spotting for Uyghur using sparse analysis|| [[媒体文件:Highly Restricted Keyword Selection Based on Sparse Analysis.pdf|slides]]||
+| 2023/03/17  ||Tianhao Wang || Score calibration in speaker verification || [[媒体文件:Score_calibration_in_speaker_verification.pdf]]
 |-
-| rowspan="1"|2014/11/24  || || || ||
+| 2023/03/31  ||Wan Lin      || Understand contrast and non-contrast in self-supervised learning || [[媒体文件:Understand contrast and non-contrast in self-supervised learning.pdf]]
 |-
-| rowspan="1"|2014/12/1  ||ZhongDa Xie ||Incorporating Fine-Grained Ontological Relations in Medical Document Ranking || [[媒体文件:Fine-grained_relations.pdf|slides]]|| Lantian Li
+| 2023/04/14  ||Pengqi Li    || Towards Attribution Methods in Deep Speaker Recognition || [[媒体文件:Towards_Attribution_Methods_in_Deep_Speaker_Recognition_230414_lpq.pdf]]
 |-
-| rowspan="1"|2014/12/8  || || || ||
+| 2023/04/21  ||Chen Chen    || Masked Prediction Task Based Self-supervised Multimodal Learning || [[媒体文件:Masked_prediction_task_based_self-supervised_multimodal_learning.pdf]]
 |-
-| rowspan="1"|2014/12/15  || 唐国瑜 || 跨语言话题分析关键技术研究 ||[[媒体文件:141205-答辩-TGY.pdf|slides]] ||
+| 2022/04/28  ||Xiaolou Li   || Incomplete Multimodal Method Exploration || [[媒体文件:Incomplete_Multimodal_Method_Exploration.pdf]]
 |-
-| rowspan="1"|2014/12/22  || || || ||
+| 2022/05/04  ||Renmiao Chen || Applications of Diffusion Model || [[媒体文件:230505_Applications_of_Diffusion_Model_CRM.pdf]]
 |-
-| rowspan="1"|2014/12/29  || Askar || Language Mismatch in Speaker Recognition System||[[媒体文件:141229--askar.pdf|slides]] ||
+| 2022/05/12  ||Jiaying Wang ||  ||
 |-
-| rowspan="1"|2015/1/5  ||Lantian Li || Deep Neural Networks for Speaker Recognition || [[媒体文件:150104_Deep_Neural_Networks_for_Speaker_Recognition_LLT.pdf|slides]]||
+| 2022/05/19  ||Zhenyu Zhou  ||  ||
 |-
-| rowspan="1"|2015/1/12  || || || ||
+| 2022/05/26  ||Tianhao Wang ||  ||
 |-
-| rowspan="1"|2015/1/19  || Dong Wang || Machine Learning Paradigms for Speech Recognition||[[媒体文件:Machine Learning Paradigms for Speech Recognition.pdf|slides]]  [http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6423821 paper] ||
+| 2022/06/02  ||Pengqi Li    ||  ||
 |-
-| rowspan="1"|2015/1/26  || Chen Guorong || Information Transmission and Distribution on Web ||[[媒体文件:An_introduction_of_complex_network1.pdf|slides]] ||
+| 2022/06/09  ||Wan Lin      ||  ||
-|-
-| rowspan="1" |2015/3/9 || Dong Wang || Joint Deep Learning || [[媒体文件:Joint Deep Learning.pdf|slides]] ||
-|-
-| rowspan="1"|2015/3/16  || Dongxu Zhang || Knowledge learning from text data and knowledge bases || [[媒体文件:Joint Deep Learning.pdf|slides]] ||
-|-
-| rowspan="1"|2015/4/13  || Xuewei Zhang || Lasso-based Reverberation Suppression In Automatic Speech Recognition || [[媒体文件:Lasso-based Reverberation Suppression In Automatic Speech Recognition.pdf|slides]] ||
-|-
-| rowspan="1"|2015/5/11  || Dong Wang ||ASR and SID Research Frontier ||[[媒体文件:ASR and SID Research Frontier.pdf|slides]] ||
-|-
-| rowspan="1"|2015/11/23  || Zhiyuan Tang|| CTC learning|| [[媒体文件:CTC.pdf|slides]] ||
-|-
-| rowspan="1"|2015/11/30  || Mengyuan Zhao|| CNN-based music removal|| [[媒体文件:Music Removal by Convolutional Denoising.pdf | slides]] ||
-|-
-| rowspan="1"|2015/12/3  || Zhiyuan Tang|| Networks of Memory|| [[媒体文件:Memory_net.pdf|slides]] ||
-|-
-| rowspan="1"|2015/12/7  || Yiqiao Pan|| Document Classification with Spherical Word Vectors||[[媒体文件:Document Classification with Spherical Word Vectors.pdf|slides]] ||
-|-
-| rowspan="1"|2015/12/14  || Dong Wang || Transfer Learning for Speech and Language Processing ||[[媒体文件:Transfer_Learning_for_Speech_and_Language_Processing.pdf|slides]] ||
-|-
-| rowspan="1"|2015/12/21  || Qixin Wang || Attention for poem generation ||[[媒体文件:Ijcai 2016.pptx|slides]] ||
-|-
-| rowspan="1"|2015/12/28  || Lantian Li || Max-margin metric learning for speaker recognition || [[媒体文件:Max-margin-Metric-Learning.pdf|slides]]||
-|-
-| rowspan="1"|2016/1/4  || Zhiyong Zhang || Parallel training,MPE and natural gradient||[[媒体文件:20160104_张之勇_Large-scale Parallel Training in Speech Recognition.pdf|slides]]||
-|-
-| rowspan="1"|2016/1/18  || Dongxu Zhang || Memoryless Document Vector ||[[媒体文件:Memoryless_document_vector.pdf|slides]]||
-|-
-| rowspan="1"|2016/3/14  || Zhiyuan Tang|| Oral presentation for "vMF-SNE: Embedding for Spherical Data"|| [[媒体文件:embedding.pdf|slides]] ||
-|-
-| rowspan="1"|2016/3/28  || Tianyi Luo || Review for Neural QA || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/2/29/CSLT_Weekly_Report--20160328.pdf slides] ||
-|-
-| rowspan="1"|2016/4/11  || Rong Liu || Recommendation in Youku || [http://cslt.riit.tsinghua.edu.cn/mediawiki/index.php/%E6%96%87%E4%BB%B6:Cslt%E5%AE%9E%E9%AA%8C%E5%AE%A4%E4%BA%A4%E6%B5%81.pptx slides] ||
-|-
-| rowspan="1"|2016/5/09 || Miao Fan || Learning contextual embeddings of knowledge base with entity descriptions.|| [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/9/9c/Techreport_CSLT_2016_M.F..pdf slides]  ||
-|-
-| rowspan="1"|2016/5/16 || Yang Wang || Research on conversation thread detection. || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/b/bb/%E6%B1%AA%E6%B4%8B-%E6%AF%95%E8%AE%BE-CSLT.pdf slides]  ||
-|-
-| rowspan="1"|2016/5/20 || Yang Wang &  Maoning Wang || Research on portfolio selection. || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/8/89/%E6%B1%AA%E6%B4%8B-%E9%87%91%E8%9E%8D%E7%AC%AC%E4%B8%80%E6%AC%A1%E5%88%86%E4%BA%AB.pdf slides1]  [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/b/bb/%E6%B1%87%E6%8A%A5_%E8%B5%84%E4%BA%A7%E7%BB%84%E5%90%88%E4%B8%AD%E5%87%A0%E4%B8%AA%E8%AF%84%E4%BB%B7%E6%8C%87%E6%A0%87%E7%9A%84%E8%A7%A3%E9%87%8A.pdf slides2]||
-|-
-| rowspan="1"|2016/5/20  || Zhiyuan Tang || ICASSP 2016 summary || [[媒体文件:Note icassp16.pdf|slides]] ||
-|-
-| rowspan="1"|2016/5/23 || Dong Wang || graphical model and neural model || [[媒体文件:Graphic Model and Neural Model.pdf|slides]] [[媒体文件:Generative-Pdf.rar|papers]]  ||
-|-
-| rowspan="1"|2016/8/02 || Zhiyuan Tang || Visualizing, Measuring and Understanding Neural Networks: A Brief Survey|| [[媒体文件:Nn analysis.pdf|slides]] ||
-|-
-| rowspan="1"|2016/8/03 || Yang Wang || Neural networks and genetic programming for financial forecasting || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/7/79/GeneticNN.pdf slides] ||
-|-
-| rowspan="1"|2016/11/05 || Yang Wang || Reinforcement Learning Models and Simulations || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/c/ca/RRL_and_sim.pdf slides] ||
-|-
-| rowspan="1"|2016/11/08 || April Pu || SOFTWARE DEVELIPMENT METHODOLOGIES || [http://wangd.cslt.org/talks/pdf/april_software.pptx slides] ||
-|-
-| rowspan="1"|2016/11/12 || Yang Wang || Generative Adversarial Nets || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/c/c9/Generative_adversarial_network.pdf slides] ||
-|-
-| rowspan="1"|2016/11/22 || Zhiyuan Tang || INTERSPEECH 2016 summary || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/6/65/Interspeech16_review.pdf slides] ||
-|-
-| rowspan="1"|2016/11/30 || Dong Wang || Deep and sparse learning in speech and language: an overview || [http://wangd.cslt.org/talks/pdf/bics2016.pptx slides] ||
-|-
-| rowspan="1"|2017/2/17 || Yang Wang || Review understanding deep learning requires rethinking generalization || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/3/3b/Review_understanding_deep_learning_requires_rethinking_generalization.pdf slides] ||
-|-
-| rowspan="1"|2017/6/5 || Dong Wang || Deep speech factorization || [http://wangd.cslt.org/talks/pdf/Deep-Speech-Factorization.pdf slides] ||
-|-
-| rowspan="1"|2017/6/8 || Shiyue Zhang || Convolutional Sequence to Sequence Learning  || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/f/f3/Conv_seq2seq.pptx slides] ||
-|-
-| rowspan="1"|2017/6/12 || Shiyue Zhang || Memory-augmented Neural Machine Translation || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/3/36/Memory-augmented_Neural_Machine_Translation_.pptx slides] ||
-|-
-| rowspan="1"|2017/6/21 || Shiyue Zhang || Attention Is All You Need  || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/6/68/Attention_is_all_you_need.pptx slides] ||
-|-
-| rowspan="1"|2017/6/26 || Jiyuan Zhang || Chinese poem generation using neural model  || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/5/50/Flexible_and_Creative_Chinese_Poetry_Generation_Using_Neural_Memory_.pptx slides] ||
-|-
-| rowspan="1"|2017/6/21 || Miao Zhang || Speaker recognition on cough,laugh and wei  ||
-[http://cslt.riit.tsinghua.edu.cn/mediawiki/images/f/f6/Zm_cough.pdf slides]
-||
-|-
-| rowspan="1"|2017/7/10 || Aodong Li || Enhanced Neural Machine Translation by Learning from Draft  || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/c/ca/Learning_from_draft.pptx slides] ||
-|-
-| rowspan="1"|2017/7/17 || Lantian Li || Study on Speaker Recognition  || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/e/ec/170716-Study_on_SRE.pdf slides] ||
-|-
-| rowspan="1"|2018/12/6 || Xiuqi Jiang ||  Meta-Learning and Zero-Shot Learning  || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/1/18/181205_Meta-Learning_and_Zero-Shot_Learning_JXQ.pdf slides] ||
-|-
-| rowspan="1"|2018/12/12 || Dan He ||  Tensor factorization neural net  || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/3/3d/Tensor_factorization_neural_net.pdf slides] ||
-|-
-| rowspan="1"|2018/12/26 || Dong Wang || Towards deep statistical speaker representation  || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/4/48/V.pdf slides] ||
-|-
-| rowspan="1"|2019/01/04 || Dong Wang || Speech in NIPS 2017/2018  || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/c/c8/Speech_in_NIPS_2017.pdf slides] ||
-|-
-| rowspan="1"|2019/07/17 || Dong Wang || Deep Feature Learning and Normalization for Speaker Recognition  || [http://wangd.cslt.org/talks/pdf/india.pdf slides] ||
-|-
-| rowspan="1"|2019/08/19 || Sitong Cheng & Pengyuan Zhang || Periodic Report of Celebrity Video Data Collection.   || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/0/08/C-STAR.pdf slides] ||
-|-
-| rowspan="1"|2019/08/19 || Dong Wang|| Continuous Learning for Neural Nets || [[媒体文件:Continuous Learning for Neural Nets.pdf|slides]]||
-|-
-| rowspan="1"|2019/09/11 || Dong Wang || Language Recognition in ICASSP 2019   || [http://wangd.cslt.org/talks/pdf/LRE-ICASSP-2019.pdf slides] ||
-|-
-| rowspan="1"|2019/09/11 || Sitong Cheng || Language Recognition in Interspeech 2019   || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/a/a9/Language_Recognition_in_Interspeech_2019.pdf slides] ||
-|-
-| rowspan="1"|2019/10/14 || Haoran Sun || Dimension Reduction  || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/7/7b/DimensionReduction.pdf slides] ||
-|-
-| rowspan="1"|2019/10/27 || Dong Wang || Back to Matrix  || [[媒体文件:Back to Matrix.pdf|slides]] ||
-|-
-| rowspan="1"|2019/11/11 || Dong Wang || Helmholtz Machine & The ML criterion  || [[媒体文件:Helmholtz Machine & The ML criterion.pdf|slides]] ||
-|-
-| rowspan="1"|2019/12/02 || Jiawen Kang || Gan Laten Space Manipulation & Flow Application Papers  || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/c/ca/GAN_Lantent_Space_manunipulation_%26_Flow_Application.pdf slides] ||
-|-
-| rowspan="1"|2019/12/09 || Dong Wang || Style transfer and information factorization || [[媒体文件:Style Transfer with Generative Models.pdf|slides]] ||
-|-
-| rowspan="1"|2019/12/16 || Zhiyuan Tang ||  Conditional Generative Flow  ||  [[媒体文件:Conditional GLow.pdf|slides]] ||
-|-
-| rowspan="1"|2019/12/23 || Lantian Li ||  Deep Generative Model in Speaker Recognition || [[媒体文件:Deep Generative Model in Speaker Recognition.pdf|slides]] ||
-|-
-| rowspan="1"|2019/12/30 || Wenqiang Du ||  Cross-bandwidth Train || [[媒体文件:Cross-bandwidth_Train.pdf|slides]] ||
-|-
-| rowspan="1"|2019/01/06 || Yunqi Cai ||  Do Deep Generative Models Know What They Don't Know ?|| [[媒体文件:2020.1.6_group_meeting.pdf|slides]] ||
-|-
-| rowspan="1"|2019/01/10 || Haoran Sun ||  Flow++: Improving Flow-Based Generative Models with Variational Dequantization and Architecture Design || [[媒体文件:Flow++.pdf|slides]] ||
-|-
-| rowspan="1"|2020/01/13 || Ying Shi ||  Deep Generative Model Energy Based Model || [[媒体文件:Deep_Generative_Model.pdf|slides]] ||
-|-
-| rowspan="1"|2020/02/10 || Dong Wang ||  Deep Generative Models for Discriminative Tasks || [[媒体文件:Re-Thinking for Discriminative and Generative Models.pdf|slides]]||
-|-
-| rowspan="1"|2020/02/17 || Zhiyuan Tang ||  Unsupervised Learning of Disentangled Representations  || [[媒体文件:20200217 Unsupervised disentanglement.pdf|slides]] ||
-|-
-| rowspan="1"|2020/02/24 || Lantian Li ||  Weakly- & Self-Supervised Learning || [[媒体文件:Weakly-_%26_Self-Supervised_Learning.pdf|slides]] ||
-|-
-| rowspan="1"|2020/03/02 || Yunqi Cai ||  Deep Normalization for Speaker Vectors|| [[媒体文件:Deep_Normalization_for_Speaker_Vectors_.pdf|slides]]||
-|-
-| rowspan="1"|2020/03/09 || Ying Shi ||  Speech Enhancement base on Double Flow || [[媒体文件:Speech_Enhancement_base_on_Double_Flow.pdf|slides]]||
-|-
-| rowspan="1"|2020/03/16 || Dong Wang ||  Bayesian scoring and uncertainty manipulation || [[媒体文件:Uncertainty Propagation.pdf|slides]]||
-|-
-| rowspan="1"|2020/03/23 || Zhiyuan Tang || Classifier involves Energy Based Model  || [[媒体文件:200323 energy model.pdf|slides]] ||
-|-
-| rowspan="1"|2020/03/30 || Lantian Li ||  Bayesian scoring in speaker verification || Temporarily held for security ||
-|-
-| rowspan="1"|2020/04/06 || Yunqi Cai ||  Posterior Collapse|| [[媒体文件:Posterior_Collapse.pdf|slides]]||
-|-
-| rowspan="1"|2020/04/13 || Lantian Li || NDA in ASV || Temporarily held for security [cvss 761] ||
-|-
-| rowspan="1"|2020/04/20 || Ying Shi ||  Speech_Enhancement_base_on_Flow ||[[媒体文件:Speech_Enhancement_base_on_Flow.pdf|slides]] ||
-|-
-| rowspan="1"|2020/05/11 || Dong Wang  ||  Real DNF || [[媒体文件:Real_DNF.pdf|Slide]]  ||
-|-
-| rowspan="1"|2020/05/26 || Sitong Cheng ||  ASR-Free Pronunciation Assessment || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/9/9a/ASR-Free_Pronunciation_Assessment.pdf slides] ||
-|-
-| rowspan="1"|2020/05/26 || Jiawen Kang ||  RobustMAML || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/8/8e/RobustMAML.pdf slides] ||
-|-
-| rowspan="1"|2020/05/26 || Jiawen Kang ||  Domain adaptation review || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/6/6d/Presentation-Meta-learning.pdf slides] ||
-|-
-| rowspan="1"|2020/05/26 || Jiawen Kang ||  SOTA models for VPR || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/d/d2/SOTA_models_for_VPR.pdf slides] ||
-|-
-| rowspan="1"|2020/06/01 || Dong Wang || How MAML succeeded?  || [https://arxiv.org/pdf/1909.09157.pdf][https://pdfs.semanticscholar.org/e6e9/c9d50b11ced939faf42f1c65bf9360eefd73.pdf][https://arxiv.org/pdf/1706.05806.pdf] ||
-|-
-| rowspan="1"|2020/06/09 || Zhiyuan Tang  ||  Flow Wheels || [[媒体文件:20200408 flow wheels.pdf|slides]]  ||
-|-
-| rowspan="1"|2020/06/15 || Lantian Li  ||  Uncertainty Modeling and Inference || [[媒体文件:200615-Uncertainty.pdf|slides]]  ||
-|-
-| rowspan="1"|2020/06/22 || Lantian Li  ||  Gaussians in High Dimension || [[媒体文件:High-dimensioaln-Gaussian.pdf|slides]]  ||
-|-
-| rowspan="1"|2020/06/22 || Dong Wang  ||  Self training for SE and ASR || [[媒体文件:Self-Training.pdf|slides]]  ||
-|-
-| rowspan="1"|2020/06/29 || Ying Shi  ||  Speech enhancement & separation || [[媒体文件:Speech-Separation-and-Enhancement.pdf|slides]]  ||
-|-
-| rowspan="1"|2020/07/06 || Haolin Chen  ||  Self-supervised Learning in Speech Processing || [[媒体文件:Self-Supervised.pptx|slides]]  ||
-|-
-| rowspan="1"|2020/07/13 || Zhiyuan Tang  || Exploding inverse in INN || [[媒体文件:20200713 dig into flow.pdf|slides]]  ||
-|-
-| rowspan="1"|2020/07/20 || Lantian Li  || Principle Solution for Enroll-Test Mismatch || [[媒体文件:200720-mismatch.pdf|slides]]  ||
-|-
-| rowspan="1"|2020/08/17 || Dong Wang  || Decoupled scoring || [[媒体文件:Decoupled.pdf|slides]] ||
-|-
-| rowspan="1"|2020/08/24 || Zhiyuan Tang || G & D Acoustic model ||  [[媒体文件:20200824 flow asr.pdf | slides]]   ||
-|-
-| rowspan="1"|2020/09/01 || Lantian Li || Decoupled NL ||     ||
-|-
-| rowspan="1"|2020/09/07 || Yunqi Cai ||Deep generative model based Anomaly detection||[[媒体文件:Anomaly_detection.pdf | slides]]||
-|-
-| rowspan="1"|2020/09/14 || Dong Wang || How we factorize speech? || [[媒体文件:Factorization.pdf|slides]]      ||
-|-
-| rowspan="1"|2020/10/05 || Dong Wang || Remarks on DNF || [[媒体文件:Remakrs on DNF.pptx|slides]]      ||
-|-
-| rowspan="1"|2020/10/12 || Dong Wang || Paper Reading: Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations || [[媒体文件:Challenge-disentanglement.pptx|slides]]  [http://proceedings.mlr.press/v97/locatello19a/locatello19a.pdf paper link]      ||
-|-
-| rowspan="1"|2020/10/19 || Haoran Sun || Informational Speech Factorization by Factorial Discriminative Normalization Flow || [[媒体文件:Informational_Speech_Factorization.pdf|slides]]      ||
-|-
-| rowspan="1"|2020/10/27 || Jiao Han || Experimental report mainly based on DNF models || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/e/e9/Experimental_report_mainly_based_on_DNF_models.pdf slides]    ||
-|-
-| rowspan="1"|2020/11/02 || Lantian Li || INTERSPEECH 2020 (SRE) || [[媒体文件:201102-INTERSPEECH_2020-SRE-LLT.pdf|slides]]      ||
-|-
-| rowspan="1"|2020/11/09 || Yunqi Cai || Deep normalization_V1 || [[媒体文件:Deep_norm_trilogy_v1.pdf|slides]] [http://caiyq.cslt.org/doc/deepnorm_v1.mp4 video]    ||
-|-
-| rowspan="1"|2020/11/16 || Yunqi Cai || Deep normalization_V2 || [http://caiyq.cslt.org/doc/deep-norm-trilogy_v2.pptx slides] [http://caiyq.cslt.org/doc/deepnorm_v2.mp4 video]     ||
-|-
-| rowspan="1"|2020/11/17 || Di Wang || Statistics decomposition for NL Scoring || [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/9/97/Statistics_decomposition_for_NL_Scoring.pdf slides]    ||
-|-
-| rowspan="1"|2020/11/23 || Yunqi Cai || Deep normalization_V3 || [http://caiyq.cslt.org/doc/deep-norm-trilogy_v3.pptx slides] [http://caiyq.cslt.org/doc/deepnorm_v3.mp4 video]    ||
-|-
-| rowspan="1"|2020/12/08 || Yunqi Cai || From materials science to perceptual intelligence || [http://caiyq.cslt.org/doc/perceptual_intelligence.pptx slides] [http://caiyq.cslt.org/doc/**.mp4 video]    ||
-|-
-| rowspan="1"|2020/12/08 || Dong Wang || From noise injection to Bayes PLDA || [[媒体文件:Bayes-plda.ppt|slides]]   ||
-|-
-| rowspan="1"|2020/12/21 || Lantian Li || Speech in NIPS 2019/2020 || [[媒体文件:Speech in NIPS 19&20.pdf|slides]]   ||
-|-
-| rowspan="1"|2020/12/28 || Pengqi Li || Domain generalization via robust optimization || [[媒体文件:201228-Device_Generalization.pdf|slides]]   ||
-|-
-| rowspan="1"|2021/01/07 || Dong Wang || What we believe || [[媒体文件:What we believe.pdf|slides]]   ||
-|-
-| rowspan="1"|2021/01/14 || Dong Wang || Reparametric trick || [[媒体文件:Reparametric.pdf|slides]]   ||
-|-
-| rowspan="1"|2021/02/01 || Dong Wang || Data augmentation as regularization || [[媒体文件:Data-augmentation.pdf|slides]]   ||
-|-
-| rowspan="1"|2021/02/22 || Lantian Li || Ensemble and Distillation || [[媒体文件:2012.09816.pdf|paper]] [[媒体文件:Ensemble_And_Distillation.pdf|slides]]   ||
-|-
-| rowspan="1"|2021/03/08 || Dong Wang || HIERARCHICAL GENERATIVE MODELING FOR CONTROLLABLE SPEECH SYNTHESIS || [https://arxiv.org/pdf/1810.07217.pdf paper] [[媒体文件:HIERARCHICALGENERATIVEMODELING FORCONTROLLABLESPEECHSYNTHESIS.pdf|slides]]   ||
-|-
-| rowspan="1"|2021/03/15 || Dong Wang || 第三代人工智能 || [http://scis.scichina.com/cn/2020/SSI-2020-0204.pdf  paper] [[媒体文件:第三代人工智能.pdf|slides]]   ||
-|-
-| rowspan="1"|2021/03/22 || Chao Xing || Complexity neural net in speech enhancement || [http://web.cse.ohio-state.edu/~wang.77/papers/WWW.taslp20.pdf paper1][https://openreview.net/pdf?id=SkeRTsAcYm paper2] [https://arxiv.org/pdf/2008.00264.pdf paper3] ||
-|-
-| rowspan="1"|2021/03/29 || Ying Shi || Some methods about speech enhancement || [[媒体文件:SPEECH ENHANCMENGT.pdf|slides]]   ||
-|-
-| rowspan="1"|2021/04/05 || Jiyuan Zhang || 推理 & 知识推理调研 || [[媒体文件:知识推理相关调研.pdf|slides]]   ||
-|-
-| rowspan="1"|2021/04/12 || Zicheng Qiu || Some work on minorlingual speech recognition||  ||
-|-
-| rowspan="1"|2021/04/19 || Shiyue Zhang || Text summarization||  ||
-|-
-| rowspan="1"|2021/04/26 || Dong Wang || Paper reading: Metadata normalization || [[媒体文件:Meta normalization.pdf|slides]] [https://arxiv.org/pdf/2104.09052.pdf paper]   ||
-|-
-| rowspan="1"|2021/05/10 || Lantian Li || Explainable ML || [[媒体文件:Explainable_ML.pdf|slides]] ||
-|-
-| rowspan="1"|2021/05/17 || Jie Li ||  || Tea cake Re-identification ||
 |-
 |}
@@ 第443行： / 第174行： @@
-[[Old readings]]
+[[Old readings|Past Events]]