“Weekly reading”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
 
(9位用户的66个中间修订版本未显示)
第92行: 第92行:
 
| 2022/05/27  ||Pengqi Li    || The important ”feature” for speaker recognition || [[媒体文件:The important ”feature” for speaker recognition.pdf]]
 
| 2022/05/27  ||Pengqi Li    || The important ”feature” for speaker recognition || [[媒体文件:The important ”feature” for speaker recognition.pdf]]
 
|-
 
|-
| 2022/06/10  ||Zixi Yan    || ||  
+
| 2022/06/10  ||Zixi Yan    || Paper Share || [[媒体文件:Paper_share_yzx0610.pdf]]
 
|-
 
|-
 
| 2022/06/24  ||Renmiao Chen || Transformer in multimodal || [[媒体文件:Transformer_in_multimodal.pdf]]
 
| 2022/06/24  ||Renmiao Chen || Transformer in multimodal || [[媒体文件:Transformer_in_multimodal.pdf]]
第98行: 第98行:
 
|            ||            || ICASSP 2022 review || [[媒体文件:ICASSP2022_review.pdf]]  [[媒体文件:ICASSP-2022-readinglist.pdf]]
 
|            ||            || ICASSP 2022 review || [[媒体文件:ICASSP2022_review.pdf]]  [[媒体文件:ICASSP-2022-readinglist.pdf]]
 
|-
 
|-
| 2022/07/01 ||Chen Chen    || ||  
+
| 2022/07/04 ||Chen Chen    || Video to Speech papers || [[媒体文件:VTS_cc.pdf]]
 
|-
 
|-
|             ||Ruihai Hou  || ||  
+
| 2022/07/08  ||Ruihai Hou  || ICASSP 2022 review (part) || [[媒体文件:Weeklyreading_hrh.pdf]]
 
|-
 
|-
|             ||Sirui Li    || ||  
+
| 2022/07/15  ||Sirui Li    || Towards End-to-end Unsupervised Speech Recognition || [[媒体文件:Towards_End_to_end_Unsupervised_Speech_Recognition.pdf]]
 
|-
 
|-
|             ||Haoyu Jiang  ||  ||  
+
| 2022/07/22  ||Wan Lin      || AutoED: Text-independent unsupervised speaker recognition Model|| [[媒体文件:AutoED_spk_reg.pdf]]
 +
|-
 +
| 2022/07/29  ||Haoyu Jiang || ArcFace_iQIYI-VID || [[媒体文件:ArcFace_iQIYI-VID.pdf]]
 +
|-
 +
| 2022/08/05  ||Chen Chen    || Recent advance in VTS task || [[媒体文件:RecentVTS.pdf]]
 +
|-
 +
| 2022/08/12  ||Tianhao Wang || Extremal Perturbations || [[媒体文件:Extremal_perturbations.pdf]]
 +
|-
 +
| 2022/08/19  ||Renmiao Chen || The correlation of face and vioce || [[媒体文件:The_correlation_of_face_and_vioce_CRM.pdf]]
 +
|-
 +
| 2022/09/02  ||Zixi Yan    || Non-Contrastive Self-supervised Learning || [[媒体文件:Non_contrastive_Self_supervised_Learning.pdf]]
 +
|-
 +
| 2022/09/09  ||Sirui Li    || Low Resource Speech Recognition || [[媒体文件:Low_Resource_Speech_Recognition_lsr_0909.pdf]]
 +
|-
 +
| 2022/09/16  ||Xipin Wei    || Controllable Multi-style Music Generation Model based on simple Contrastive Learning || [[媒体文件:Controllable_Multi_style_Music_Generation_Model_based_on_simple_Contrastive_learning.pdf]]
 +
|-
 +
| 2022/09/23  ||Haoyu Jiang  || Audio Visual Learning || [[媒体文件:Audio_Visual_Learning.pdf]]
 +
|-
 +
| 2022/09/30  ||Chen Chen    || Speech Quality Assessment || [[媒体文件:220930_cchen_SpeechQualityAssessment.pdf]]
 +
|-
 +
| 2022/10/07  ||Wan Lin      || Cross-Domain Speaker Recognition || [[媒体文件:Cross_Domain_Speaker_Recognition.pdf]]
 +
|-
 +
| 2022/10/14  ||Tianhao Wang || How do deep speaker models treat silence and noises || [[媒体文件:20221014_wth.pdf]]
 +
|-
 +
| 2022/10/31  ||Pengqi Li    || Visualization of a specific filter in CNN || [[媒体文件:Visualization of a specific filter in CNN.pdf]]
 +
|-
 +
| 2022/11/04  ||Zhenyu Zhou  || Acoustic-aware Training for Multi-genre Speaker Recognition || [[媒体文件:20221104_acoustic_training.pdf]]
 +
|-
 +
| 2022/11/07  ||Chen Chen & Renmiao Chen || Experience and perceptions of collecting Audio-Visual dataset || [[媒体文件:20221107_cc_crm.pdf]]
 +
|-
 +
| 2022/12/23  ||Renmiao Chen || IS22 and Perceiver IO|| [[媒体文件:221223CRM.pdf]]
 +
|-
 +
| 2022/12/23  ||Dong Wang    || NIPS2022 || [[媒体文件:NIPS2022.pdf]]
 +
|-
 +
| 2022/12/30  ||Chen Chen    || Perceptual in Generative Audio Models || [[媒体文件:221230_cc.pdf]]
 +
|-
 +
|            ||            || IS22_review || [[媒体文件:IS22_review_all.pdf]]
 +
|-
 +
| 2023/02/10  ||Jiaying Wang || Ordered binary speaker embedding || [[媒体文件:230210wjy.pdf]]
 +
|-
 +
| 2023/02/17  ||Xipin Wei    || MSAT: A Multi-Scale Attentive Transformer for Multi-Instrument Symbolic Music Generation || [[媒体文件:MSAT_wxp.pdf]]
 +
|-
 +
| 2023/03/10  ||Zhenyu Zhou  || consistence_loss&BCE_loss ||  [[媒体文件:consistence_loss&BCE_loss.pdf]]
 +
|-
 +
| 2023/03/17  ||Tianhao Wang || Score calibration in speaker verification || [[媒体文件:Score_calibration_in_speaker_verification.pdf]]
 +
|-
 +
| 2023/03/31  ||Wan Lin      || Understand contrast and non-contrast in self-supervised learning || [[媒体文件:Understand contrast and non-contrast in self-supervised learning.pdf]]
 +
|-
 +
| 2023/04/14  ||Pengqi Li    || Towards Attribution Methods in Deep Speaker Recognition || [[媒体文件:Towards_Attribution_Methods_in_Deep_Speaker_Recognition_230414_lpq.pdf]]
 +
|-
 +
| 2023/04/21  ||Chen Chen    || Masked Prediction Task Based Self-supervised Multimodal Learning || [[媒体文件:Masked_prediction_task_based_self-supervised_multimodal_learning.pdf]]
 +
|-
 +
| 2023/04/28  ||Xiaolou Li  || Incomplete Multimodal Method Exploration || [[媒体文件:Incomplete_Multimodal_Method_Exploration.pdf]]
 +
|-
 +
| 2023/05/04  ||Renmiao Chen || Applications of Diffusion Model || [[媒体文件:230505_Applications_of_Diffusion_Model_CRM.pdf]]
 +
|-
 +
| 2023/05/19  ||Jiaying Wang ||  DSH based method||[[媒体文件:230519_DSH_based_paper.pptx]]
 +
|-
 +
| 2023/05/26  ||Zhenyu Zhou  || representation learning approach for domain adaptation || [[媒体文件:Representation_learning_approach_for_domain_adaptation.pptx]]
 +
|-
 +
| 2023/06/02  ||Pengqi Li    ||  ||
 +
|-
 +
| 2023/06/30  ||Tianhao Wang || Robust Speaker Verification ICASSP2023 || [[媒体文件:20230630_Robust_Speaker_Verification_ICASSP2023.pdf]]
 +
|-
 +
| 2023/10/13  ||Xiaolou Li  ||  ||
 +
|-
 +
| 2023/10/20  ||Zehua Liu    ||  ||
 +
|-
 +
| 2023/10/27  ||Junhui Chen ||  ||
 
|-
 
|-
 
|}
 
|}
 
  
  
  
 
[[Old readings|Past Events]]
 
[[Old readings|Past Events]]

2023年10月10日 (二) 02:22的最后版本

清华大学语音语言中心内部学习会

时间: 每周五晚19:30

地点: 1区303


Date Speaker Title Materials
PPT模板 媒体文件:Weeklyreading_template.rar
2021/04/01 Haoran Sun Zeus code regularization 媒体文件:代码规范.pdf
2021/05/20 Chen Chen Overview of speech enhancement 媒体文件:Speech_enhancement.pdf
2021/05/27 Di Wang Secret of 'hard trials' 媒体文件:Secret_of_hard_trials.pdf
2021/06/10 Jingxin Shen Expriments about thermal to RGB face synthesis with cycleGan and pix2pix 媒体文件:Expriments about thermal to RGB face synthesis with cycleGan and pix2pix.pdf
2021/06/17 Yang Zhang NIPS2020: Long-Tailed Classification by Keeping the Good and Removing the Bad Momentum Causal Effect 媒体文件:long-tail.pdf
2021/07/08 Tiankai Zhi Some experiments on stargan 媒体文件:Some experiments on stargan.pdf
2021/07/15 Jiao Han MG experiments based on ASV system 媒体文件:MG experiments based on ASV system..pptx
2021/07/22 Zixi Yan & Sirui Li Unsupervised Speech Recognition 媒体文件:Unsupervised_Speech_Recognition.pdf
2021/07/29 Pengqi Li A Simulation Study on 􏰛􏰜 Ro􏰛bust MAML 媒体文件:A Simulation Study on 􏰛􏰜 Ro􏰛bust MAML.pdf
2021/08/12 Qingyang Zhu Noise-aware method for Speech Enhancement 媒体文件:Noise-aware method for Speech Enhancement.pdf
2021/08/12 Weida Liang Unsupervised Audio-Visual Synthesis via Exemplar Autoencoders 媒体文件:Bi-weekly_report_Liangwd.pdf
2021/08/19 Di Wang Inter Dataset Variability Compensation 媒体文件:Inter_dataset_variability_compensation.pdf
2021/09/02 Tiankai Zhi One Shot VC 媒体文件:One_shot_VC.pdf
2021/09/09 Jingxin Shen Thermal Speaking 媒体文件:Thermal_Speaking_2021.pdf
2021/09/23 Sirui Li & Zixi Yan Wav2vec-U Experimental Report 媒体文件:Wav2vec-U_experimental_report.pdf ‎
2021/10/20 Renmiao Chen Is Someone Speaking? 媒体文件:Is_Someone_Speaking_Exploring_Long-term_Temporal_Features.pdf ‎
2021/10/28 Chen Chen WenetSpeech Introduction 媒体文件:WenetSpeech_Dataset_Introduction.pdf ‎
2021/11/10 Weida Liang Cycle-loss Exemplar Autoencoder 媒体文件:Cycle-loss_Exemplar_Autoencoder.pdf ‎
2021/11/17 吾买尔江 Modulation Spectrum 媒体文件:Modulation_Spectrum.pdf ‎
2021/11/24 Chen Chen S-DCCRN 媒体文件:S-DCCRN_pdf.pdf ‎
2021/12/01 Pengqi Li GuidedMix: An on-the-fly data augmentation approach for robust speaker recognition system 媒体文件:201201-GuidedMix-LPQ.pdf ‎
2021/12/08 Renmiao Chen Multimodal preson verification 媒体文件:Multimodal_preson_verification.pdf
2021/12/15 Ruihai Hou Crossmodal clustered contrastive learning: Grounding of spoken language to gesture 媒体文件:Crossmodal_clustered_contrasti.pdf
2021/12/29 Zixi Yan Capsules Network 媒体文件:Capsules_Network.pdf
2022/01/05 Sirui Li Self-Supervised Learning for speech recognition with Intermediate layer supervision 媒体文件:SSL with Intermediate layer supervision.pdf
2022/01/12 Weida Liang FragmentVC 媒体文件:FragmentVC.pdf
2022/01/19 Haoyu Jiang Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video 媒体文件:Multi-modality_Associative_Bridging_through_Memory.pdf
2022/02/14 Interspeech 2021 Review 媒体文件:Interspeech_paper_review_min.pdf
2022/02/16 Chen Chen Audio Visual HuBERT 媒体文件:AVHuBERT.pdf
2022/03/04 Pengqi Li Study of Visualization 媒体文件:Visualization.pdf
2022/03/11 Renmiao Chen Can audio-visual integration strengthen robustness under multimodal attacks? 媒体文件:Audio-Visual_Robustness_Under_Multimodal_Attacks.pdf
2022/03/11 吾买尔江 Signal Separation 媒体文件:Signal_Separation.pdf
2022/03/18 Chen Chen Overview on Lip Reading and Audio-visual Speech Recognition 媒体文件:LipReadingAndAVSR.pdf
2022/04/01 Ruihai Hou Scalable Identity-Oriented Speech Retrieval 媒体文件:Scalable_Identity-Oriented_Speech_Retrieval.pdf
2022/04/08 Zixi Yan Wav2vec related papers share 媒体文件:Wav2vec_related_papers.pdf
2022/04/22 Sirui Li Speech-Based Language Modelling 媒体文件:Speech-Based Language Modelling.pdf
2022/04/29 Haoyu Jiang Models of Speaker Recognition 媒体文件:Models_of_Speaker_Recognition.pdf
2022/05/13 Chen Chen Audio-visual Representation Learning 媒体文件:Audio_visual_representation_learning.pdf
2022/05/20 Haoran Sun
2022/05/27 Pengqi Li The important ”feature” for speaker recognition 媒体文件:The important ”feature” for speaker recognition.pdf
2022/06/10 Zixi Yan Paper Share 媒体文件:Paper_share_yzx0610.pdf
2022/06/24 Renmiao Chen Transformer in multimodal 媒体文件:Transformer_in_multimodal.pdf
ICASSP 2022 review 媒体文件:ICASSP2022_review.pdf 媒体文件:ICASSP-2022-readinglist.pdf
2022/07/04 Chen Chen Video to Speech papers 媒体文件:VTS_cc.pdf
2022/07/08 Ruihai Hou ICASSP 2022 review (part) 媒体文件:Weeklyreading_hrh.pdf
2022/07/15 Sirui Li Towards End-to-end Unsupervised Speech Recognition 媒体文件:Towards_End_to_end_Unsupervised_Speech_Recognition.pdf
2022/07/22 Wan Lin AutoED: Text-independent unsupervised speaker recognition Model 媒体文件:AutoED_spk_reg.pdf
2022/07/29 Haoyu Jiang ArcFace_iQIYI-VID 媒体文件:ArcFace_iQIYI-VID.pdf
2022/08/05 Chen Chen Recent advance in VTS task 媒体文件:RecentVTS.pdf
2022/08/12 Tianhao Wang Extremal Perturbations 媒体文件:Extremal_perturbations.pdf
2022/08/19 Renmiao Chen The correlation of face and vioce 媒体文件:The_correlation_of_face_and_vioce_CRM.pdf
2022/09/02 Zixi Yan Non-Contrastive Self-supervised Learning 媒体文件:Non_contrastive_Self_supervised_Learning.pdf
2022/09/09 Sirui Li Low Resource Speech Recognition 媒体文件:Low_Resource_Speech_Recognition_lsr_0909.pdf
2022/09/16 Xipin Wei Controllable Multi-style Music Generation Model based on simple Contrastive Learning 媒体文件:Controllable_Multi_style_Music_Generation_Model_based_on_simple_Contrastive_learning.pdf
2022/09/23 Haoyu Jiang Audio Visual Learning 媒体文件:Audio_Visual_Learning.pdf
2022/09/30 Chen Chen Speech Quality Assessment 媒体文件:220930_cchen_SpeechQualityAssessment.pdf
2022/10/07 Wan Lin Cross-Domain Speaker Recognition 媒体文件:Cross_Domain_Speaker_Recognition.pdf
2022/10/14 Tianhao Wang How do deep speaker models treat silence and noises 媒体文件:20221014_wth.pdf
2022/10/31 Pengqi Li Visualization of a specific filter in CNN 媒体文件:Visualization of a specific filter in CNN.pdf
2022/11/04 Zhenyu Zhou Acoustic-aware Training for Multi-genre Speaker Recognition 媒体文件:20221104_acoustic_training.pdf
2022/11/07 Chen Chen & Renmiao Chen Experience and perceptions of collecting Audio-Visual dataset 媒体文件:20221107_cc_crm.pdf
2022/12/23 Renmiao Chen IS22 and Perceiver IO 媒体文件:221223CRM.pdf
2022/12/23 Dong Wang NIPS2022 媒体文件:NIPS2022.pdf
2022/12/30 Chen Chen Perceptual in Generative Audio Models 媒体文件:221230_cc.pdf
IS22_review 媒体文件:IS22_review_all.pdf
2023/02/10 Jiaying Wang Ordered binary speaker embedding 媒体文件:230210wjy.pdf
2023/02/17 Xipin Wei MSAT: A Multi-Scale Attentive Transformer for Multi-Instrument Symbolic Music Generation 媒体文件:MSAT_wxp.pdf
2023/03/10 Zhenyu Zhou consistence_loss&BCE_loss 媒体文件:consistence_loss&BCE_loss.pdf
2023/03/17 Tianhao Wang Score calibration in speaker verification 媒体文件:Score_calibration_in_speaker_verification.pdf
2023/03/31 Wan Lin Understand contrast and non-contrast in self-supervised learning 媒体文件:Understand contrast and non-contrast in self-supervised learning.pdf
2023/04/14 Pengqi Li Towards Attribution Methods in Deep Speaker Recognition 媒体文件:Towards_Attribution_Methods_in_Deep_Speaker_Recognition_230414_lpq.pdf
2023/04/21 Chen Chen Masked Prediction Task Based Self-supervised Multimodal Learning 媒体文件:Masked_prediction_task_based_self-supervised_multimodal_learning.pdf
2023/04/28 Xiaolou Li Incomplete Multimodal Method Exploration 媒体文件:Incomplete_Multimodal_Method_Exploration.pdf
2023/05/04 Renmiao Chen Applications of Diffusion Model 媒体文件:230505_Applications_of_Diffusion_Model_CRM.pdf
2023/05/19 Jiaying Wang DSH based method 媒体文件:230519_DSH_based_paper.pptx
2023/05/26 Zhenyu Zhou representation learning approach for domain adaptation 媒体文件:Representation_learning_approach_for_domain_adaptation.pptx
2023/06/02 Pengqi Li
2023/06/30 Tianhao Wang Robust Speaker Verification ICASSP2023 媒体文件:20230630_Robust_Speaker_Verification_ICASSP2023.pdf
2023/10/13 Xiaolou Li
2023/10/20 Zehua Liu
2023/10/27 Junhui Chen


Past Events