Difference between revisions of “2025-03-24”

From cslt Wiki

Latest revision as of 11:03, 24 March 2025 (Monday)

People | This Week | Next Week | Task Tracking (Deadline)
Dong Wang
  • Reformulate the college version of the AI handbook.
Lantian Li
  • Lots of paperwork and staff work
Ying Shi
  • Revisit cohort conditional chain overlap ASR and conduct experiments (model is in training)
  • thesis
Zhenghai You
  • Complete some tests required by Huawei [https://z1et6d3xtb.feishu.cn/wiki/OXubwl2fIip91vkYsgMc1duhnLd]
  • Started implementing a TSE diffusion refiner; plan to share some papers on speech separation and speech enhancement refiners on Friday
Junming Yuan
  • Rewrite MT-HuBERT paper (1/5)
  • AI practice handbook design for primary and middle school (contents finished)
Xiaolou Li
  • VSR big-data training (5500 h) and debugging
  • Reproducing LipVoicer on Mandarin (currently overfitting; still searching for the best hyperparameters)
  • Writing the VTS test document
Zehua Liu
  • GA VTS test document writing
  • VTS paper reading and sharing this Friday
Pengqi Li
  • Prepare the AI course for Tsinghua University Middle School.
  • Write the paper for "Design course"
Wan Lin
  • Run ablation experiments [https://z1et6d3xtb.feishu.cn/docx/MxBNdPbLao0tsoxkBVCcUgUoneh?from=from_copylink]
Tianhao Wang
  • dragon05 data transfer
  • 4-mix & 5-mix training
  • Huawei project tasks
Xiaoxue Luo
  • Sound separation
    • baseline: bugs in the previous code caused low test results, so I modified the code and retrained the baseline
  • Filter test data for the Huawei project
Zhenyu Zhou
  • White paper on voiceprint recognition
  • Huawei project
Junhui Chen
  • improve code efficiency, continue to test diarization on vox-e
  • read paper
Jiaying Wang
  • finish model structure
  • current problem: CTC loss is NaN
    • the data-related cause of the NaN has been fixed; further investigation is ongoing
Yu Zhang
  • paper reading and sharing
  • add long-term and short-term trading strategies
Wenqiang Du
  • Submit the AI primary handbook
  • AI primary handbook PPT (44/44), needs checking
  • Check the AI middle handbook (108/278)
Yang Wei
  • Text-enroll KWS model adaptation with keyword phone labels from decoding and a non-linear adaptation layer (in progress).
Turi
  • Completed writing the thesis (will revise this week)
Yue Gu
  • FIP-based personality-gated adaptation with synthetic data for personal ASR; the preview evaluation experiments are almost finished.
  • Read 2 TTS-augment papers and reviewed an Interspeech 2025 paper
Qi Qu
  • Text-enroll KWS: different window shifts (e.g. one at 100ms and the other at 200ms) for different keywords.
  • VPR experiment: finding suitable thresholds for defined use cases [https://b30lttjm7l.feishu.cn/docx/OlPCduevQo1phexmO4wcY4uynPh?from=from_copylink].