“2025-03-24”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
 
(9位用户的12个中间修订版本未显示)
第6行: 第6行:
 
|Dong Wang
 
|Dong Wang
 
||
 
||
*  
+
* Reformulate AI handbook college version.
  
 
||
 
||
第18行: 第18行:
 
|Lantian Li
 
|Lantian Li
 
||
 
||
*  
+
* lots of paperwork and staffwork
 
||
 
||
 
*
 
*
第42行: 第42行:
 
|Zhenghai You
 
|Zhenghai You
 
||
 
||
*  
+
* Complete some tests required by Huawei[https://z1et6d3xtb.feishu.cn/wiki/OXubwl2fIip91vkYsgMc1duhnLd]
 +
* Starting to implement TSE diffusion refiner, and plan to share some papers on speech separation and speech enhancement refiner on Friday
 
||
 
||
 
*
 
*
第65行: 第66行:
 
||
 
||
 
* VSR big data training (5500h) and debug
 
* VSR big data training (5500h) and debug
* LipVoicer reproduce on Mandarin (overfitting now, still finding the best hyperparameter)
+
* Reproducing LipVoicer on Mandarin (overfitting now, still finding the best hyperparameter)
 
* Writing the VTS test document
 
* Writing the VTS test document
 
||
 
||
第77行: 第78行:
 
|Zehua Liu
 
|Zehua Liu
 
||
 
||
*GA test document writing
+
*GA VTS test document writing
 
*VTS paper Reading and Sharing this Friday
 
*VTS paper Reading and Sharing this Friday
 
||
 
||
第138行: 第139行:
 
|Zhenyu Zhou
 
|Zhenyu Zhou
 
||
 
||
*
+
* white paper of Voiceprint Recognition
 +
* huawei project
 
||
 
||
 
*
 
*
第161行: 第163行:
 
|Jiaying Wang
 
|Jiaying Wang
 
||
 
||
*
+
* finish model structure
 +
* current problem:ctc loss Nan
 +
** data-related reason of NaN have been fixed, and further investigation is ongoing
 
||
 
||
 
*
 
*
第172行: 第176行:
 
|Yu Zhang
 
|Yu Zhang
 
||
 
||
*  
+
* paper reading and sharing
 +
* add long-term and short-term trading strategy
 
||
 
||
 
*
 
*
第196行: 第201行:
 
|Yang Wei
 
|Yang Wei
 
||
 
||
*  
+
* Text enroll kws model adaptation with keyword phone label from decoding and non-linear adaptation layer. (in progress)
 
||
 
||
 
*
 
*
第206行: 第211行:
 
|Turi
 
|Turi
 
||
 
||
*  
+
* Completed writing thesis (Will revise this week)
 
||
 
||
 
*  
 
*  
第225行: 第230行:
 
||
 
||
 
* Text-enroll KWS: different window shifts (e.g. one at 100ms and the other at 200ms) for different keywords.
 
* Text-enroll KWS: different window shifts (e.g. one at 100ms and the other at 200ms) for different keywords.
* VPR experiment: finding suitable thresholds for defined use cases.
+
* VPR experiment: finding suitable thresholds for defined use cases [https://b30lttjm7l.feishu.cn/docx/OlPCduevQo1phexmO4wcY4uynPh?from=from_copylink].
 
||
 
||
 
*
 
*

2025年3月24日 (一) 11:03的最后版本

People This Week Next Week Task Tracking (DeadLine)
Dong Wang
  • Reformulate AI handbook college version.
Lantian Li
  • lots of paperwork and staffwork
Ying Shi
  • revisit cohort conditional chain overlap asr and conduct experiments (model is in training)
  • thesis
Zhenghai You
  • Complete some tests required by Huawei[1]
  • Starting to implement TSE diffusion refiner, and plan to share some papers on speech separation and speech enhancement refiner on Friday
Junming Yuan
  • Rewrite MT-HuBERT paper (1/5)
  • AI practice handbook design of primary school and middle school(contents finished)
Xiaolou Li
  • VSR big data training (5500h) and debug
  • Reproducing LipVoicer on Mandarin (overfitting now, still finding the best hyperparameter)
  • Writing the VTS test document
Zehua Liu
  • GA VTS test document writing
  • VTS paper Reading and Sharing this Friday
Pengqi Li
  • Prepare the AI course for Tsinghua University middle School.
  • write paper for "Design course"
Wan Lin
  • run ablation experiments [2]
Tianhao Wang
  • dragon05 data transfer
  • 4-mix & 5-mix training
  • huawei project things
Xiaoxue Luo
  • Sound separation
    • baseline: there are some bugs in previous code that resulting in low test results, so I modified the code and retrained it
  • filter testing data for Huawei project
Zhenyu Zhou
  • white paper of Voiceprint Recognition
  • huawei project
Junhui Chen
  • improve code efficiency, continue to test diarization on vox-e
  • read paper
Jiaying Wang
  • finish model structure
  • current problem:ctc loss Nan
    • data-related reason of NaN have been fixed, and further investigation is ongoing
Yu Zhang
  • paper reading and sharing
  • add long-term and short-term trading strategy
Wenqiang Du
  • Summit AI primary handbook
  • AI primary handbook's PPT (44/44),Need to Check
  • Check AI middl handbook(108/278)
Yang Wei
  • Text enroll kws model adaptation with keyword phone label from decoding and non-linear adaptation layer. (in progress)
Turi
  • Completed writing thesis (Will revise this week)
Yue Gu
  • FIP-based personality-gated adaptation with synthetic data for personal ASR, almost finish preview evaluation exps.
  • read 2 TTS-augment papers and reivew an interspeech2025 paper
Qi Qu
  • Text-enroll KWS: different window shifts (e.g. one at 100ms and the other at 200ms) for different keywords.
  • VPR experiment: finding suitable thresholds for defined use cases [3].