“2025-03-31”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
 
(18位用户的26个中间修订版本未显示)
第6行: 第6行:
 
|Dong Wang
 
|Dong Wang
 
||
 
||
*  
+
* AI handbook college version
  
 
||
 
||
第18行: 第18行:
 
|Lantian Li
 
|Lantian Li
 
||
 
||
*  
+
* AI Handbook (College Edition) (4/10)
 +
* Project Matters (BUPT and Huawei)
 +
* Review Some Papers
 
||
 
||
 
*
 
*
第29行: 第31行:
 
|Ying Shi
 
|Ying Shi
 
||
 
||
*  
+
* Huawei project  cooperation
 
||
 
||
 
*  
 
*  
第41行: 第43行:
 
|Zhenghai You
 
|Zhenghai You
 
||
 
||
*  
+
* Training second iteration TSE model for Huawei project ( eol 0db SI-SDR (once:6.963)-> (twice:8.428))[https://z1et6d3xtb.feishu.cn/wiki/OXubwl2fIip91vkYsgMc1duhnLd]
 +
* Check the breakdown of the host to China Unicom
 
||
 
||
 
*
 
*
第52行: 第55行:
 
||
 
||
 
* MT-HuBERT paper-CN-v0.1(Done)
 
* MT-HuBERT paper-CN-v0.1(Done)
* some AI handbook works
+
* some AI handbook works(checking & revisions) with Brother Qiang
 
* some AI practice handbook works
 
* some AI practice handbook works
 
* Review primary school PPT  
 
* Review primary school PPT  
第65行: 第68行:
 
|Xiaolou Li
 
|Xiaolou Li
 
||
 
||
*  
+
* The reproduction of LipVoicer is nearly complete — the melgen, vocoder, and VSR components are ready. The remaining step is to fine-tune the ASR module.
||
+
* Fixed some bugs of VSR big data training.
*  
+
 
||
 
||
 
*
 
*
第76行: 第78行:
 
|Zehua Liu
 
|Zehua Liu
 
||
 
||
*
+
*Paper Reading on last Friday
 +
*Reproducing Accurately Lip2Speech paper,maybe get some Result this week.
 
||
 
||
 
*
 
*
第87行: 第90行:
 
|Pengqi Li
 
|Pengqi Li
 
||
 
||
*  
+
* Urgent tasks with Qiang Brother
 +
** Handbook checking and revisions, PPT development
 +
* Modified the paper for "Design course"
 +
* Courses and report(Tuesday, Saturday, Today)
 
||
 
||
 
*
 
*
第98行: 第104行:
 
|Wan Lin
 
|Wan Lin
 
||
 
||
*  
+
* Ablation experiments [https://z1et6d3xtb.feishu.cn/docx/MxBNdPbLao0tsoxkBVCcUgUoneh?from=from_copylink]
 +
* Experiments for diff. Transformer layers
 
||
 
||
 
*
 
*
第109行: 第116行:
 
|Tianhao Wang
 
|Tianhao Wang
 
||
 
||
*  
+
* Huawei project things
 +
** test report
 +
** vggsound 44 classes model training
 +
* Huawei interview
 
||
 
||
 
*
 
*
第120行: 第130行:
 
|Xiaoxue Luo
 
|Xiaoxue Luo
 
||
 
||
*  
+
* Sound separation
 +
** familiar with the code of our model
 +
* check and revise the AI handbooks
 
||
 
||
 
*
 
*
第142行: 第154行:
 
|Junhui Chen
 
|Junhui Chen
 
||
 
||
*
+
* finished diarization test in vox-e (result is normal)
 +
* read paper
 
||
 
||
 
*
 
*
第153行: 第166行:
 
|Jiaying Wang
 
|Jiaying Wang
 
||
 
||
*
+
* use Whisper replace the CTC calculation module: reading paper and preparing code.
 +
* modify the bp strategy of the current CTC loss.
 
||
 
||
 
*
 
*
第164行: 第178行:
 
|Yu Zhang
 
|Yu Zhang
 
||
 
||
*  
+
* Multi Agent System
 +
** debug about new long-term short-term trade policy
 +
* Huawei AED
 +
** new dataset training (77% from Audio Set, 33% from Huawei) and some bad case study
 
||
 
||
 
*
 
*
第175行: 第192行:
 
|Wenqiang Du
 
|Wenqiang Du
 
||
 
||
*  
+
* Summit middl handbook
 
||
 
||
 
*
 
*
第186行: 第203行:
 
|Yang Wei
 
|Yang Wei
 
||
 
||
*  
+
* Nonlinear layer adaptation for text enroll kws model on English data, a little better.
 
||
 
||
 
*
 
*
第196行: 第213行:
 
|Turi
 
|Turi
 
||
 
||
*  
+
* Thesis review
 
||
 
||
 
*  
 
*  
第204行: 第221行:
 
|Yue Gu
 
|Yue Gu
 
||
 
||
*  
+
* almost finish 5 speakers'exps,plan to write the paper
 +
* plan to give the presentation after accomplishing the introduction
 +
* proofread the AI handbooks
 
||
 
||
 
*
 
*
第213行: 第232行:
 
|Qi Qu
 
|Qi Qu
 
||
 
||
*  
+
* NPU (MR536) models benchmark regarding speed [https://b30lttjm7l.feishu.cn/docx/IygUdJ16kofcg2xEPnEc7lNsnqb?from=from_copylink].
 +
* KWS models for Chaoshan dialect released.
 
||
 
||
 
*
 
*
 
||
 
||
*
+
*
 
|-
 
|-

2025年3月31日 (一) 10:59的最后版本

People This Week Next Week Task Tracking (DeadLine)
Dong Wang
  • AI handbook college version
Lantian Li
  • AI Handbook (College Edition) (4/10)
  • Project Matters (BUPT and Huawei)
  • Review Some Papers
Ying Shi
  • Huawei project cooperation
Zhenghai You
  • Training second iteration TSE model for Huawei project ( eol 0db SI-SDR (once:6.963)-> (twice:8.428))[1]
  • Check the breakdown of the host to China Unicom
Junming Yuan
  • MT-HuBERT paper-CN-v0.1(Done)
  • some AI handbook works(checking & revisions) with Brother Qiang
  • some AI practice handbook works
  • Review primary school PPT
Xiaolou Li
  • The reproduction of LipVoicer is nearly complete — the melgen, vocoder, and VSR components are ready. The remaining step is to fine-tune the ASR module.
  • Fixed some bugs of VSR big data training.
Zehua Liu
  • Paper Reading on last Friday
  • Reproducing Accurately Lip2Speech paper,maybe get some Result this week.
Pengqi Li
  • Urgent tasks with Qiang Brother
    • Handbook checking and revisions, PPT development
  • Modified the paper for "Design course"
  • Courses and report(Tuesday, Saturday, Today)
Wan Lin
  • Ablation experiments [2]
  • Experiments for diff. Transformer layers
Tianhao Wang
  • Huawei project things
    • test report
    • vggsound 44 classes model training
  • Huawei interview
Xiaoxue Luo
  • Sound separation
    • familiar with the code of our model
  • check and revise the AI handbooks
Zhenyu Zhou
Junhui Chen
  • finished diarization test in vox-e (result is normal)
  • read paper
Jiaying Wang
  • use Whisper replace the CTC calculation module: reading paper and preparing code.
  • modify the bp strategy of the current CTC loss.
Yu Zhang
  • Multi Agent System
    • debug about new long-term short-term trade policy
  • Huawei AED
    • new dataset training (77% from Audio Set, 33% from Huawei) and some bad case study
Wenqiang Du
  • Summit middl handbook
Yang Wei
  • Nonlinear layer adaptation for text enroll kws model on English data, a little better.
Turi
  • Thesis review
Yue Gu
  • almost finish 5 speakers'exps,plan to write the paper
  • plan to give the presentation after accomplishing the introduction
  • proofread the AI handbooks
Qi Qu
  • NPU (MR536) models benchmark regarding speed [3].
  • KWS models for Chaoshan dialect released.