“2025-04-14”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
 
(14位用户的20个中间修订版本未显示)
第6行: 第6行:
 
|Dong Wang
 
|Dong Wang
 
||
 
||
*  
+
* AIGE opening ceremoney
  
 
||
 
||
第18行: 第18行:
 
|Lantian Li
 
|Lantian Li
 
||
 
||
*  
+
* Complete IS rebuttal
 +
* Translate ML book
 +
* Release AIGE web v.10
 
||
 
||
 
*
 
*
第29行: 第31行:
 
|Ying Shi
 
|Ying Shi
 
||
 
||
*  
+
* 4.11 meeting stuff
 +
* Structure of thesis
 
||
 
||
 
*  
 
*  
第41行: 第44行:
 
|Zhenghai You
 
|Zhenghai You
 
||
 
||
*  
+
* Some model tuning in Huawei projects (training)
 +
* Revise rebuttal
 
||
 
||
 
*
 
*
第63行: 第67行:
 
|Xiaolou Li
 
|Xiaolou Li
 
||
 
||
*  
+
* Write document for GA project
 +
* LipVoicer reproduction[https://z1et6d3xtb.feishu.cn/docx/AQWMdHYA2ov4pTxmoAncLW7bn1c?from=from_copylink]
 
||
 
||
 
*  
 
*  
第74行: 第79行:
 
|Zehua Liu
 
|Zehua Liu
 
||
 
||
*
+
*Reproduce ACClip2Speech on CNVSRC.Single[https://z1et6d3xtb.feishu.cn/docx/Dg0VdOa2MofEsMxbr2Xcn8MXnUh?from=from_copylink]
 +
*Finish Vocoder code (still on training)
 +
*GongAn document writing
 
||
 
||
 
*
 
*
第85行: 第92行:
 
|Pengqi Li
 
|Pengqi Li
 
||
 
||
*  
+
* developed primary school PPTs (2)
 +
* Co-developed AI practice handbooks with the practice schools (3)
 
||
 
||
 
*
 
*
第96行: 第104行:
 
|Wan Lin
 
|Wan Lin
 
||
 
||
*  
+
* Some Experiments [https://z1et6d3xtb.feishu.cn/docx/MxBNdPbLao0tsoxkBVCcUgUoneh?from=from_copylink]
 
||
 
||
 
*
 
*
第107行: 第115行:
 
|Tianhao Wang
 
|Tianhao Wang
 
||
 
||
*  
+
* huawei project & huohua testing
 +
* personal stuff
 
||
 
||
 
*
 
*
第118行: 第127行:
 
|Xiaoxue Luo
 
|Xiaoxue Luo
 
||
 
||
*  
+
* Sound separation
 +
** baseline: adjust the code of one USS model and evaluate its performance using our test datasets
 
||
 
||
 
*
 
*
第140行: 第150行:
 
|Junhui Chen
 
|Junhui Chen
 
||
 
||
*
+
* Continue all NS exp with 1 layer
 +
* Read paper
 
||
 
||
 
*
 
*
第151行: 第162行:
 
|Jiaying Wang
 
|Jiaying Wang
 
||
 
||
*
+
* explore Whisper: ASR function cannot transcribe mix speech well, the SOT of whisper is on the task level (translate, transcribe, ...)
 +
* modified the model
 +
** using the CTC loss with the minimum loss per iteration backpropagation strategy: loss still turns out to be NaN (problem not in loss caculation)
 +
** replace asr modules to find reason
 +
* other ASR tools
 +
** trying mix speech ASR of FunASR, and attempting to integrate it into the current model
 +
 
 
||
 
||
 
*
 
*
第162行: 第179行:
 
|Yu Zhang
 
|Yu Zhang
 
||
 
||
*  
+
* Add additional inference step for technical factor analysis (worth for a longer backtesting)
 +
* Done single stock trade pipeline (debug is done, can run backtesting tomorrow)
 +
* Weight avg trading (still debugging)
 
||
 
||
 
*
 
*
第173行: 第192行:
 
|Wenqiang Du
 
|Wenqiang Du
 
||
 
||
*  
+
* Prepared for the 4.11 meeting
 
||
 
||
 
*
 
*
第184行: 第203行:
 
|Yang Wei
 
|Yang Wei
 
||
 
||
*  
+
* Some modification based on the kws patent document from Wenqiang.
 
||
 
||
 
*
 
*
第212行: 第231行:
 
|Qi Qu
 
|Qi Qu
 
||
 
||
*  
+
* Finished organizing codebase for evaluating KWS models (both "audio-enroll" and text-enroll).
 +
* Text-enroll KWS models evaluation on Mandarin Chinese w/ southern China accent data [https://b30lttjm7l.feishu.cn/docx/NehJdldhOohQX7xjpascFd9vnac?from=from_copylink].
 
||
 
||
 
*
 
*

2025年4月14日 (一) 11:05的最后版本

People This Week Next Week Task Tracking (DeadLine)
Dong Wang
  • AIGE opening ceremoney
Lantian Li
  • Complete IS rebuttal
  • Translate ML book
  • Release AIGE web v.10
Ying Shi
  • 4.11 meeting stuff
  • Structure of thesis
Zhenghai You
  • Some model tuning in Huawei projects (training)
  • Revise rebuttal
Junming Yuan
  • make primary school PPT
  • some AI practice handbook works
Xiaolou Li
  • Write document for GA project
  • LipVoicer reproduction[1]
Zehua Liu
  • Reproduce ACClip2Speech on CNVSRC.Single[2]
  • Finish Vocoder code (still on training)
  • GongAn document writing
Pengqi Li
  • developed primary school PPTs (2)
  • Co-developed AI practice handbooks with the practice schools (3)
Wan Lin
  • Some Experiments [3]
Tianhao Wang
  • huawei project & huohua testing
  • personal stuff
Xiaoxue Luo
  • Sound separation
    • baseline: adjust the code of one USS model and evaluate its performance using our test datasets
Zhenyu Zhou
Junhui Chen
  • Continue all NS exp with 1 layer
  • Read paper
Jiaying Wang
  • explore Whisper: ASR function cannot transcribe mix speech well, the SOT of whisper is on the task level (translate, transcribe, ...)
  • modified the model
    • using the CTC loss with the minimum loss per iteration backpropagation strategy: loss still turns out to be NaN (problem not in loss caculation)
    • replace asr modules to find reason
  • other ASR tools
    • trying mix speech ASR of FunASR, and attempting to integrate it into the current model
Yu Zhang
  • Add additional inference step for technical factor analysis (worth for a longer backtesting)
  • Done single stock trade pipeline (debug is done, can run backtesting tomorrow)
  • Weight avg trading (still debugging)
Wenqiang Du
  • Prepared for the 4.11 meeting
Yang Wei
  • Some modification based on the kws patent document from Wenqiang.
Turi
Yue Gu
  • read the FIP paper carefully, write two introduction paragraphs
  • reproduce some hotword exps in new cloud service
Qi Qu
  • Finished organizing codebase for evaluating KWS models (both "audio-enroll" and text-enroll).
  • Text-enroll KWS models evaluation on Mandarin Chinese w/ southern China accent data [4].