2025-03-03

来自cslt Wiki
跳转至: 导航搜索
People This Week Next Week Task Tracking (DeadLine)
Dong Wang
  • Three slides for AIGE to gov and enterprise.
Lantian Li
  • Proofread of the high-school book (Done)
Ying Shi
  • Prepare Ascend Sever environment
  • training Conditional Chain overlap ASR model with Hierachical-Transformer here
Zhenghai You
  • Training TSE model for with content enrollment(for Huawei & CSSC(中船) projects)
  • Reading papers about refiner
Junming Yuan
  • Finish MPC-HuBERT pretrain.
  • Double check the related experimental code.
    • MT-HuBERT(in progress) & Cocktail-HuBERT need re-pretrain.
    • The results of other baseline in here
Xiaolou Li
  • VSR training (1500h) cnvsrc-single valid 300 CER: 36.14% (not converged)
  • Finish pre-processing 4000h data
  • get ASR transcript for 4000h data
  • Writing NSFC document
Zehua Liu
  • Paper Reading and Sharing in last Friday
  • Writing Vision Language Model code
  • Writing NSFC document
Pengqi Li
  • Prepare the AI course for Tsinghua University Junior High School.
  • Using t-SNE to visualize the factorized content vector.
    • Next step is to color(speaker information importance or not) each point.
Wan Lin
  • try some adjustment for clean performance(no improvement)
  • supply experiments for other tests
Tianhao Wang
  • sound separation: 2-mix and 3-mix model training
  • weekly report
  • subset data training
Xiaoxue Luo
  • generation of multi-mix audio data and did some test experiments.
  • read papers
Zhenyu Zhou
  • finish graduation thesis
Junhui Chen
  • Reproducing speaker diarization method for NS (debugging...)
  • read paper
Jiaying Wang
  • debug ctc loss part[1]
Yu Zhang
  • AED:
    • Split AED model into two smaller model to detect the human voice in noisy environments and in clean environments separately.
    • Trying smaller model (under 200K)
  • Multi Agent Investment
    • try index enhancement trading, no obvious excess return
  • try do portfolio investment on some selected big company
  • add the debate topic about the logical consistency inside investment decisions.
Wenqiang Du
  • Primary handbook's PPT (24/44)
  • Continue to check Primary and middle handbook(Completed this week)
  • Speech cloning sample for the company
Yang Wei
  • Tuning text enroll kws model for dialect data with linear layer. (recall: 65%->85%->94%)
Turi
  • Thesis writing
  • Result with LM[2]
Yue Gu
  • finish some exps, but nothing is improved.
  • finish a proposal,I will present it recently
Qi Qu
  • Applying pre-prod eval routine on text-enroll KWS models: the ideal thresholds for each keyword vary significantly. [3]