2026-01-19

来自cslt Wiki
2026年1月19日 (一) 10:57Chenjh讨论 | 贡献的版本

跳转至: 导航搜索
People This Week Next Week Task Tracking (DeadLine)
Dong Wang
  • Recheck AI textbooks for primary and middle schools.
Lantian Li
Wenqiang Du
  • Statistics the accounts for companies, our laboratory and AIGE.
  • Check AI textbooks
Yang Wei
Ying Shi
  • Thesis
  • Huawei multi-talker ASR project(Mandarin 2-mix test with FunASR-Nano-800M)
    • ground-truth WER 3.89%
    • separated WER 12.11%
Yue Gu
  • write my Phd thesis
  • the seminar
Lily
  • Participated in reviewing the AI handbooks (primary/middle/high school)
  • Compiled course materials for the AI Handbook (Senior Edition)
Pengqi Li
  • Writing the Experiments section.
    • Focus on hard-to-explain results by formulating hypotheses and conducting deeper analysis.
Junming Yuan
  • Aug-MT-HuBERT(failed)
    • Continued pre-training for 600K steps. there is still no improvement observed on clean-speech tasks.
      • Aug-MT-HuBERT has more substitution errors in clean ASR adaptation.
  • Incorporate a "learn-not-to-listen" mechanism into MT-HuBERT and retrain the backbone. (in progress)
    • At 425K steps, PR(PER%): 8.18%, ASR(WER%): 9.32%, SD(DER%): 5.05%
  • middle school AI textbook checking(picture, table)
Yu Zhang
  • GPU Util: [1]
  • LLM
    • Rewrite the graph connectivity of Swarm MMLU (the original topology does not align with our expectations).
    • Add context segmentation logic to the ECS computation, so that we can trace which preceding node a given text segment originates from.
Junhui Chen
  • LLM
    • Get swarm crosswords experiment with metric detection working
      • because crosswords has more different role characters than MMLU, so it is necessary
    • Implement multi-process of LLM instances in experiments
    • Check and find some bugs in code of @ZhangYu
      • p.s.: try multi-threading, fails because a single Python interpreter only initializes one CUDA context, causing resource contention between different instances.
Jiaying Wang
  • loudness verification 2/3
  • spk experiment 3-mix testing, 2-mix & 4-mix training
Xiaoxue Luo
  • 2-5mix multi_head separation model for Huawei project [2]
  • prepare 2mix speech separation results for ASR test
Bochao Hu
  • read some papers
  • refactor P2S code, using ms-swift and adding n-best seq to input.
Hongcheng Zhang
  • Complete the training of short audio descriptions for AudioSet-SL in WavCaps
    • METEOR: 0.3373, ROUGE-L: 0.3121, BLEU-4 : 0.0769, CIDEr : 0.6207
Weiman Sun
  • Label audioset vedio dataset
  • Analyze the representation of MT-LLM inference and review relevant literature