2025-01-13

From cslt Wiki
People This Week Next Week Task Tracking (DeadLine)
Dong Wang
  • Recheck the high-school version of the AI handbook
  • Publication stuff
Lantian Li
  • Continue work on AI-Graph EN (49/50)
Ying Shi
  • Try to reproduce the CTC conditional-chain ASR baseline (failed)
  • check code with PIT loss (code looks fine)
  • Try to train a model that only recognizes the most dominant components (failed)
Zhenghai You
  • The first Chinese version of the paper [1]
  • Online Demo UI Design for Huawei Project[2]
Junming Yuan
  • Check the high-school AI handbook (done)
  • Organize our photos
  • The results of MT-Hubert on LS960[3]
    • 2-mixed test, top-2 ACC: 74.63% at 400K steps -> 79.43% at 1600K steps
Xiaolou Li
  • Process server code update
  • Data audit (until 15th Jan)
Zehua Liu
  • Paper Reading
  • Writing code for loading larger LLMs (32B and 72B)
  • Interspeech paper writing
  • Checking the collected data (with Xiaolou)
Pengqi Li
  • IS25 Proposal almost done
    • writing the experimental part
    • checking and analyzing the results
  • Went to the hospital for a follow-up, then returned to the lab.
Wan Lin
  • NS: Applied multi-enroll, margin-BCE and long-duration testing to the ResNet+Transformer model: EER 1.23% -> 1.13%
Tianhao Wang
  • writing the chain-based sound separation code
  • testing some SED pretraining under the mixed scenario
Xiaoxue Luo
  • got a bad flu
  • read papers
Zhenyu Zhou
Junhui Chen
Jiaying Wang
  • IS2025 proposal[4]
Yu Zhang
  • Finished debugging the Multi-Agent Investment pipeline; the experiment is still running (a draft result is expected this week)
Wenqiang Du
  • Check the primary school handbook
    • Related PPT and lesson plans (jiaoan)
  • Some project cooperation
Yang Wei
  • Train a text-enrollment KWS model (pretrain with CTC loss + fine-tune without CTC loss). Not successful yet.
  • Develop ASR REST service based on FunASR model.
Turi
  • Ran an experiment on Conformer CTC with nbpe=500 (WER from 26 to 18)
  • Refined ICASSP paper for final submission and submitted it
  • Prepare for interview
Yue Gu
  • design the fine-grained personality extractor to produce phone-level voice character similarity (code is in progress)
  • check the primary school handbook
Qi Qu
  • AED:
    • CED classifiers implemented on mr536 NPU.
  • KWS:
    • Training data collected and processed for Qingdao dialect, 20 keywords.
    • Analysis of some production false alarms (FAs).
  • Android demo and supporting backend services: KWS + ASR/MT -> instruction submission.