2024-08-26

来自cslt Wiki
跳转至: 导航搜索
People This Week Next Week Task Tracking (DeadLine)
Dong Wang
  • Primary school book (17)
  • College AI education
Lantian Li
  • GPU status [1]
  • AI primary
    • High school handbook (40/40)
Ying Shi
  • Fenyinta stuff
  • reproduce cohort-SOT overlap ASR and some analysis
  • Text enroll keywords spotting intermediate PIT-SOT CTC + high layer cross-attention is in progress here
Zhenghai You
  • Speaker Augument: Completed experiments in Libri2mix, Low SISDR testset lower speaker confusion rate
  • ExFormer: Always inferior to the SOTA[2]
Junming Yuan
  • Confirmed that the performance gap of the 10% is determined by the impact of GPUs[3].
    • To fully reproduce the official model, it would take approximately 32 days.
  • Investigate how to train Hubert with Mix-speech (in progress)
Xiaolou Li
  • LLM long context test
  • Poster for IS24
  • Paper reading
Zehua Liu
  • CNVSRC 2024 Website
  • Data transfer to HUAWEI
  • LLM in Chinese VSR(In-context-learning)
Pengqi Li
  • Extend Proposal for 'HOW PHONEMES CONTRIBUTE TO DEEP SPEAKER MODELS?'[4]
    • Reviewing code, paper.
    • Analyzing di-phones in Audio-Mnist.
    • Start exp with TIMIT dataset.
  • 9.20(one month)
Wan Lin
  • Neural Scoring: vox2+voxblink1 [5]
Tianhao Wang
  • AudioSep reproducing
  • IS24 poster
Zhenyu Zhou
  • Some thinking about onnx quantization[6]
Junhui Chen
  • Neural Scoring:
    • Vox2+Voxblink-clean test[7]
Jiaying Wang
  • re-write conditional chain code(can be finished this week)
  • check wsj data
Yu Zhang
  • AED engineering problem assist
  • Prepare for report
Wenqiang Du
  • Complete the unified format and recheck of Primary school handbook
  • Write middle school handbook(29-41)
  • Training Chinese and Cantonese KWS model
Yang Wei
  • Check the badcase of KWS model test.
Lily
Turi
  • Added more sections to the draft paper
    • Need to refine and do more experiments
Yue Gu
  • write the introduction
  • test the adaptation model on the same accent data:[8]

(got sick today)

Qi Qu
  • KWS:
    • zh48 test dataset updated: 29 speakers in 3 locations, ~600 utterances per keyword.
    • Recall ~ FA relations plotted.