2024-01-29

来自cslt Wiki
2024年1月29日 (一) 11:09Liuzehua讨论 | 贡献的版本

跳转至: 导航搜索
People This Week Next Week Task Tracking (DeadLine)
Dong Wang
  • MicroMagnetic paper, the first pass completed.
Lantian Li
  • GPU status [1]
  • ASIP-BUPT (Neural Scoring)
  • ASIP Annual report
Ying Shi
  • Keyword-Attributed OverLap ASR
    • Fix test dataset: LibirMix-Espnet & LibriMix-Official (2 mix clean)
    • Finish model training: KA-ASR-Full, KA-ASR-Oracle, SOT-Our
  • Cohort Overlap ASR
    • Finish first step: Recognize one source from mixture by employ speaker embedding
  • group work
Zhenghai You
  • cohort embedding replace speakerbeam speaker embedding
Junming Yuan
  • Check and organize the mix-training pretraining experiment project.
·Solving the error of MFA on dragon03.(done)
·Extending the pretraining data.(done)
·Exploring the effect of BN in the few-shot finetuning(in progress).
Chen Chen
  • CNCVS data collect
    • Finished testing phase with support from sunyiwei,shuyanzhi,mengshuaiming
  • Child Record Website
    • Finished phoneme annotation phase
    • Get some statistics
  • DeepFake
    • Human Test on DFDC [2]
    • Zehua & Xiaolou Report
Xiaolou Li
  • test on LAV-DF dataset
  • dataset survey
  • weekly report
Zehua Liu
  • weekly report
  • AV-Hubert test
Pengqi Li
  • Duration mismatch with XueYing[3]
    • Compare pre-TDNN 和 post-TDNN
Wan Lin
Tianhao Wang
  • IS24 paper writing (english version & latex)
Zhenyu Zhou
  • Signal leval Speaker Augmentation Plan[4]:
    • Transformation(Random based & Knowledge based)
    • Speaker Characteristics Guided Voice Conversion
Junhui Chen
Jiaying Wang
  • speaker encoder preparation(ResNet34_ASP_AAMSoftmax-LMFT)
  • gender divide test on speaker beam
  • cohort with min SNR loss pb
Yu Zhang
  • Financial Pipeline
    • adapt portfolio policy to position changes
Wenqiang Du
  • Diting Project
    • data aug
    • add gaussian noise to control FA
Yang Wei
  • Huilan stuff
    • Develop stream mode ASR interface for ASR service
    • Deal with time delay problem with long text input for TTS service
Lily
  • update statistical results[5]