Difference between revisions of "2024-01-29"

From cslt Wiki
Line 35:
 
* Cohort Overlap ASR
 
** Finish first step: recognize one source from the mixture by employing speaker embedding
 +
* [https://z1et6d3xtb.feishu.cn/wiki/QoNWwCs9QibHt7k670hcxZBYncb?from=from_copylink group work]
 
||
 
*

Revision as of 10:46, 29 January 2024 (Monday)

People | This Week | Next Week | Task Tracking (Deadline)
Dong Wang
  • MicroMagnetic paper: first pass completed.
Lantian Li
  • GPU status [1]
  • ASIP-BUPT (Neural Scoring)
  • ASIP Annual report
Ying Shi
  • Keyword-Attributed Overlap ASR
    • Fix test dataset: LibriMix-Espnet & LibriMix-Official (2 mix clean)
    • Finish model training: KA-ASR-Full, KA-ASR-Oracle, SOT-Our
  • Cohort Overlap ASR
    • Finish first step: recognize one source from the mixture by employing speaker embedding
  • group work
Zhenghai You
Junming Yuan
  • Check and organize the mix-training pretraining experiment project.
  • Solving the error of MFA on dragon03 (done).
  • Extending the pretraining data (done).
  • Exploring the effect of BN in few-shot finetuning (in progress).
Chen Chen
  • CNCVS data collect
    • Finished testing phase with support from sunyiwei, shuyanzhi, mengshuaiming
  • Child Record Website
    • Finished phoneme annotation phase
    • Get some statistics
  • DeepFake
    • Human Test on DFDC
    • Zehua & Xiaolou Report
Xiaolou Li
Zehua Liu
Pengqi Li
Wan Lin
Tianhao Wang
  • IS24 paper writing (English version & LaTeX)
Zhenyu Zhou
  • Signal-level Speaker Augmentation Plan [2]:
    • Transformation (random-based & knowledge-based)
    • Speaker Characteristics Guided Voice Conversion
Junhui Chen
Jiaying Wang
  • speaker encoder preparation (ResNet34_ASP_AAMSoftmax-LMFT)
  • gender divide test on speaker beam
  • cohort with min SNR loss pb
Yu Zhang
Wenqiang Du
  • Diting Project
    • data aug
    • add Gaussian noise to control FA
Yang Wei
  • Huilan stuff
    • Develop a streaming-mode ASR interface for the ASR service
    • Address the latency problem with long text input for the TTS service
Lily
  • Update statistical results [3]