People |
This Week |
Next Week |
Task Tracking (Deadline)
|
Dong Wang
|
- MicroMagnetic paper, the first pass completed.
|
|
|
Lantian Li
|
- GPU status [1]
- ASIP-BUPT (Neural Scoring)
- ASIP Annual report
|
|
|
Ying Shi
|
- Keyword-Attributed Overlap ASR
- Fix test dataset: LibriMix-Espnet & LibriMix-Official (2 mix clean)
- Finish model training: KA-ASR-Full, KA-ASR-Oracle, SOT-Our
- Cohort Overlap ASR
- Finish first step: recognize one source from the mixture by employing a speaker embedding
- group work
|
|
|
Zhenghai You
|
- Replace the SpeakerBeam speaker embedding with a cohort embedding
|
|
|
Junming Yuan
|
- Check and organize the mix-training pretraining experiment project.
- Solving the MFA error on dragon03 (done).
- Extending the pretraining data (done).
- Exploring the effect of BN in few-shot finetuning (in progress).
|
|
|
Chen Chen
|
- CNCVS data collection
- Finished the testing phase with support from sunyiwei, shuyanzhi, and mengshuaiming
- Child Record Website
- Finished phoneme annotation phase
- Get some statistics
- DeepFake
- Human Test on DFDC [2]
- Zehua & Xiaolou Report
|
|
|
Xiaolou Li
|
- test on LAV-DF dataset
- dataset survey
- weekly report
|
|
|
Zehua Liu
|
- weekly report
- AV-Hubert test
|
|
|
Pengqi Li
|
- Duration mismatch with XueYing [3]
- Compare pre-TDNN and post-TDNN
|
|
|
Wan Lin
|
|
|
|
Tianhao Wang
|
- IS24 paper writing (English version & LaTeX)
|
|
|
Zhenyu Zhou
|
- Signal-level Speaker Augmentation Plan [4]:
- Transformation(Random based & Knowledge based)
- Speaker Characteristics Guided Voice Conversion
|
|
|
Junhui Chen
|
|
|
|
Jiaying Wang
|
- speaker encoder preparation (ResNet34_ASP_AAMSoftmax-LMFT)
|
- gender-divided test on SpeakerBeam
- cohort with min SNR loss pb
|
|
Yu Zhang
|
- Financial Pipeline
- adapt portfolio policy to position changes
|
|
|
Wenqiang Du
|
- Diting Project
- Data augmentation
- Add Gaussian noise to control FA
|
|
|
Yang Wei
|
- Huilan stuff
- Develop a streaming-mode ASR interface for the ASR service
- Address the latency problem with long text input in the TTS service
|
|
|
Lily
|
- Update statistical results [5]
|
|
|