“2024-01-29”版本间的差异
来自cslt Wiki
Yuanjunming(讨论 | 贡献) |
|||
(13位用户的19个中间修订版本未显示) | |||
第30行: | 第30行: | ||
|Ying Shi | |Ying Shi | ||
|| | || | ||
− | * | + | * Keyword-Attributed OverLap ASR |
+ | ** Fix test dataset: LibirMix-Espnet & LibriMix-Official (2 mix clean) | ||
+ | ** Finish model training: KA-ASR-Full, KA-ASR-Oracle, SOT-Our | ||
+ | * Cohort Overlap ASR | ||
+ | ** Finish first step: Recognize one source from mixture by employ speaker embedding | ||
+ | * [https://z1et6d3xtb.feishu.cn/wiki/QoNWwCs9QibHt7k670hcxZBYncb?from=from_copylink group work] | ||
|| | || | ||
* | * | ||
第41行: | 第46行: | ||
|Zhenghai You | |Zhenghai You | ||
|| | || | ||
− | * | + | * cohort embedding replace speakerbeam speaker embedding |
|| | || | ||
* | * | ||
第65行: | 第70行: | ||
|Chen Chen | |Chen Chen | ||
|| | || | ||
− | * | + | * CNCVS data collect |
+ | ** Finished testing phase with support from sunyiwei,shuyanzhi,mengshuaiming | ||
+ | * Child Record Website | ||
+ | ** Finished phoneme annotation phase | ||
+ | ** Get some statistics | ||
+ | * DeepFake | ||
+ | ** Human Test on DFDC [https://z1et6d3xtb.feishu.cn/docx/N5u6dSmgNoYHT2xzP2IcymxFn6d?from=from_copylink] | ||
+ | ** Zehua & Xiaolou Report | ||
|| | || | ||
* | * | ||
第76行: | 第88行: | ||
|Xiaolou Li | |Xiaolou Li | ||
|| | || | ||
− | * | + | * test on LAV-DF dataset |
+ | * dataset survey | ||
+ | * weekly report | ||
|| | || | ||
* | * | ||
第87行: | 第101行: | ||
|Zehua Liu | |Zehua Liu | ||
|| | || | ||
− | * | + | * weekly report |
+ | * AV-Hubert test | ||
|| | || | ||
* | * | ||
第98行: | 第113行: | ||
|Pengqi Li | |Pengqi Li | ||
|| | || | ||
− | * | + | * Duration mismatch with XueYing[https://z1et6d3xtb.feishu.cn/docx/CDcxdX5BcomHlCx2So5cWxL8nVg] |
+ | ** Compare pre-TDNN 和 post-TDNN | ||
|| | || | ||
* | * | ||
第120行: | 第136行: | ||
|Tianhao Wang | |Tianhao Wang | ||
|| | || | ||
− | * | + | * IS24 paper writing (english version & latex) |
|| | || | ||
* | * | ||
第131行: | 第147行: | ||
|Zhenyu Zhou | |Zhenyu Zhou | ||
|| | || | ||
− | *Signal leval Speaker Augmentation[https://z1et6d3xtb.feishu.cn/docx/DViBdvm8KoQMMXxMXC0cWp2vnPf]: | + | *Signal leval Speaker Augmentation Plan[https://z1et6d3xtb.feishu.cn/docx/DViBdvm8KoQMMXxMXC0cWp2vnPf]: |
− | + | **Transformation(Random based & Knowledge based) | |
− | + | **Speaker Characteristics Guided Voice Conversion | |
|| | || | ||
* | * | ||
第155行: | 第171行: | ||
|Jiaying Wang | |Jiaying Wang | ||
|| | || | ||
− | * | + | * speaker encoder preparation(ResNet34_ASP_AAMSoftmax-LMFT) |
|| | || | ||
− | * | + | * gender divide test on speaker beam |
+ | * cohort with min SNR loss pb | ||
|| | || | ||
* | * | ||
第166行: | 第183行: | ||
|Yu Zhang | |Yu Zhang | ||
|| | || | ||
− | * | + | * Financial Pipeline |
+ | ** adapt portfolio policy to position changes | ||
|| | || | ||
* | * | ||
第177行: | 第195行: | ||
|Wenqiang Du | |Wenqiang Du | ||
|| | || | ||
− | * | + | * Diting Project |
+ | **data aug | ||
+ | **add gaussian noise to control FA | ||
+ | |||
|| | || | ||
* | * | ||
第188行: | 第209行: | ||
|Yang Wei | |Yang Wei | ||
|| | || | ||
− | * | + | * Huilan stuff |
+ | ** Develop stream mode ASR interface for ASR service | ||
+ | ** Deal with time delay problem with long text input for TTS service | ||
|| | || | ||
* | * |
2024年2月19日 (一) 10:50的最后版本
People | This Week | Next Week | Task Tracking (DeadLine) |
---|---|---|---|
Dong Wang |
|
|
|
Lantian Li |
|
|
|
Ying Shi |
|
|
|
Zhenghai You |
|
|
|
Junming Yuan |
·Solving the error of MFA on dragon03.(done) ·Extending the pretraining data.(done) ·Exploring the effect of BN in the few-shot finetuning(in progress). |
|
|
Chen Chen |
|
|
|
Xiaolou Li |
|
|
|
Zehua Liu |
|
|
|
Pengqi Li |
|
|
|
Wan Lin |
|
|
|
Tianhao Wang |
|
|
|
Zhenyu Zhou |
|
|
|
Junhui Chen |
|
|
|
Jiaying Wang |
|
|
|
Yu Zhang |
|
|
|
Wenqiang Du |
|
|
|
Yang Wei |
|
|
|
Lily |
|
|
|