Difference between revisions of "2024-02-05"
From cslt Wiki
Line 60: | Line 60: | ||
|Chen Chen | |Chen Chen | ||
|| | || | ||
− | * | + | * DeepFake |
+ | ** by Xiaolou, Zehua | ||
+ | ** SyncNet- and WER-based experiments on noisy audio/video input | ||
+ | ** Noise does not appear to be the reason these methods failed | ||
+ | * VTS | ||
+ | ** Fine-tune a HuBERT with a HiFiGAN for an "audio feature to speech" system (works for both single-speaker and multi-speaker) | ||
+ | ** Train a VTS (ResNet Conformer Encoder) for a "video to audio feature" system (works well to some degree for a single speaker) | ||
+ | ** Try training a multi-speaker video-to-audio-feature system | ||
+ | ** Try jointly training the video encoder and HiFiGAN | ||
|| | || | ||
* | * |
Revision as of 10:57, 5 February 2024 (Mon)
People | This Week | Next Week | Task Tracking (Deadline) |
---|---|---|---|
Dong Wang | | | |
Lantian Li | | | |
Ying Shi | | | |
Zhenghai You | | | |
Junming Yuan | | | |
Chen Chen | | | |
Xiaolou Li | | | |
Zehua Liu | | | |
Pengqi Li | | | |
Wan Lin | | | |
Tianhao Wang | | | |
Zhenyu Zhou | | | |
Junhui Chen | | | |
Jiaying Wang | | | |
Yu Zhang | | | |
Wenqiang Du | | | |
Yang Wei | | | |
Lily | | | |