“2024-02-05”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
第61行: 第61行:
 
|Chen Chen
 
|Chen Chen
 
||  
 
||  
*  
+
 +
 
 +
*   DeepFake
 +
:        by xiaolou,zehua
 +
:        syncnet and wer based experiments on noisy audio/video input
 +
:        seems noise is not the reason why these methods failed
 +
*    VTS
 +
:        Finetune a HuBERT with a HiFiGAN for "audio feature to speech" system (both single speaker and multi speaker is ok)
 +
:        Train a VTS(ResNet Conformer Encoder) for "Video to audio feature" system (for single speaker it works well to some degree)
 +
:        Try training multi-speaker video-to-audio-feature system
 +
:        Try joint train video encoder and hifigan
 +
 
 
||
 
||
 
*  
 
*  
第182行: 第193行:
 
|Yang Wei
 
|Yang Wei
 
||  
 
||  
*  
+
* Prepare data backup for corpus disk. 
 
||
 
||
 
*
 
*

2024年2月5日 (一) 11:18的版本

People This Week Next Week Task Tracking (DeadLine)
Dong Wang
  • Keep on NeuralMag paper, refine the complexity theory
  • Design AI course for Primary School.
Lantian Li
Ying Shi
Zhenghai You
Junming Yuan
Chen Chen


  • DeepFake
by xiaolou,zehua
syncnet and wer based experiments on noisy audio/video input
seems noise is not the reason why these methods failed
  • VTS
Finetune a HuBERT with a HiFiGAN for "audio feature to speech" system (both single speaker and multi speaker is ok)
Train a VTS(ResNet Conformer Encoder) for "Video to audio feature" system (for single speaker it works well to some degree)
Try training multi-speaker video-to-audio-feature system
Try joint train video encoder and hifigan
Xiaolou Li
Zehua Liu
Pengqi Li
Wan Lin
Tianhao Wang
Zhenyu Zhou
Junhui Chen
Jiaying Wang
Yu Zhang
Wenqiang Du
Yang Wei
  • Prepare data backup for corpus disk.
Lily