People |
This Week |
Next Week |
Task Tracking (Deadline)
|
Dong Wang
|
- Refine spoof paper
- Prepare talk on information theory in NN
- Prepare talk on representation investigation.
|
|
|
Yunqi Cai
|
- Review papers about CQDs
- Verify the deconvolution of infrared and visible faces
- Verify infrared and visible image fusion based on GLOW model
- Arrange research plans for interns
|
|
|
Lantian Li
|
- Finish course on AI.
- Study speaker separation and think about structural embedding.
|
- Finish ETM response.
- Experiments on hard trials.
|
|
Ying Shi
|
- Report on e2e KWS
- Speech engrave (garbage node, sil training data, text to speech attention)
- Analyze fenyinta test data [here]
|
- Further analysis of speech engrave (speech to text attention)
- Speech engrave (text to speech attention)
|
|
Haoran Sun
|
|
- Make some more efficient attempts:
  - remove rhythm and pitch encoders
  - increase distance between speakers
  - improve content encoder
  - make use of speaker labels
|
|
Chen Chen
|
- Pre-process audio data and train the GAN directly on wav2vec2 output data
|
- Cluster wav2vec2 outputs with k-means and PCA to build a better segment representation
|
|
Pengqi Li
|
- Reproduce a series of CAM methods on speaker classification
|
|
|
Qingyang Zhu
|
|
|
|
Weida Liang
|
- Finish the first version of the improved exemplar autoencoder with cycle loss
- Rethink the theoretical analysis part
|
- Test on never-before-seen speaker conversion
- Review the code of wav2vec, StarGAN, and PPG-based GAN
|
|
Zixi Yan
|
|
|
|
Sirui Li
|
- Fine-tune the wav2vec model
|
- Compare Tibetan and Chinese fine-tuning results
|
|
Haoyu Jiang
|
- Face sampling in CNCeleb dataset
- Filter videos without the target's face
|
|
|
Ruihai Hou
|
|
|
|
Renmiao Chen
|
- Sample some audio, listen, and analyze
|
|
|