|
|
| 第143行: |
第143行: |
| | |Junhui Chen | | |Junhui Chen |
| | || | | || |
| − | * | + | * VoxBlink1 |
| | + | ** Data processing |
| | + | ** Baseline(ResNet34) training and NS training [https://z1et6d3xtb.feishu.cn/docx/BywjdkGvNou12sxQ4dAcxYa9noh?from=from_copylink] |
| | || | | || |
| | * | | * |
| People |
This Week |
Next Week |
Task Tracking (DeadLine)
|
| Dong Wang
|
- AI primary (middle-school) 1-6
|
|
|
| Lantian Li
|
|
|
|
| Ying Shi
|
|
|
|
| Zhenghai You
|
|
|
|
| Junming Yuan
|
- Verified two parameters in Hubert pretraining config file that were confused with the original paper.[1]
- Confirmed that in the second iteration of pretraining, features should be extracted from the 6-th layer of the transformer, not the 9-th layer.
- in 175k step, result of 6-th layer: 71.55/9.39, result of 9-th layer: 37.31/16.72
- Basically confirmed the setting of the parameter 'untie_final_proj' for the two iterations of pretraining.
|
|
|
| Chen Chen
|
|
|
|
| Xiaolou Li
|
|
|
|
| Zehua Liu
|
|
|
|
| Pengqi Li
|
- Investigating Extremely Short-Utterance in speaker recognition[2]
|
|
|
| Wan Lin
|
- VoxBlink1
- Data processing
- Baseline(ResNet34) training and NS training [3]
|
|
|
| Tianhao Wang
|
|
|
|
| Zhenyu Zhou
|
|
|
|
| Junhui Chen
|
- VoxBlink1
- Data processing
- Baseline(ResNet34) training and NS training [4]
|
|
|
| Jiaying Wang
|
|
|
|
| Yu Zhang
|
|
|
|
| Wenqiang Du
|
|
|
|
| Yang Wei
|
|
|
|
| Lily
|
|
|
|
| Turi
|
|
|
|
| Yue Gu
|
- modify the introduction
- complete the interspeech poster, and open source the paper code
- rest for two days, next I will focus on my new work
|
|
|
| Qi Qu
|
|
- KWS:
- zh48 test dataset to be updated: ~30 speakers in 3 locations.
- yue10 (Cantonese 10 keywords) train dataset to be updated: ~120 speakers verified, more to come.
- Try to find suitable keyword-wise thresholds based on Recall ~ FA relation.
|
|