Difference between revisions of "2024-10-21"
From cslt Wiki
(3 intermediate revisions by 3 users not shown) | |||
Line 66: | Line 66: | ||
|Xiaolou Li | |Xiaolou Li | ||
|| | || | ||
− | * | + | * AVHuBERT unit exp |
+ | ** dc connector (↑0.8% vs. discrete unit) | ||
+ | ** concat feature and embedding (↑2% vs. discrete unit, ↓0.3% vs. baseline) | ||
+ | * CVS3 quality check (30h in total) [https://z1et6d3xtb.feishu.cn/drive/folder/HGHbfyCJRlLYzUdSlEicOEztnYc] | ||
+ | * This work was helped by Zehua, Linwan, Tianhao | ||
+ | * MLLM system with audio output design | ||
|| | || | ||
* | * | ||
Line 104: | Line 109: | ||
|Wan Lin | |Wan Lin | ||
|| | || | ||
+ | * Help with VSR data verification | ||
* experiment in voxblink2 [https://z1et6d3xtb.feishu.cn/docx/MxBNdPbLao0tsoxkBVCcUgUoneh?from=from_copylink] | * experiment in voxblink2 [https://z1et6d3xtb.feishu.cn/docx/MxBNdPbLao0tsoxkBVCcUgUoneh?from=from_copylink] | ||
|| | || | ||
Line 153: | Line 159: | ||
|Junhui Chen | |Junhui Chen | ||
|| | || | ||
− | * | + | * Experiments for NS |
+ | * Look for a speaker detection model with ResNet34 for frame labels | ||
|| | || | ||
* | * | ||
Line 200: | Line 207: | ||
|Yang Wei | |Yang Wei | ||
|| | || | ||
− | * | + | * Train a text-enrollment KWS model and test it with Aibabel dialect data. |
|| | || | ||
* | * |
Latest revision as of 11:01, 21 October 2024 (Mon)
People | This Week | Next Week | Task Tracking (DeadLine) |
---|---|---|---|
Dong Wang | | | |
Lantian Li | | | |
Ying Shi | | | |
Zhenghai You | | | |
Junming Yuan | | | |
Xiaolou Li | | | |
Zehua Liu | | | |
Pengqi Li | | | |
Wan Lin | | | |
Tianhao Wang | | | |
Xiaoxue Luo | | | |
Zhenyu Zhou | | | |
Junhui Chen | | | |
Jiaying Wang | | | |
Yu Zhang | | | |
Wenqiang Du | | | |
Yang Wei | | | |
Lily | | | |
Turi | | | |
Yue Gu | | | |
Qi Qu | | | |