“2024-11-18”版本间的差异

2024年11月18日 (一) 11:14的最后版本

People	This Week	Next Week
Dong Wang	2nd round check for middle-school AI handbook AI training for teachres of Tsinghua Middle School
Lantian Li	AI-Graph EN Chapter 3 Done 2025 Daily Sign v1.0 Done CSTR Report
Ying Shi	Test Google's product about sound separation Correct the test results of the previous condition overlap asr model here
Zhenghai You
Junming Yuan	reproduce cocktail-Hubert feat-mask MT-Hubert change training strategy result in [1]
Chen Chen
Xiaolou Li	Finally finish the VTS report Data preparation CVS3 process 1/4 take over webVideo from SUN CHANG and preprocess it through auto-avsr pipline Code preparation Finish the Conformer/CTC pretraining code Still debuging AVHuBERT pretraining code Paper reading...
Zehua Liu	VTS Documents Revise with Xiaolou Iterative inference training LLM Different context length[2]
Pengqi Li	Summarize recently work and report.	Mapping to IPA from diff language. Write Paper.
Wan Lin
Tianhao Wang	organizing the exp plan and modify the code for In-context-Audio-Retrieval (in training)
Xiaoxue Luo	prepare the code for CED+AudioSep participate in an AI competition with Wenqiang and Zhangyu
Zhenyu Zhou	Huawei project read papers code review(Design new ordering method)
Junhui Chen
Jiaying Wang
Yu Zhang	ICCIP 2024 Paper reading about LLM Market Simulation
Wenqiang Du	ICCIP2024 Participated in an AI competition
Yang Wei	Do text enroll kws model experiment on small data set.(https://z1et6d3xtb.feishu.cn/docx/WLLLd2zQvoiE78xfMhpcHejdnnd)
Lily
Turi
Yue Gu	synthesis some audios for target speakers [3] paper writing
Qi Qu	Knock detection: output every knock's offset so that a shorter audio can be built to speed up human verification. Text-enroll KWS: model i/o optimized; 2.5x faster than the first version. KWS: Chongqing dialect train dataset (15 keywords, ~24.5k utterances). Exp. using new FunASR model (SeACoParaformer) for cloud verification, which handles hotwords better. Exp. using B0-based KWS model for local verification after detection from Chipintelli's chip.

@@ 第18行： / 第18行： @@
 |Lantian Li
 ||
-*
+* AI-Graph EN Chapter 3 Done
+* 2025 Daily Sign v1.0 Done
+* CSTR Report
 ||
 *
@@ 第29行： / 第31行： @@
 |Ying Shi
 ||
-*
+* Test Google's product about sound separation
+* Correct the test results of the previous condition overlap asr model [https://z1et6d3xtb.feishu.cn/docx/GtmydP85Noq1eIx56z3c7qV5nWc?from=from_copylink here]
 ||
 *
@@ 第75行： / 第78行： @@
 |Xiaolou Li
 ||
-*
+* Finally finish the VTS report
+* Data preparation
+** CVS3 process 1/4
+** take over webVideo from SUN CHANG and preprocess it through auto-avsr pipline
+* Code preparation
+** Finish the Conformer/CTC pretraining code
+** Still debuging AVHuBERT pretraining code
+* Paper reading...
 ||
 *
@@ 第86行： / 第96行： @@
 |Zehua Liu
 ||
-*
+*VTS Documents Revise with Xiaolou
+*Iterative inference training
+*LLM Different context length[https://z1et6d3xtb.feishu.cn/docx/JBsidACDVojhCaxFQLbcCVbsnAc?from=from_copylink]
 ||
 *
@@ 第119行： / 第131行： @@
 |-
 |Tianhao Wang
+||
+* organizing the exp plan and modify the code for In-context-Audio-Retrieval (in training)
 ||
 *
+||
+*
+|-
+|-
+|Xiaoxue Luo
+||
+* prepare the code for CED+AudioSep
+* participate in an AI competition with Wenqiang and Zhangyu
 ||
 *
@@ 第131行： / 第155行： @@
 |Zhenyu Zhou
 ||
-*
+*Huawei project
+*read papers
+*code review(Design new ordering method)
 ||
 *
@@ 第164行： / 第190行： @@
 |Yu Zhang
 ||
-*
+* ICCIP 2024
+* Paper reading about LLM Market Simulation
 ||
 *
@@ 第175行： / 第202行： @@
 |Wenqiang Du
 ||
-*
+* ICCIP2024
+* Participated in an AI competition
 ||
 *
@@ 第186行： / 第216行： @@
 |Yang Wei
 ||
-*
+* Do text enroll kws model experiment on small data set.(https://z1et6d3xtb.feishu.cn/docx/WLLLd2zQvoiE78xfMhpcHejdnnd)
 ||
 *
@@ 第214行： / 第244行： @@
 |Yue Gu
 ||
-* synthesis some audios for target speakers
+* synthesis some audios for target speakers [https://j1kw9qcmaxp.feishu.cn/wiki/CZDTwKdPwi3mvTk9BJccdifZnEb?from=from_copylink]
 * paper writing
 ||
@@ 第224行： / 第254行： @@
 |Qi Qu
 ||
-*
+* Knock detection: output every knock's offset so that a shorter audio can be built to speed up human verification.
+* Text-enroll KWS: model i/o optimized; 2.5x faster than the first version.
+* KWS: Chongqing dialect train dataset (15 keywords, ~24.5k utterances).
+* Exp. using new FunASR model (SeACoParaformer) for cloud verification, which handles hotwords better.
+* Exp. using B0-based KWS model for local verification after detection from Chipintelli's chip.
 ||
 *

“2024-11-18”版本间的差异

2024年11月18日 (一) 11:14的最后版本

导航菜单

个人工具

名字空间

变种

查看

操作

搜索

导航

工具