“2024-08-26”版本间的差异

2024年8月26日 (一) 11:03的最后版本

People	This Week	Task Tracking (DeadLine)
Dong Wang	Primary school book (17) College AI education
Lantian Li	GPU status [1] AI primary High school handbook (40/40)
Ying Shi	Fenyinta stuff reproduce cohort-SOT overlap ASR and some analysis Text enroll keywords spotting intermediate PIT-SOT CTC + high layer cross-attention is in progress here
Zhenghai You	Speaker Augument: Completed experiments in Libri2mix, Low SISDR testset lower speaker confusion rate ExFormer: Always inferior to the SOTA[2]
Junming Yuan	Confirmed that the performance gap of the 10% is determined by the impact of GPUs[3]. To fully reproduce the official model, it would take approximately 32 days. Investigate how to train Hubert with Mix-speech (in progress)
Xiaolou Li	LLM long context test Poster for IS24 Paper reading
Zehua Liu	CNVSRC 2024 Website Data transfer to HUAWEI LLM in Chinese VSR(In-context-learning)
Pengqi Li	Extend Proposal for 'HOW PHONEMES CONTRIBUTE TO DEEP SPEAKER MODELS?'[4] Reviewing code, paper. Analyzing di-phones in Audio-Mnist. Start exp with TIMIT dataset.	9.20(one month)
Wan Lin	Neural Scoring: vox2+voxblink1 [5]
Tianhao Wang	AudioSep reproducing IS24 poster
Zhenyu Zhou	Some thinking about onnx quantization[6]
Junhui Chen	Neural Scoring: Vox2+Voxblink-clean test[7]
Jiaying Wang	re-write conditional chain code(can be finished this week) check wsj data
Yu Zhang	AED engineering problem assist Prepare for report
Wenqiang Du	Complete the unified format and recheck of Primary school handbook Write middle school handbook(29-41) Training Chinese and Cantonese KWS model
Yang Wei	Check the badcase of KWS model test.
Lily
Turi	Added more sections to the draft paper Need to refine and do more experiments
Yue Gu	write the introduction test the adaptation model on the same accent data:[8] （got sick today）
Qi Qu	KWS: zh48 test dataset updated: 29 speakers in 3 locations, ~600 utterances per keyword. Recall ~ FA relations plotted.

@@ 第32行： / 第32行： @@
 |Ying Shi
 ||
-*
+* Fenyinta stuff
+* reproduce cohort-SOT overlap ASR and some analysis
+* Text enroll keywords spotting intermediate PIT-SOT CTC + high layer cross-attention is in progress  [https://z1et6d3xtb.feishu.cn/docx/DI3UdF496ojxCQxTqUycsjDQnxf?from=from_copylink here]
 ||
 *
@@ 第43行： / 第45行： @@
 |Zhenghai You
 ||
-*
+* Speaker Augument: Completed experiments in Libri2mix, Low SISDR testset lower speaker confusion rate
+* ExFormer: Always inferior to the SOTA[https://z1et6d3xtb.feishu.cn/docx/ZbtsdTGuQo4IXnxuxHXcpvBynoe]
 ||
 *
@@ 第56行： / 第59行： @@
 ** To fully reproduce the official model, it would take approximately 32 days.
 * Investigate how to train Hubert with Mix-speech (in progress)
-||
-*
-||
-*
-|-
-|-
-|Chen Chen
-||
-*
 ||
 *
@@ 第77行： / 第69行： @@
 |Xiaolou Li
 ||
-*
+* LLM long context test
+* Poster for IS24
+* Paper reading
 ||
 *
@@ 第108行： / 第102行： @@
 *
 ||
-*
+* 9.20(one month)
 |-
@@ 第115行： / 第109行： @@
 |Wan Lin
 ||
-*
+* Neural Scoring: vox2+voxblink1 [https://z1et6d3xtb.feishu.cn/docx/BywjdkGvNou12sxQ4dAcxYa9noh?from=from_copylink]
 ||
 *
@@ 第137行： / 第131行： @@
 |Zhenyu Zhou
 ||
-**Some thinking about onnx quantify
+*Some thinking about onnx quantization[https://z1et6d3xtb.feishu.cn/docx/S9ChdyH7go490txZ2ZHcNjXTn2b]
 ||
 *
@@ 第160行： / 第154行： @@
 |Jiaying Wang
 ||
-*
+* re-write conditional chain code(can be finished this week)
+* check wsj data
 ||
 *
@@ 第195行： / 第191行： @@
 |Yang Wei
 ||
-*
+* Check the badcase of KWS model test.
 ||
 *
@@ 第225行： / 第221行： @@
 ||
 * write the introduction
-* test the adaptation model on the same accent data:
+* test the adaptation model on the same accent data:[https://www.yuque.com/g/shibeiing/angax2/efpdhqvsxdi4phua/collaborator/join?token=ysvAeigXC9KFi4CF&source=doc_collaborator]
+（got sick today）
 ||
 *
@@ 第235行： / 第231行： @@
 |Qi Qu
 ||
-*
+* KWS:
+** zh48 test dataset updated: 29 speakers in 3 locations, ~600 utterances per keyword.
+** Recall ~ FA relations plotted.
 ||
 *

“2024-08-26”版本间的差异

2024年8月26日 (一) 11:03的最后版本

导航菜单

个人工具

名字空间

变种

查看

操作

搜索

导航

工具