“2024-06-17”版本间的差异

2024年6月17日 (一) 12:59的最后版本

People	This Week	Next Week
Dong Wang	Review of a few papers from NCMMSC, MDPI etc. Review papers regarding AI for Medicine Refine "the principle of AI education in primary schools" A few public talks
Lantian Li	GPU status [1] Completed all teaching for this semester. Projects AED -> System integration with Huawei, Bullying patent for FYT TSE -> Preparing for the first phase delivery. VSR -> 1200h+ Finance -> Reproducing R^2 SAC, Overview time-series modelling Papers NeuralScoring -> In progress IS24 Camera-ready paper CNVSRC 2024 baseline paper
Ying Shi	Text enroll mix speech keyword spotting Cohort ASR with conditional Chain here
Zhenghai You	Start training on complete data for Huawei TSE project Change adaptlayer from cancat to FiLm [2] Test inference time and SISDR in online model Consider a TSE network that combines mixture and enrollment with attractor to extract speaker information
Junming Yuan	SSL model finetuning analysis v1[3].Need to check.
Chen Chen	Release UY/CH-CHILD dataset help with NCMMSC paper about CNVSRC 2024 CN-CVS2 1200+ hours data, phase 1 finished in June 26th hand over the CN-CVS2 website things	ISCSLP paper (7.7 20:00 ddl)
Xiaolou Li	LRS2 full test[4] Paper reading
Zehua Liu	NCMMSC papper change parameter result seems good ,(but still training)[5]
Pengqi Li	PhD mid-term assessment two NC-papers
Wan Lin
Tianhao Wang	Neural Scoring exps[6] share encoder channel attention (similar to EA-ASP, useless) early frequency attention (fbank level, training)
Zhenyu Zhou	Huawei projetc Summary of recent experimental results[7]
Junhui Chen	Neural Scoring supplementary experiments Share Encoder NS: NS > Share Encoder NS > EA-ASP (Importance of decoupling) Ways of attention (F-bank Enroll-Aware, seems useful)
Jiaying Wang	debug cohort transformer structure (confused why transformer does not work) deeper network: 2 attention head, 8 layer/block, 4 blocks in total(failed) use only MF training set (failed) use position encoding and transformer block in speechbrain(failed both pit and sisdr loss)
Yu Zhang	Implement R2SAC Retrain Huawei Quantization Model Paper reading
Wenqiang Du	Preparing for the final exam
Yang Wei	Huilan TTS Export ONNX model from original format. Still deal with inferring error.
Lily	Thesis Prepare slides for Xinjiang teacher's course
Turi	End of semester course project presentations
Yue Gu	try to fill the gap between CEM recall and utterance recall, then I want achieve more better performance
Qi Qu	AED Model tested on different data. Tried some other models, i.e. Zipformer from sherpa-onnx. KWS Data collected and processed to account for poor performance.

@@ 第42行： / 第42行： @@
 |Ying Shi
 ||
-*
+* Text enroll mix speech keyword spotting
+* Cohort ASR with conditional Chain [https://z1et6d3xtb.feishu.cn/docx/VXhcdlaLto5HT1xR8Fec5Dtsnxh?from=from_copylink here]
 ||
 *
@@ 第105行： / 第106行： @@
 ||
 * NCMMSC papper
-* change parameter
+* change parameter result seems good ,(but still training)[https://z1et6d3xtb.feishu.cn/docx/ZaTFd3A5EoK982xWBVschloanee?from=from_copylink]
 ||
 *
@@ 第248行： / 第249行： @@
 |Yue Gu
 ||
-*
+* try to fill the gap between CEM recall and utterance recall, then I want achieve more better performance
 ||
 *
@@ 第257行： / 第258行： @@
 |Qi Qu
 ||
-*
+* AED
+** Model tested on different data.
+** Tried some other models, i.e. Zipformer from sherpa-onnx.
+* KWS
+** Data collected and processed to account for poor performance.
 ||
 *

“2024-06-17”版本间的差异

2024年6月17日 (一) 12:59的最后版本

导航菜单

个人工具

名字空间

变种

查看

操作

搜索

导航

工具