“2025-03-10”版本间的差异

2025年3月10日 (一) 10:46的版本

People	This Week	Next Week
Dong Wang	Revise AI textbook of the colleage version
Lantian Li
Ying Shi	Compare Ascend and Nvidia Performance: Clean ASR task 20epochs WER 6.91% : 7.02% (Ascend vs Nvidia) Speed: Nvidia is one time faster than Ascend Start think about my thesis
Zhenghai You
Junming Yuan	Pretraining work: MT-HuBERT & Cocktail-HuBERT will be finished next week. Get a set of comparable finetuning results(15/5/3-shot) for each pretrain model at the 400K training step.[1] Check and add reference for AI junior high school handbook(1/2).(Done)
Xiaolou Li
Zehua Liu	Writing NFSC document Lora finetune VLM(both Encoder and LLM Decoder) result seem not very well(maybe need parameeter adjustment) Pretrained VSR Encoder + VLM(Decoder) seems better than Normal LM	Design VTS architecture and implement it
Pengqi Li	Prepare the AI course for Tsinghua University Junior High School. Add references to the handbook(junior high school version 1/2)(Done).
Wan Lin
Tianhao Wang
Xiaoxue Luo
Zhenyu Zhou
Junhui Chen	speaker diarization baseline for NS (mix test: baseline EER 15.972% -> 12.983%) others still testing... make ppt about scaling law on speaker volume.
Jiaying Wang
Yu Zhang	Multi Agent Investment use Top 31 stocks in 11 sector to do portfolio for better correlation with input news (no excess return) analysis the trading decision Huawei AED smallest model to keep AUC excess 0.9 split inference into two phase (Phase 1: Human Voice vs None Human Voice, Phase 2: Speech vs Other Human Voice) with two smaller model
Wenqiang Du	Check Primary handbook V3.0（Done） Add reference(80%)
Yang Wei	Adapt text enroll kws model with synthesized dialect data.(recall: 83% -> 94%)[2]
Turi
Yue Gu	a 0.4% CER reduction has achieved for one spk, but no improvement was discovered on other spks. I'm still do some exps. restart the synthetic-data related exps, try to fill the gap between synthetic data and real data on the output distribution of model.
Qi Qu	Technical investigation on Visual Event Detection. Experiment on annotating and auditing audio with Audio LLM: insufficient VRAM; poor I/O in CPU/GPU hybrid mode.

@@ 第171行： / 第171行： @@
 |Yu Zhang
 ||
-*
+* Multi Agent Investment
+** use Top 31 stocks in 11 sector to do portfolio for better correlation with input news (no excess return)
+** analysis the trading decision
+* Huawei AED
+** smallest model to keep AUC excess 0.9
+** split inference into two phase (Phase 1: Human Voice vs None Human Voice, Phase 2: Speech vs Other Human Voice) with two smaller model
 ||
 *
 ||
 *

“2025-03-10”版本间的差异

2025年3月10日 (一) 10:46的版本

导航菜单

个人工具

名字空间

变种

查看

操作

搜索

导航

工具