People |
This Week |
Next Week |
Task Tracking (DeadLine)
|
Dong Wang
|
- Three slides for AIGE to gov and enterprise.
|
|
|
Lantian Li
|
- Proofread of the high-school book (Done)
|
|
|
Ying Shi
|
- Prepare Ascend Sever environment
- training Conditional Chain overlap ASR model with Hierachical-Transformer here
|
|
|
Zhenghai You
|
- Training TSE model for with content enrollment(for Huawei & CSSC(中船) projects)
- Reading papers about refiner
|
|
|
Junming Yuan
|
- Finish MPC-HuBERT pretrain.
- Double check the related experimental code.
- MT-HuBERT(in progress) & Cocktail-HuBERT need re-pretrain.
- The results of other baseline in here
|
|
|
Xiaolou Li
|
- VSR training (1500h) cnvsrc-single valid 300 CER: 36.14% (not converged)
- Finish pre-processing 4000h data
- get ASR transcript for 4000h data
- Writing NSFC document
|
|
|
Zehua Liu
|
- Paper Reading and Sharing in last Friday
- Writing Vision Language Model code
- Writing NSFC document
|
|
|
Pengqi Li
|
- Prepare the AI course for Tsinghua University Junior High School.
- Using t-SNE to visualize the factorized content vector.
- Next step is to color(speaker information importance or not) each point.
|
|
|
Wan Lin
|
- try some adjustment for clean performance(no improvement)
- supply experiments for other tests
|
|
|
Tianhao Wang
|
- sound separation: 2-mix and 3-mix model training
- weekly report
|
|
|
Xiaoxue Luo
|
- generation of multi-mix audio data and did some test experiments.
- read papers
|
|
|
Zhenyu Zhou
|
|
|
|
Junhui Chen
|
- Reproducing speaker diarization method for NS (debugging...)
- read paper
|
|
|
Jiaying Wang
|
|
|
|
Yu Zhang
|
- AED:
- Split AED model into two smaller model to detect the human voice in noisy environments and in clean environments separately.
- Trying smaller model (under 200K)
- Multi Agent Investment
- try index enhancement trading, no obvious excess return
|
- try do portfolio investment on some selected big company
- add the debate topic about the logical consistency inside investment decisions.
|
|
Wenqiang Du
|
- Primary handbook's PPT (24/44)
- Continue to check Primary and middle handbook(Completed this week)
- Speech cloning sample for the company
|
|
|
Yang Wei
|
- Tuning text enroll kws model for dialect data with linear layer. (recall: 65%->85%->94%)
|
|
|
Turi
|
- Thesis writing
- Result with LM[2]
|
|
|
Yue Gu
|
- finish some exps, but nothing is improved.
- finish a proposal,I will present it recently
|
|
|
Qi Qu
|
- Applying pre-prod eval routine on text-enroll KWS models: the ideal thresholds for each keyword vary significantly. [3]
|
|
|