|
|
| 第106行: |
第106行: |
| | * Incorporate a "learn-not-to-listen" mechanism into MT-HuBERT and retrain the backbone. (in progress) | | * Incorporate a "learn-not-to-listen" mechanism into MT-HuBERT and retrain the backbone. (in progress) |
| | ** At 425K steps, PR(PER%): 8.18%, ASR(WER%): 9.32%, SD(DER%): 5.05% | | ** At 425K steps, PR(PER%): 8.18%, ASR(WER%): 9.32%, SD(DER%): 5.05% |
| − | * prepare the slides of the seminar
| |
| | * middle school AI textbook checking(picture, table) | | * middle school AI textbook checking(picture, table) |
| | || | | || |
| People |
This Week |
Next Week |
Task Tracking (DeadLine)
|
| Dong Wang
|
- Recheck AI textbooks for primary and middle schools.
|
|
|
| Lantian Li
|
|
|
|
| Wenqiang Du
|
- Statistics the accounts for three companies, our laboratory and AIGE.
- Check AI textbooks
|
|
|
| Yang Wei
|
|
|
|
| Ying Shi
|
- Thesis
- Huawei multi-talker ASR project(Mandarin 2-mix test with FunASR-Nano-800M)
- ground-truth WER 3.89%
- separated WER 12.11%
|
|
|
| Yue Gu
|
- write my Phd thesis
- the seminar
|
|
|
| Lily
|
- Participated in reviewing the AI handbooks (primary/middle/high school)
- Compiled course materials for the AI Handbook (Senior Edition)
|
|
|
| Pengqi Li
|
- Writing the Experiments section.
- Focus on hard-to-explain results by formulating hypotheses and conducting deeper analysis.
|
|
|
| Junming Yuan
|
- Aug-MT-HuBERT(failed)
- Continued pre-training for 600K steps. there is still no improvement observed on clean-speech tasks.
- Aug-MT-HuBERT has more substitution errors in clean ASR adaptation.
- Incorporate a "learn-not-to-listen" mechanism into MT-HuBERT and retrain the backbone. (in progress)
- At 425K steps, PR(PER%): 8.18%, ASR(WER%): 9.32%, SD(DER%): 5.05%
- middle school AI textbook checking(picture, table)
|
|
|
| Yu Zhang
|
- GPU Util: [1]
- LLM
- Rewrite the graph connectivity of Swarm MMLU (the original topology does not align with our expectations).
- Add context segmentation logic to the ECS computation, so that we can trace which preceding node a given text segment originates from.
|
|
|
| Junhui Chen
|
- LLM
- Get swarm crosswords experiment with metric detection working
- Implement multi-process of LLM instances in experiments
- p.s.: try multi-threading, fails because a single Python interpreter only initializes one CUDA context, causing resource contention between different instances.
|
|
|
| Jiaying Wang
|
|
|
|
| Xiaoxue Luo
|
- 2-5mix multi_head separation model for Huawei project [2]
- prepare 2mix speech separation results for ASR test
|
|
|
| Bochao Hu
|
- read some papers
- refactor P2S code, using ms-swift and adding n-best seq to input.
|
|
|
| Hongcheng Zhang
|
- Complete the training of short audio descriptions for AudioSet-SL in WavCaps
- METEOR: 0.3373, ROUGE-L: 0.3121, BLEU-4 : 0.0769, CIDEr : 0.6207
|
|
|
| Weiman Sun
|
- Label audioset vedio dataset
- Analyze the representation of MT-LLM inference and review relevant literature
|
|
|