|
|
| 第103行: |
第103行: |
| | |Wan Lin | | |Wan Lin |
| | || | | || |
| − | * | + | * Writing research proposal |
| | + | * Some Experiments [https://z1et6d3xtb.feishu.cn/wiki/NJLtw3l1Xi2xQBkUpCbcRic9nOh?from=from_copylink] |
| | || | | || |
| | * | | * |
| People |
This Week |
Next Week |
Task Tracking (DeadLine)
|
| Dong Wang
|
|
|
|
| Lantian Li
|
- Review AI Book — College Edition (12/53).
- Project matters: HUAWEI SS/AutoBGM; FYT A/V GenreDetect.
- Prepare materials for my professional title evaluation.
|
|
|
| Ying Shi
|
|
|
|
| Zhenghai You
|
- Multi-Step Infer (multi-task search)[1]
- Huawei 4-mix SS model training
|
|
|
| Junming Yuan
|
- Grade 8 AI practice handbook(done!)
- Further evaluation of our MT-HuBERT model on other speech downstream tasks.
- Based on SUPERB benchmark(Speech processing Universal Performance Benchmark)
- Firstly focused on Source Separation downstream task(still in training)
- Intermediate results(SI-SDRi at 50K training steps): MT-HuBERT: 10.77, HuBERT-BASE: 9.84.
|
|
|
| Xiaolou Li
|
|
|
|
| Zehua Liu
|
|
|
|
| Pengqi Li
|
- Science popularization activities in Changzhi, Shanxi, and returned to the lab on tomorrow.
|
|
|
| Wan Lin
|
- Writing research proposal
- Some Experiments [2]
|
|
|
| Tianhao Wang
|
- writing my paper (method and experiments is almost done)
- job interviews
|
|
|
| Xiaoxue Luo
|
- read some papers on speech separation of unknown number of speakers
- Apply EDA(encoder-decoder based attractor calculation) method to speech separation
- Environment Configuration(done)
- Familiar with the code and make some adjustments to it(in progress)
|
|
|
| Junhui Chen
|
- LLM:
- 2 different reflection paper reading
- Baseline self-reflection metric collection — code completed, data collection in progress.
- Integrating new reflection methods (e.g. beamsearch-based reflection) into the current pipeline in collaboration with @Zhang Yu — work in progress.
|
|
|
| Jiaying Wang
|
- construct loudness training data: librimix with different loudness source
- loudness order exp[3]:
- chain based structure: 2mix 11.92(150 epoch)
- convtasnet structure: still training, 60epoch achieve 12.29
- ctc order exp:
- only use ctc for order, chain based structure: 10.65
- a speculation: Compared with semantic information (provided by CTC), acoustic-oriented information may be more suitable as a basis for separation.
|
- try to use speaker info as order
|
|
| Yu Zhang
|
- LLM:
- Baseline self-reflection metric collection — code completed, data collection in progress.
- Integrating new reflection methods (e.g. beamsearch-based reflection) into the current pipeline in collaboration with @chenjunhui — work in progress.
- AED:
- Added humming test and analysis for Huawei.
|
|
|
| Wenqiang Du
|
|
|
|
| Yang Wei
|
- Train English MD model based on Whisper
|
|
|
| Yue Gu
|
- polish the structure of my thesis and continue find jobs
|
|
|
| Qi Qu
|
- Elderly alarm customization.
- SLAM (simultaneous localization & mapping) experiments.
|
|
|