|
|
| 第151行: |
第151行: |
| | |Jiaying Wang | | |Jiaying Wang |
| | || | | || |
| − | * | + | * construct loudness training data: librimix with different loudness source |
| | + | * loudness order exp: |
| | + | ** chain based structure: 2mix 11.92(150 epoch) |
| | + | ** convtasnet structure: still training, 60epoch achieve 12.29 |
| | + | * ctc order exp: |
| | + | ** only use ctc for order, chain based structure: 10.65 |
| | + | * a speculation: Compared with semantic information (provided by CTC), acoustic-oriented information may be more suitable as a basis for separation. |
| | + | |
| | || | | || |
| − | * | + | * try to use speaker info as order |
| | || | | || |
| | * | | * |
| People |
This Week |
Next Week |
Task Tracking (DeadLine)
|
| Dong Wang
|
|
|
|
| Lantian Li
|
- Review AI Book — College Edition (12/53).
- Project matters: HUAWEI SS/AutoBGM; FYT A/V GenreDetect.
- Prepare materials for my professional title evaluation.
|
|
|
| Ying Shi
|
|
|
|
| Zhenghai You
|
|
|
|
| Junming Yuan
|
- Grade 8 AI practice book(done!)
- SUPERB benchmark(Speech processing Universal Performance Benchmark)
- Source Separation downstream task(SI-SDRi at 50K training steps):
- MT-HuBERT: 10.77, HuBERT-BASE: 9.84
|
|
|
| Xiaolou Li
|
|
|
|
| Zehua Liu
|
|
|
|
| Pengqi Li
|
|
|
|
| Wan Lin
|
|
|
|
| Tianhao Wang
|
|
|
|
| Xiaoxue Luo
|
- read some papers on speech separation of unknown number of speakers
- Apply EDA(encoder-decoder based attractor calculation) method to speech separation
- Environment Configuration(done)
- Familiar with the code and make some adjustments to it(in progress)
|
|
|
| Junhui Chen
|
- LLM:
- 2 different reflection paper reading
- Baseline self-reflection metric collection — code completed, data collection in progress.
- Integrating new reflection methods (e.g. beamsearch-based reflection) into the current pipeline in collaboration with @Zhang Yu — work in progress.
|
|
|
| Jiaying Wang
|
- construct loudness training data: librimix with different loudness source
- loudness order exp:
- chain based structure: 2mix 11.92(150 epoch)
- convtasnet structure: still training, 60epoch achieve 12.29
- ctc order exp:
- only use ctc for order, chain based structure: 10.65
- a speculation: Compared with semantic information (provided by CTC), acoustic-oriented information may be more suitable as a basis for separation.
|
- try to use speaker info as order
|
|
| Yu Zhang
|
- LLM:
- Baseline self-reflection metric collection — code completed, data collection in progress.
- Integrating new reflection methods (e.g. beamsearch-based reflection) into the current pipeline in collaboration with @chenjunhui — work in progress.
- AED:
- Added humming test and analysis for Huawei.
|
|
|
| Wenqiang Du
|
|
|
|
| Yang Wei
|
|
|
|
| Yue Gu
|
- polish the structure of my thesis and continue find jobs
|
|
|
| Qi Qu
|
|
|
|