“2025-09-29”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
第151行: 第151行:
 
|Jiaying Wang
 
|Jiaying Wang
 
||
 
||
*
+
* construct loudness training data: librimix with different loudness source
 +
* loudness order exp:
 +
** chain based structure: 2mix 11.92(150 epoch)
 +
** convtasnet structure: still training, 60epoch achieve 12.29
 +
* ctc order exp:
 +
** only use ctc for order, chain based structure: 10.65
 +
* a speculation: Compared with semantic information (provided by CTC), acoustic-oriented information may be more suitable as a basis for separation.
 +
 
 
||
 
||
*
+
* try to use speaker info as order
 
||
 
||
 
*
 
*

2025年9月29日 (一) 10:29的版本

People This Week Next Week Task Tracking (DeadLine)
Dong Wang
Lantian Li
  • Review AI Book — College Edition (12/53).
  • Project matters: HUAWEI SS/AutoBGM; FYT A/V GenreDetect.
  • Prepare materials for my professional title evaluation.
Ying Shi
  • Thesis
  • revise resume
Zhenghai You
Junming Yuan
  • Grade 8 AI practice book(done!)
  • SUPERB benchmark(Speech processing Universal Performance Benchmark)
    • Source Separation downstream task(SI-SDRi at 50K training steps):
      • MT-HuBERT: 10.77, HuBERT-BASE: 9.84
Xiaolou Li
Zehua Liu
Pengqi Li
Wan Lin
Tianhao Wang
Xiaoxue Luo
  • read some papers on speech separation of unknown number of speakers
  • Apply EDA(encoder-decoder based attractor calculation) method to speech separation
    • Environment Configuration(done)
    • Familiar with the code and make some adjustments to it(in progress)
Junhui Chen
  • LLM:
    • 2 different reflection paper reading
    • Baseline self-reflection metric collection — code completed, data collection in progress.
    • Integrating new reflection methods (e.g. beamsearch-based reflection) into the current pipeline in collaboration with @Zhang Yu — work in progress.
Jiaying Wang
  • construct loudness training data: librimix with different loudness source
  • loudness order exp:
    • chain based structure: 2mix 11.92(150 epoch)
    • convtasnet structure: still training, 60epoch achieve 12.29
  • ctc order exp:
    • only use ctc for order, chain based structure: 10.65
  • a speculation: Compared with semantic information (provided by CTC), acoustic-oriented information may be more suitable as a basis for separation.
  • try to use speaker info as order
Yu Zhang
  • LLM:
    • Baseline self-reflection metric collection — code completed, data collection in progress.
    • Integrating new reflection methods (e.g. beamsearch-based reflection) into the current pipeline in collaboration with @chenjunhui — work in progress.
  • AED:
    • Added humming test and analysis for Huawei.
Wenqiang Du
Yang Wei
Yue Gu
  • polish the structure of my thesis and continue find jobs
Qi Qu