“2024-04-29”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
 
(19位用户的21个中间修订版本未显示)
第6行: 第6行:
 
|Dong Wang
 
|Dong Wang
 
||  
 
||  
*  
+
* Presentation for several public AI promotion
 +
* Primary AI, Grade 3 (1), 8 chapters done
 +
* Some review on linguistic literature about perception uncertainty in pronunciation assessment
 +
* New form of text enrollment and speech fine-tuning to account for accent-based KWS.
 +
 
 
||
 
||
 
*
 
*
第17行: 第21行:
 
|Lantian Li
 
|Lantian Li
 
||
 
||
*  
+
* GPU status [https://z1et6d3xtb.feishu.cn/wiki/XGcGwRK5viJmpRkjH9AczIhynCh]
 +
* Projects (AED, TSE)
 +
* ASIP-BUPT (NeuralScoring, CohortSS)
 +
* BlockChain Courses
 
||
 
||
 
*  
 
*  
第28行: 第35行:
 
|Ying Shi
 
|Ying Shi
 
||  
 
||  
*
+
* DI-TING structure verify [https://z1et6d3xtb.feishu.cn/docx/QldZdNqGdoW8QDxTdFnc0L6Qntc?from=from_copylink here]
||
+
||  
 
*  
 
*  
 
||
 
||
第49行: 第56行:
 
|Junming Yuan
 
|Junming Yuan
 
||  
 
||  
* AI  
+
* AI Graph slides refinement
 
* IS24 rebuttal
 
* IS24 rebuttal
 
* Control FA experiment baseline result[https://z1et6d3xtb.feishu.cn/docx/Ua0cdv3ano0qHoxN8YvcmRsVn9f]
 
* Control FA experiment baseline result[https://z1et6d3xtb.feishu.cn/docx/Ua0cdv3ano0qHoxN8YvcmRsVn9f]
第62行: 第69行:
 
|Chen Chen
 
|Chen Chen
 
||  
 
||  
*  
+
* prepare CNVSRC2024 Baseline system
 +
* vii group [https://z1et6d3xtb.feishu.cn/docx/EFOzdvhjwohMqLx4Kpice4frnjg?from=from_copylink]
 
||
 
||
*  
+
* Rebuttal
 
||
 
||
 
*   
 
*   
第73行: 第81行:
 
|Xiaolou Li
 
|Xiaolou Li
 
||  
 
||  
*  
+
* CNVSRC2024 baseline training
 +
* IS24 Rebuttal
 
||
 
||
 
*  
 
*  
第84行: 第93行:
 
|Zehua Liu
 
|Zehua Liu
 
||  
 
||  
 +
*AKVSR code
 +
*data crop
 +
*IS24 Rebutall
 +
||
 
*
 
*
||
 
*
 
 
||
 
||
 
*   
 
*   
第107行: 第118行:
 
|Wan Lin
 
|Wan Lin
 
||  
 
||  
*  
+
* Pre & QA for graduation paper
 +
* Neural Scoring [https://z1et6d3xtb.feishu.cn/docx/BywjdkGvNou12sxQ4dAcxYa9noh?from=from_copylink]
 
||
 
||
 
*
 
*
第118行: 第130行:
 
|Tianhao Wang
 
|Tianhao Wang
 
||  
 
||  
*  
+
* Neural Scoring [https://z1et6d3xtb.feishu.cn/docx/BywjdkGvNou12sxQ4dAcxYa9noh]
 +
* IS24 rebuttal
 
||
 
||
 
*  
 
*  
第129行: 第142行:
 
|Zhenyu Zhou
 
|Zhenyu Zhou
 
||  
 
||  
*
+
*Paper reading
 +
*IS24 rebuttal
 
||
 
||
 
*
 
*
第140行: 第154行:
 
|Junhui Chen
 
|Junhui Chen
 
||
 
||
*  
+
* Graduation paper
 +
* Neural Scoring: chunk 2s->4s, NS is better than EA-ASP
 
||
 
||
*
+
* Try to use large pretrain model as test utt encoder(wavLM, wav2vec2, etc.)
 
||
 
||
 
*   
 
*   
第151行: 第166行:
 
|Jiaying Wang
 
|Jiaying Wang
 
||  
 
||  
*  
+
* paper reading report
 +
* Huawei data collection
 +
* cohort test[https://z1et6d3xtb.feishu.cn/docx/IeIydyjzJozfUWxvmNfcWASBngg]
 
||
 
||
 
*  
 
*  
第162行: 第179行:
 
|Yu Zhang
 
|Yu Zhang
 
||
 
||
*  
+
* AutoML:
 +
** switched from FLAML to EVALML as it provided a tool chain better suited to our tasks
 
||
 
||
 
*
 
*
第173行: 第191行:
 
|Wenqiang Du
 
|Wenqiang Du
 
||  
 
||  
 +
* Hard negative training,FA data is ready,prepare to train the model
 
*  
 
*  
 
||
 
||
第184行: 第203行:
 
|Yang Wei
 
|Yang Wei
 
||  
 
||  
*  
+
* Children mispronunciation detection and diagnosis
 +
** Prepare baseline recipe and challenge document
 +
** Check the baseline model, due to the extraordinary CER performance.
 
||
 
||
 
*
 
*
第193行: 第214行:
 
|-
 
|-
 
|Lily
 
|Lily
 +
||
 +
* thesis
 +
* AIgraph slides
 
||
 
||
 
*  
 
*  
||
 
*
 
 
||
 
||
 
*   
 
*   
第204行: 第226行:
 
|Turi
 
|Turi
 
||
 
||
*  
+
* Data collection
 +
** 14K so far
 +
* Course work & paper reading
 
||
 
||
 
*
 
*
第212行: 第236行:
 
|Yue Gu
 
|Yue Gu
 
||
 
||
*  
+
* Semantic paraformer model reconstruction
 +
* Interspeech rebuttal
 +
||
 +
*
 +
||
 +
 +
|-
 +
|Qi Qu
 +
||
 +
* Performance test:
 +
** Two-phased KWS vs KWS + FunASR
 +
* Data processing:
 +
** ~100k FA segments collected out of ~6k hours
 +
** Data processing routine
 +
* Model training:
 +
** Fine-tuning w/ ~20k more FA
 
||
 
||
 
*
 
*

2024年4月29日 (一) 10:59的最后版本

People This Week Next Week Task Tracking (DeadLine)
Dong Wang
  • Presentation for several public AI promotion
  • Primary AI, Grade 3 (1), 8 chapters done
  • Some review on linguistic literature about perception uncertainty in pronunciation assessment
  • New form of text enrollment and speech fine-tuning to account for accent-based KWS.
Lantian Li
  • GPU status [1]
  • Projects (AED, TSE)
  • ASIP-BUPT (NeuralScoring, CohortSS)
  • BlockChain Courses
Ying Shi
  • DI-TING structure verify here
Zhenghai You
Junming Yuan
  • AI Graph slides refinement
  • IS24 rebuttal
  • Control FA experiment baseline result[2]
Chen Chen
  • prepare CNVSRC2024 Baseline system
  • vii group [3]
  • Rebuttal
Xiaolou Li
  • CNVSRC2024 baseline training
  • IS24 Rebuttal
Zehua Liu
  • AKVSR code
  • data crop
  • IS24 Rebutall
Pengqi Li
  • Leave of Absence
  • Read&Summary SA-XAI workshop(ICASSP) papers
  • Experiment(PID) on Timit(Extend workshop paper)
Wan Lin
  • Pre & QA for graduation paper
  • Neural Scoring [4]
Tianhao Wang
  • Neural Scoring [5]
  • IS24 rebuttal
Zhenyu Zhou
  • Paper reading
  • IS24 rebuttal
Junhui Chen
  • Graduation paper
  • Neural Scoring: chunk 2s->4s, NS is better than EA-ASP
  • Try to use large pretrain model as test utt encoder(wavLM, wav2vec2, etc.)
Jiaying Wang
  • paper reading report
  • Huawei data collection
  • cohort test[6]
Yu Zhang
  • AutoML:
    • switched from FLAML to EVALML as it provided a tool chain better suited to our tasks
Wenqiang Du
  • Hard negative training,FA data is ready,prepare to train the model
Yang Wei
  • Children mispronunciation detection and diagnosis
    • Prepare baseline recipe and challenge document
    • Check the baseline model, due to the extraordinary CER performance.
Lily
  • thesis
  • AIgraph slides
Turi
  • Data collection
    • 14K so far
  • Course work & paper reading
Yue Gu
  • Semantic paraformer model reconstruction
  • Interspeech rebuttal
Qi Qu
  • Performance test:
    • Two-phased KWS vs KWS + FunASR
  • Data processing:
    • ~100k FA segments collected out of ~6k hours
    • Data processing routine
  • Model training:
    • Fine-tuning w/ ~20k more FA