“ASR Status Report 2017-6-5”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
 
(3位用户的8个中间修订版本未显示)
第8行: 第8行:
 
|Hui Tang  
 
|Hui Tang  
 
||  
 
||  
*  
+
*created a database for speech overlap
 +
* experiments with speech overlap
 +
[http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=tanghui&step=view_request&cvssid=610 here]
 
||  
 
||  
*  
+
* align data before overlap
 +
* test in lstm,tdnn
 
|-
 
|-
  
第17行: 第20行:
 
|Yanqing Wang
 
|Yanqing Wang
 
||  
 
||  
*  
+
* [http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=wangyanqing&step=view_request&cvssid=609 sparse dnn with different threshold]
 
||  
 
||  
*  
+
* continue the experiment
 
|-
 
|-
  
第26行: 第29行:
 
|Ying Shi   
 
|Ying Shi   
 
||  
 
||  
* Ten languages i-vector L-vector + svm performance
+
* mix-lingual structure (without LID)
 +
** the performance of thch30 is bad than baseline 24.84
 +
** the performance of kazak is better than baseline 24.37
 +
** the performance of uyghur is very bad 109.01
 +
*result analysis
 +
** pdf mix
 +
** the language model of uyghur is too weak
 
||  
 
||  
* cough laugh hello data speaker recognition
+
* work for APSIPA2017
 
|-
 
|-
  
第35行: 第44行:
 
|Yixiang Chen   
 
|Yixiang Chen   
 
||  
 
||  
*  
+
* OLR17(Ten lans) i-vector / L-vector + svm baseline
 +
* "cough, laugh, hello" data for speaker identification
 
||  
 
||  
*  
+
* go on the two tasks.
 
|-
 
|-
  
第44行: 第54行:
 
|Lantian Li   
 
|Lantian Li   
 
||  
 
||  
*  
+
* End-to-end ASV [http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=lilt&step=view_request&cvssid=611]
 +
* Deep speaker feature learning.
 
||  
 
||  
*  
+
* Speaker diarization.
 
|-
 
|-
  

2017年6月5日 (一) 04:25的最后版本

Date People Last Week This Week
2017.6.5


Hui Tang
  • created a database for speech overlap
  • experiments with speech overlap

here

  • align data before overlap
  • test in lstm,tdnn
Yanqing Wang
  • continue the experiment
Ying Shi
  • mix-lingual structure (without LID)
    • the performance of thch30 is bad than baseline 24.84
    • the performance of kazak is better than baseline 24.37
    • the performance of uyghur is very bad 109.01
  • result analysis
    • pdf mix
    • the language model of uyghur is too weak
  • work for APSIPA2017
Yixiang Chen
  • OLR17(Ten lans) i-vector / L-vector + svm baseline
  • "cough, laugh, hello" data for speaker identification
  • go on the two tasks.
Lantian Li
  • End-to-end ASV [1]
  • Deep speaker feature learning.
  • Speaker diarization.
Zhiyuan Tang
  • basic recipes for OLR17 and MixASR17
  • improve the recipes and publish


Date People Last Week This Week
2017.5.31


Hui Tang
  • after introducing beamforming, zhiyong lets me learn something about how to sound localization
  • Create a database for overlapping speech
Yanqing Wang
Ying Shi
  • work for Apsipa17
    • thchs baseline
    • kazakh baseline
    • uyghur baseline
    • 3 language merge GMM baseline
  • work for Apsipa17
  • graduation thesis
Yixiang Chen
Lantian Li
  • Submit the Interspeech paper.
  • Deep speaker feature learning with a larger ASR model.
  • Prepare the end-to-end ASV.
  • End-to-end ASV.
Zhiyuan Tang
  • Experiments for OC17 mix-asr challenge,baseline done.
  • Spk feature as LID input.
  • Data preparation for AP17 orl challenge.
  • Two challenges done.