“ASR Status Report 2017-9-4”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
(以“{| class="wikitable" !Date!!People !! Last Week !! This Week |- | rowspan="9"|2017.8.21 |Jiayin Cai || * || * |- |- |Xiaofei Kang || * Recording new audios from...”为内容创建页面)
 
 
(5位用户的8个中间修订版本未显示)
第2行: 第2行:
 
!Date!!People !! Last Week !! This Week
 
!Date!!People !! Last Week !! This Week
 
|-
 
|-
| rowspan="9"|2017.8.21
+
| rowspan="9"|2017.9.4
  
  
 
|Jiayin Cai
 
|Jiayin Cai
 
||
 
||
*
+
*Finished the phonetic i-vector experiment.
 
||
 
||
*
+
*get BN feature and train i-vector LID.
 +
*Get phonetic feat from a stronger phonetic network
 +
*combine PTN and phonetic i-vector.
 
|-
 
|-
  
第16行: 第18行:
 
|Xiaofei Kang
 
|Xiaofei Kang
 
||  
 
||  
* Recording new audios from 38 person, located in /work7/tanghui/kangxf/workspaces/speaker/wavdata/V2.0
+
* cutting audio and marking:21 speakers,a total of 1050 sentences
* Improve the test website to judge before committing
+
* Finish the new speaker recognition using the two recordings.
 
||  
 
||  
* Test the new recording。
+
* improve the human Test website
 
|-
 
|-
  
第26行: 第28行:
 
|Miao Zhang
 
|Miao Zhang
 
||  
 
||  
*  
+
* Absent
 
||  
 
||  
*  
+
* Perform human test on 21-style speech(add the disguise)
 +
* Draw spectrums and t-SNE plots compared with experiment results
 
|-
 
|-
  
第35行: 第38行:
 
|Yanqing Wang
 
|Yanqing Wang
 
||  
 
||  
* pruning the connections and refining, [http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=tangzy&step=view_request&cvssid=626 results]
+
* Absent.
 
||
 
||
* Absent.
+
*
 
|-
 
|-
  
第44行: 第47行:
 
|Ying Shi   
 
|Ying Shi   
 
||  
 
||  
* check toolkit code
+
* multi decodeing ASR model
* multilingual baseline system
+
* multi decodeing with fake Lid [http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=shiying&step=view_request&cvssid=627 here]
 +
* read code about TTS
 
||  
 
||  
* train language id model
+
* employ group softmax to train multi decoding ASR model
* use Lid to do multi-decoding
+
* synthesis one 'real' speech
* some experiments for zhiyong zhang about TTS
+
 
|-
 
|-
  
第56行: 第59行:
 
|Yixiang Chen   
 
|Yixiang Chen   
 
||  
 
||  
*  
+
* Absent.
 
||  
 
||  
 
*  
 
*  
第65行: 第68行:
 
|Lantian Li   
 
|Lantian Li   
 
||  
 
||  
* Attend IS2017.
+
* Go on speaker segmentation tasks, see [http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=lilt&step=view_request&cvssid=615 here]
 +
** Dimensionality reduction.
 +
** Clustering.
 +
** Visualization.
 
||
 
||
* Go on speaker segmentation tasks.
+
* Phonetic-aware speaker segmentation.
 
|-
 
|-
  
第74行: 第80行:
 
|Zhiyuan Tang  
 
|Zhiyuan Tang  
 
||  
 
||  
* several indicators for VV scoring system, see [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/a/a1/VV_scoring.pdf].
+
* more indicators for VV scoring system, see [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/a/a1/VV_scoring.pdf].
 
||
 
||
 
* more indicators, a demo with Shuai.
 
* more indicators, a demo with Shuai.

2017年9月4日 (一) 05:22的最后版本

Date People Last Week This Week
2017.9.4


Jiayin Cai
  • Finished the phonetic i-vector experiment.
  • get BN feature and train i-vector LID.
  • Get phonetic feat from a stronger phonetic network
  • combine PTN and phonetic i-vector.
Xiaofei Kang
  • cutting audio and marking:21 speakers,a total of 1050 sentences
  • Finish the new speaker recognition using the two recordings.
  • improve the human Test website
Miao Zhang
  • Absent
  • Perform human test on 21-style speech(add the disguise)
  • Draw spectrums and t-SNE plots compared with experiment results
Yanqing Wang
  • Absent.
Ying Shi
  • multi decodeing ASR model
  • multi decodeing with fake Lid here
  • read code about TTS
  • employ group softmax to train multi decoding ASR model
  • synthesis one 'real' speech
Yixiang Chen
  • Absent.
Lantian Li
  • Go on speaker segmentation tasks, see here
    • Dimensionality reduction.
    • Clustering.
    • Visualization.
  • Phonetic-aware speaker segmentation.
Zhiyuan Tang
  • more indicators for VV scoring system, see [1].
  • more indicators, a demo with Shuai.
  • toolbook writing.




Date People Last Week This Week
2017.8.21 Xiaofei Kang
  • Recording new audios from 38 person, located in /work7/tanghui/kangxf/workspaces/speaker/wavdata/V2.0
  • Improve the test website to judge before committing
  • Test the new recording。
Miao Zhang
Yanqing Wang
  • pruning the connections and refining, results
  • Absent.
Ying Shi
  • check toolkit code
  • multilingual baseline system
  • train language id model
  • use Lid to do multi-decoding
  • some experiments for zhiyong zhang about TTS
Yixiang Chen
Lantian Li
  • Attend IS2017.
  • Go on speaker segmentation tasks.
Zhiyuan Tang
  • several indicators for VV scoring system, see [2].
  • more indicators, a demo with Shuai.
  • toolbook writing.