“ASR Status Report 2016-12-26”版本间的差异

2017年1月4日 (三) 07:28的最后版本

Date	People	Last Week	This Week
2016.12.26	Jingyi Lin	Learn and make Dr.Wang's personal web page. Prepare for the CSLT's Annual Meeting.	Finish Dr.Wang's personal web page. Take photos for menmbers in CSLT.
	Yanqing Wang	implement the detection mechanism by socket find best parameters to avoid over-fitting add two-class-SVM to the program make GUI more pretty and easy to use improve the program's robustness screenshot: data sender data analyzer	write a document on the program
	Hang Luo	Run joint training and write systemic script and documents	Finish joint training documents Conclude joint training experiments result Make a review on mixlingual
	Ying Shi	crawl corpus from internet.(I don't know whether the corpus is right or not.......) make new LM(complete) train new AM(complete) a part of TRP	finish the TRP
	Yixiang Chen	Prepare the input of speech data (trick of block segmentation) Complete the init version on max-margin SRE. Write TRP-20160012 "基于Kaldi i-vector的说话人识别系统使用说明"[delivery].	Prepare the thesis proposal. Integrate CNN + max-margin.
	Lantian Li	Deep speaker embedding Prepare two datasets and make the i-vector baselines. Write TRP-20160012 "基于Kaldi i-vector的说话人识别系统使用说明"[delivery]. Write book of robustness SRE. Wechat open account.	Deep speaker embedding. Write book. Replay detection on INTERSPEECH chanllenge.
	Zhiyuan Tang	TRP of "How to Config Kaldi nnet3 (in Chinese)", not finished yet; outline of TRP for "Multi-task Recurrent Model for True Multilingual Speech Recognition"; Generative models, part of Chapter Deep Learning.	Finish the above 3 writings.

Date	People	Last Week	This Week
2016.12.19	Jingyi Lin	--	Concentrate on checking the cslt.book. Prepare for the annual convention.
	Yanqing Wang	build a data sender ( read & generate txt files of distracted feature ) build a data analyzer ( detect the modification of files and make response ( show tokens ) ) screenshot: distraction focus	(maybe) replace the detection mechanism by socket find best parameters to avoid over-fitting add two-class SVM to the program make GUI more pretty and easy to use
	Hang Luo	Compare decode result between mono and bi LM, and the decode result ues bi LM before and after joint Choose wrong decode sentence and find its difference between baseline and shareGMM baseline Finished ML book	Continue joint training analysis work, but I'm very confused about how to improve
	Ying Shi	some work about kazak lm crawl data from kazak internet	run new AM by current speech data get more corpus from internet use current corpus make LM and decode
	Yixiang Chen	Leanring tensorflow coding pair wise net use tensorflow alter CNN	coding CNN connect pair wise Dealing with the issue of different lengths of voice
	Lantian Li	LRE challenge on AP16-OL7. Jeju for APSIPA16.	LRE on AP16-OL7. Deep speaker embedding.
	Zhiyuan Tang	Jeju for APSIPA16.	A speech about recent ASR improvements. A supplementary TRP for "Multi-task Recurrent Model for True Multilingual Speech Recognition".

@@ 第10行： / 第10行： @@
 |Jingyi Lin
 ||
-*
+* Learn and make Dr.Wang's personal web page.
+* Prepare for the CSLT's Annual Meeting.
 ||
-*
+* Finish Dr.Wang's personal web page.
+* Take photos for menmbers in CSLT.
 |-
@@ 第19行： / 第21行： @@
 |Yanqing Wang
 ||
-*
+* implement the detection mechanism by socket
+* find best parameters to avoid over-fitting
+* add two-class-SVM to the program
+* make GUI more pretty and easy to use
+* improve the program's robustness
+* screenshot:
+** [[媒体文件:dataSender.png|data sender]]
+**[[媒体文件:dataAnalyser.png|data analyzer]]
 ||
-*
+* write a document on the program
 |-
@@ 第30行： / 第39行： @@
 |Hang Luo
 ||
-*
+* Run joint training and write systemic script and documents
 ||
-*
+* Finish joint training documents
+* Conclude joint training experiments result
+* Make a review on mixlingual
 |-
@@ 第39行： / 第50行： @@
 |Ying Shi
 ||
-*
+* crawl corpus from internet.(I don't know whether the corpus is right or not.......)
+* make new LM(complete)
+* train new AM(complete)
+* a part of TRP
 ||
-*
+* finish the TRP
 |-
@@ 第49行： / 第63行： @@
 |Yixiang Chen
 ||
-*
+* Prepare the input of speech data (trick of block segmentation)
+* Complete the init version on max-margin SRE.
+* Write TRP-20160012 "基于Kaldi i-vector的说话人识别系统使用说明"'''[delivery]'''.
 ||
-*
+* Prepare the thesis proposal.
+* Integrate CNN + max-margin.
 |-
@@ 第58行： / 第75行： @@
 |Lantian Li
 ||
-*
+* Deep speaker embedding
+** Prepare two datasets and make the i-vector baselines.
+* Write TRP-20160012 "基于Kaldi i-vector的说话人识别系统使用说明"'''[delivery]'''.
+* Write book of robustness SRE.
+* Wechat open account.
 ||
-*
+* Deep speaker embedding.
+* Write book.
+* Replay detection on INTERSPEECH chanllenge.
 |-
@@ 第67行： / 第90行： @@
 |Zhiyuan Tang
 ||
-*
+* TRP of "How to Config Kaldi nnet3 (in Chinese)", not finished yet;
+* outline of TRP for "Multi-task Recurrent Model for True Multilingual Speech Recognition";
+* Generative models, part of Chapter Deep Learning.
 ||
-*
+* Finish the above 3 writings.
 |-
 |}
 -------------------------------
@@ 第82行： / 第110行： @@
 !Date!!People !! Last Week !! This Week
 |-
-| rowspan="6"|2016.12.19
+| rowspan="7"|2016.12.19
+|Jingyi Lin
+||
+* --
+||
+* Concentrate on checking the cslt.book.
+* Prepare for the annual convention.
+|-
+|-
 |Yanqing Wang
 ||
-*  read a paper about driving distraction detection task
+* build a data sender ( read & generate txt files of distracted feature )
+* build a data analyzer ( detect the modification of files and make response ( show tokens ) )
+* screenshot:
+**[[媒体文件:GUI_distracted.png|distraction]]
+**[[媒体文件:GUI_focus.png|focus]]
 ||
-*   show normal/distraction patterns of a driver with one class and two class SVM
+* (maybe) replace the detection mechanism by socket
+* find best parameters to avoid over-fitting
+* add two-class SVM to the program
+* make GUI more pretty and easy to use
 |-
@@ 第98行： / 第144行： @@
 |Hang Luo
 ||
-* Compared mono-language model and bi-language model decode result.
+* Compare decode result between mono and bi LM, and the decode result ues bi LM before and after joint
-* Read paper of WFST.
+* Choose wrong decode sentence and find its difference between baseline and shareGMM baseline
+* Finished ML book
 ||
-* Use different corpus or generate mix-lingual corpus to run experiments
+* Continue joint training analysis work, but I'm very confused about how to improve
 |-
@@ 第108行： / 第155行： @@
 |Ying Shi
 ||
-* work from Chao Xing down
+* some work about kazak lm
-* kazak lm
+* crawl data from kazak internet
-** got some corpus from a student who study in Minzu University of China.But the corpus is short (about 10000) so the ppl is also poor.
-** spider
 ||
-* kazak lm
+* run new AM by current speech data
+* get more corpus from internet
+* use current corpus make LM and decode
 |-
@@ 第121行： / 第168行： @@
 |Yixiang Chen
 ||
-*  Complete the replay task experiment and report
+* Leanring tensorflow
+* coding pair wise net use tensorflow
+* alter CNN
 ||
-*  learning tensorflow coding DNN and CNN net
+* coding CNN connect pair wise
+* Dealing with the issue of different lengths of voice
 |-
@@ 第130行： / 第180行： @@
 |Lantian Li
 ||
-* interim report done;
-* PPT for APSIPA16;
 * LRE challenge on AP16-OL7.
-* Deep speaker embedding restart!
-* Submit TRP-20160011 on Replay detection.
-||
 * Jeju for APSIPA16.
+||
+* LRE on AP16-OL7.
+* Deep speaker embedding.
 |-
@@ 第142行： / 第190行： @@
 |-
 |Zhiyuan Tang
-||
-* interim report done;
-* PPT for APSIPA16;
-* language mask[http://192.168.0.51:5555/cgi-bin/cvss/cvss_request.pl?account=tangzy&step=view_request&cvssid=536]
 ||
 * Jeju for APSIPA16.
+||
+* A speech about recent ASR improvements.
+* A supplementary TRP for "Multi-task Recurrent Model for True Multilingual Speech Recognition".
 |-
 |}

“ASR Status Report 2016-12-26”版本间的差异

2017年1月4日 (三) 07:28的最后版本

导航菜单

个人工具

名字空间

变种

查看

操作

搜索

导航

工具