2015年7月27日 (一) 06:43的最后版本

Speech Processing

AM development

Environment

grid-14 is on repairation
prepare to buy a server

RNN AM

hold
morpheme RNN --zhiyuan
train using 1400h large dataset--mengyuan
write code to tune learning rate--mengyuan

Mic-Array

hold
compute EER with kaldi

====Data selection unsupervised learning

hold
acoustic feature based submodular using Pinan dataset --zhiyong
write code to speed up --zhiyong

RNN-DAE(Deep based Auto-Encode-RNN)

hold
deliver to mengyuan

http://cslt.riit.tsinghua.edu.cn/cgi-bin/cvss/cvss_request.pl?account=zhangzy&step=view_request&cvssid=261

Speaker ID

DNN-based sid --Lantian

http://cslt.riit.tsinghua.edu.cn/cgi-bin/cvss/cvss_request.pl?account=zhangzy&step=view_request&cvssid=327

Ivector&Dvector based ASR

hold --Tian Lan
Cluster the speakers to speaker-classes, then using the distance or the posterior-probability as the metric
dark-konowlege using i-vector
train on wsj(testbase dev93+evl92)

--hold

language vector

hold
train using language vector with the dataset of 1400h_CN + 100h_EN--mengyuan

hold

write a paper--zhiyuan
RNN language vector
train as a paper--xuewei

rectifier

hold
rectifier RNN --zhiyuan

monophone

hold
triphone is tranfered to monophone

Text Processing

RNN LM

character-lm rnn(hold)
lstm+rnn

check the lstm-rnnlm code about how to Initialize and update learning rate.(hold)

Neural Based Document Classification

(hold)

RNN Rank Task

(hold)

RNN Word Segment

(hold)

Seq to Seq(09-15)

Review papers.
Reproduce baseline. (08-03)

Order representation

Nested Dropout

semi-linear --> neural based auto-encoder.

modify the objective function(hold)

Balance Representation

Find error signal

Recommendation

Reproduce baseline.

LDA matrix dissovle.
LDA (Text classification & Recommendation System) --> AAAI

DSSM based QA

Demo Release.(English done.)

Chinese Model start.

RNN based QA

Read Source Code.

Text Group Intern Project

Buddhist Process

(hold)

RNN Poem Process

Read Paper & Source Code.

RNN Document Vector

(hold)

Image Baseline

Demo Release.
Paper Report.

Read CNN Paper.

financial group

world quant

websim(done)

learn the websim and test several alpha
submit the alpha

tonglian platform

learn the platform

test the alpha in tonglian platform
verify the Theano in tonglian

strategy

ml strategy

ml method

optimize the strategy

optimize the model

classical strategy

@@ 第3行： / 第3行： @@
 ==== Environment ====
-* grid-14 is on reparation
+* grid-14 is on repairation
 * prepare to buy a server
@@ 第11行： / 第11行： @@
 *morpheme RNN --zhiyuan
 *train using 1400h large dataset--mengyuan
+*write code to tune learning rate--mengyuan
 ==== Mic-Array ====
@@ 第26行： / 第27行： @@
 * deliver to mengyuan
 :* http://cslt.riit.tsinghua.edu.cn/cgi-bin/cvss/cvss_request.pl?account=zhangzy&step=view_request&cvssid=261
 ===Speaker ID===
 *  DNN-based sid --Lantian
@@ 第39行： / 第40行： @@
 :*--hold
-===Dark knowledge===
+===language vector===
 * hold
-* test random last output layer when train MPE --zhiyuan,mengyuan
-===language vector===
 * train using language vector with the dataset of 1400h_CN + 100h_EN--mengyuan
 :* hold
 * write a paper--zhiyuan
+* RNN language vector
+* train as a paper--xuewei
 ===rectifier===
 * hold
-* rectifier RNN
+* rectifier RNN --zhiyuan
 ===monophone===
+* hold
 * triphone is tranfered to monophone
-===audio embedding===
-* audio ebedding --Wei Xu
 ==Text Processing==

“ASR:2015-07-27”版本间的差异

2015年7月27日 (一) 06:43的最后版本

目录

Speech Processing

AM development

Environment

RNN AM

Mic-Array

RNN-DAE(Deep based Auto-Encode-RNN)

Speaker ID

Ivector&Dvector based ASR

language vector

rectifier

monophone

Text Processing

RNN LM

Neural Based Document Classification

RNN Rank Task

RNN Word Segment

Seq to Seq(09-15)

Order representation

Balance Representation

Recommendation

DSSM based QA

RNN based QA

Text Group Intern Project

Buddhist Process

RNN Poem Process

RNN Document Vector

Image Baseline

financial group

world quant

tonglian platform

strategy

导航菜单

搜索