2014年2月10日 (一) 05:41的版本

DNN training

Model	CE	MPE1	MPE2	MPE3	MPE4
4k states	23.27/22.85	21.35/18.87	21.18/18.76	21.07/18.54	20.93/18.32
8k states	22.16/22.22	20.55/18.03	20.36/17.94	20.32/17.78	20.29/17.80
8k states + IT	-	20.04/17.38	20.01/17.32	20.07/17.44	19.94/17.65

large LM, it 4, -6/-9 || 15.36 || - large LM, it 4, -7/-9 || 15.25 || - large LM, it 5, -5/-9 || 14.17 || -

CLG decoder uses less memory in decoding
HCLG is faster and more accurate than HCLG, and more amiable to beam control [here http://cslt.riit.tsinghua.edu.cn/cgi-bin/cvss/cvss_request.pl?step=view_request&cvssid=156]
std::exp/std::log result in very slow computation in train203. Solved the problem by replacing to standard exp() and log().