<?xml version="1.0"?>
<?xml-stylesheet type="text/css" href="http://index.cslt.org/mediawiki/skins/common/feed.css?303"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="zh-cn">
		<id>http://index.cslt.org/mediawiki/index.php?action=history&amp;feed=atom&amp;title=2013-12-20</id>
		<title>2013-12-20 - 版本历史</title>
		<link rel="self" type="application/atom+xml" href="http://index.cslt.org/mediawiki/index.php?action=history&amp;feed=atom&amp;title=2013-12-20"/>
		<link rel="alternate" type="text/html" href="http://index.cslt.org/mediawiki/index.php?title=2013-12-20&amp;action=history"/>
		<updated>2026-04-14T15:24:13Z</updated>
		<subtitle>本wiki的该页面的版本历史</subtitle>
		<generator>MediaWiki 1.23.3</generator>

	<entry>
		<id>http://index.cslt.org/mediawiki/index.php?title=2013-12-20&amp;diff=8812&amp;oldid=prev</id>
		<title>Cslt：以内容“== AM development ==  === Sparse DNN ===  * Optimal Brain Damage(OBD).   # Online OBD held.  # OBD + L1 norm start to investigation.   * Efficient computing  # Conducti...”创建新页面</title>
		<link rel="alternate" type="text/html" href="http://index.cslt.org/mediawiki/index.php?title=2013-12-20&amp;diff=8812&amp;oldid=prev"/>
				<updated>2013-12-20T01:32:54Z</updated>
		
		<summary type="html">&lt;p&gt;以内容“== AM development ==  === Sparse DNN ===  * Optimal Brain Damage(OBD).   # Online OBD held.  # OBD + L1 norm start to investigation.   * Efficient computing  # Conducti...”创建新页面&lt;/p&gt;
&lt;p&gt;&lt;b&gt;新页面&lt;/b&gt;&lt;/p&gt;&lt;div&gt;== AM development ==&lt;br /&gt;
&lt;br /&gt;
=== Sparse DNN ===&lt;br /&gt;
&lt;br /&gt;
* Optimal Brain Damage(OBD). &lt;br /&gt;
&lt;br /&gt;
# Online OBD held. &lt;br /&gt;
# OBD + L1 norm start to investigation. &lt;br /&gt;
&lt;br /&gt;
* Efficient computing&lt;br /&gt;
&lt;br /&gt;
# Conducting rearrangement the matrix structure and compose zero blocks by some smart approaches, leading to better computing speed. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Efficient DNN training ===&lt;br /&gt;
&lt;br /&gt;
# Moment-based training. With m=0.2 performs the best on WER. 6.8% improvement on WER. Other settings are tried on 0.05,0.1,0.2,..0.6,0.8,1.0. &lt;br /&gt;
# Asymmetric window: left 20, right 5. NN accuracy increase by 7%, however WER is a bit worse than the baseline. Move back to Tencent 100h training. &lt;br /&gt;
# Frame-skipping is on implementation. &lt;br /&gt;
&lt;br /&gt;
=== Optimal phoneset===&lt;br /&gt;
&lt;br /&gt;
# Experiment 3 phone sets: Tencent, CSLT, PQ&lt;br /&gt;
# Some errors occur in pure CHS experiments&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Engine optimization===&lt;br /&gt;
&lt;br /&gt;
* Investigating LOUDS FST. On progress. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==LM development==&lt;br /&gt;
&lt;br /&gt;
===NN LM===&lt;br /&gt;
&lt;br /&gt;
* Trained with 500M QA data, 110k vocabulary.&lt;br /&gt;
* Tested on number of hidden layers (DNN), performance is better for some tests, but not for others. &lt;br /&gt;
* Tested on larger projection layer, from 256 to 384, the performance is consistently improved. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Embedded development==&lt;br /&gt;
&lt;br /&gt;
* Embedded stream mode on progress.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Speech QA==&lt;br /&gt;
&lt;br /&gt;
* SP-QA accuracy 45.14% in all the input (18*199).&lt;br /&gt;
* Investigate the error patterns:&lt;br /&gt;
:* 70% errors are caused by incorrect name entity recognition. Working on entity recovery (character, pinyin, ... distance penalty). &lt;br /&gt;
:* 8% errors are caused by English names. Use class-based LM to solve the problem. Ready to work.&lt;br /&gt;
:* Use N-best to recover errors in QA.&lt;/div&gt;</summary>
		<author><name>Cslt</name></author>	</entry>

	</feed>