Difference between revisions of "Hulan-2013-10-18"
From cslt Wiki
(Created page with "=ASR= ==ASR Kernel development== http://cslt.riit.tsinghua.edu.cn/mediawiki/index.php/2013-10-11 ASR group weekly report ==TTS== * CD lab files done. Refining ...")
Revision as of 01:40, 18 October 2013 (Fri)
ASR
ASR Kernel development
TTS
- CD lab files are done. The script is being refined.
- The training toolkit has been cleaned up and no longer requires alignment. Parallel training is done.
- Tried a syllable-based system instead of a phone-based one.
- Collected a reading of an online novel.
Next week:
- Refine the script
- Clean up the online-novel reading data.
Dialog system
- The search system was migrated to the customs domain, with a significant drop in performance:
Customs:
n  TF     TFIDF
1  0.496  0.485
2  0.619  0.615
3  0.676  0.673
4  0.713  0.715
5  0.740  0.738

Agriculture:
n  TF     TFIDF
1  0.75   0.8
2  0.85   0.883
3  0.867  0.917
4  0.867  0.95
5  0.95   0.967
- Two problems:
  - lack of semantic clusters.
  - limited training data for estimating IDF.
- Next week:
  - Analyse the QA database to extract useful domain-dependent data.
  - Analyse the data to expand the key words and phrases.
  - Analyse the data to obtain better IDF estimates.
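
For reference, the difference between the TF and TFIDF columns can be illustrated with a small sketch. It is not the group's search system: the example texts, the function names (`tokenize`, `idf_table`, `score`, `rank`) and the plain `log(N / df)` IDF formula are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase whitespace tokenizer (a simplification for this sketch)."""
    return text.lower().split()


def idf_table(docs):
    """Textbook IDF, log(N / df), estimated from the given documents."""
    n_docs = len(docs)
    df = Counter()
    for doc in docs:
        df.update(set(tokenize(doc)))  # count each term once per document
    return {term: math.log(n_docs / count) for term, count in df.items()}


def score(query, doc, idf=None):
    """Sum the document's term frequencies over the query terms.

    With idf=None every term has weight 1 (plain TF); otherwise each
    term is weighted by its IDF, and unseen terms get weight 0.
    """
    tf = Counter(tokenize(doc))
    total = 0.0
    for term in set(tokenize(query)):
        weight = 1.0 if idf is None else idf.get(term, 0.0)
        total += tf[term] * weight
    return total


def rank(query, docs, idf=None):
    """Return document indices ordered from best to worst score."""
    return sorted(range(len(docs)),
                  key=lambda i: score(query, docs[i], idf),
                  reverse=True)


# Made-up entries standing in for a QA database.
docs = [
    "the opening hours of the office at the port in the city",
    "tariff rate for imported rice",
    "the fee for the form",
]
query = "the tariff for the rice"

idf = idf_table(docs)
tf_ranking = rank(query, docs)          # plain TF
tfidf_ranking = rank(query, docs, idf)  # TF weighted by IDF
print(tf_ranking, tfidf_ranking)
```

In this toy case plain TF ranks the first entry highest because it repeats "the", while the IDF weighting moves the tariff entry to the top. With only three documents the IDF values are very coarse, which is the same kind of limitation as the "limited training data" problem noted above.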