Hulan-2013-10-18
来自cslt Wiki
ASR
ASR Kernel development
TTS
- CD lab files done. Refining the script.
- Training toolkit is cleaned up. Now no alignment is required. Parallel training is done.
- Tried syllable based system instead of phones.
- Collected an online-novel reading.
Next week:
- Refine the script
- Clean up the online reading.
Dialog system
- The search system migrated to the custom domain, with significant performance reduction
Customs: n TF TFIDF 1 0.496 0.485 2 0.619 0.615 3 0.676 0.673 4 0.713 0.715 5 0.740 0.738 Agriculture: n TF TFIDF 1 0.75 0.8 2 0.85 0.883 3 0.867 0.917 4 0.867 0.95 5 0.95 0.967
- Two problems:
- short of semantic cluster.
- limited training data for idf.
- Next week
- Analyse the QA database, to extract useful domain dependent data
- Analyse the data to expand the key words & phrases
- Analyse the data to attain better IDF.