“ISCSLP Tutorial 2”版本间的差异

来自cslt Wiki
跳转至: 导航搜索
(以“Prof. Chung-Hsien Arousal & Valence coordinator”为内容创建页面)
 
第1行: 第1行:
 
Prof. Chung-Hsien
 
Prof. Chung-Hsien
  
Arousal & Valence coordinator
+
* Arousal & Valence coordinator
 +
* separate emotion process to sub emotions
 +
 
 +
* available databases:
 +
* database collection:
 +
:* acted : GEneva multimodeal emotion portrayals (GEMEP)
 +
:* induced : eNTERFACE'05 EMOTION Database
 +
:* spontaneous: SEMAINE, AFEW
 +
: others: RML,VAM ,FAU AUBO,SAVEE,TUMAVIC,IEMOCAP,SEMAINE MHMC
 +
 
 +
* static vs dynamic modeling
 +
 
 +
STATIC:
 +
:* low level descriptors (LLDs) and functionals
 +
:* good for discriminate high and low-arousal emotions
 +
:* temporal information is lost, no suitable for long utterances, can not detect change in emotion
 +
 
 +
DYNAMIC:
 +
:* frame as the basis, LLDs are extracted and modeled by GMMs, HMMs, DTW
 +
:* temporal information is obtained
 +
:* difficult to model context well
 +
:* a large number of local features need to be extracted
 +
 
 +
* Unit choice for dynamic modeling
 +
:* technical unit: frame, time slice, equally-divided unit
 +
:* meaningful unit: word, syllable, phrases
 +
:* emotionally consistent unit: emotion profiles, emotograms
 +
 
 +
 
 +
 
 +
recognition models

2014年9月13日 (六) 05:25的版本

Prof. Chung-Hsien

  • Arousal & Valence coordinator
  • separate emotion process to sub emotions
  • available databases:
  • database collection:
  • acted : GEneva multimodeal emotion portrayals (GEMEP)
  • induced : eNTERFACE'05 EMOTION Database
  • spontaneous: SEMAINE, AFEW
others: RML,VAM ,FAU AUBO,SAVEE,TUMAVIC,IEMOCAP,SEMAINE MHMC
  • static vs dynamic modeling

STATIC:

  • low level descriptors (LLDs) and functionals
  • good for discriminate high and low-arousal emotions
  • temporal information is lost, no suitable for long utterances, can not detect change in emotion

DYNAMIC:

  • frame as the basis, LLDs are extracted and modeled by GMMs, HMMs, DTW
  • temporal information is obtained
  • difficult to model context well
  • a large number of local features need to be extracted
  • Unit choice for dynamic modeling
  • technical unit: frame, time slice, equally-divided unit
  • meaningful unit: word, syllable, phrases
  • emotionally consistent unit: emotion profiles, emotograms


recognition models