ISCSLP Tutorial 2

来自cslt Wiki
2014年9月13日 (六) 05:25Cslt讨论 | 贡献的版本

跳转至: 导航搜索

Prof. Chung-Hsien

  • Arousal & Valence coordinator
  • separate emotion process to sub emotions
  • available databases:
  • database collection:
  • acted : GEneva multimodeal emotion portrayals (GEMEP)
  • induced : eNTERFACE'05 EMOTION Database
  • spontaneous: SEMAINE, AFEW
others: RML,VAM ,FAU AUBO,SAVEE,TUMAVIC,IEMOCAP,SEMAINE MHMC
  • static vs dynamic modeling

STATIC:

  • low level descriptors (LLDs) and functionals
  • good for discriminate high and low-arousal emotions
  • temporal information is lost, no suitable for long utterances, can not detect change in emotion

DYNAMIC:

  • frame as the basis, LLDs are extracted and modeled by GMMs, HMMs, DTW
  • temporal information is obtained
  • difficult to model context well
  • a large number of local features need to be extracted
  • Unit choice for dynamic modeling
  • technical unit: frame, time slice, equally-divided unit
  • meaningful unit: word, syllable, phrases
  • emotionally consistent unit: emotion profiles, emotograms


recognition models