2019年10月29日 (二) 02:02的版本

Flow-based Speech Analysis

Members: Dong Wang, Haoran Sun, Yunqi Cai, Lantian Li
Paper: Haoran Sun, Yunqi Cai, Lantian Li, Dong Wang, "On Investigation of Unsupervised Speech Factorization Based in Normalization Flow", 2019. link

Introduction

We present a preliminary investigation on unsupervised speech factorization based on the normalization flow model. This model constructs a complex invertible transform, by which we can project speech segments into a latent code space where the distribution is a simple diagonal Gaussian. Our preliminary investigation on the TIMIT database shows that this code space exhibits favorable properties such as denseness and pseudo linearity, and perceptually important factors such as phonetic content and speaker trait can be represented as particular directions within the code space.

@@ 第2行： / 第2行： @@
 * Members: Dong Wang, Haoran Sun, Yunqi Cai, Lantian Li
-* Paper: Haoran Sun, Yunqi Cai, Lantian Li, Dong Wang, "On Investigation of Unsupervised Speech Factorization Based in Normalization Flow", 2019. [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/e/e2/Flow_00.pdf paper]
+* Paper: Haoran Sun, Yunqi Cai, Lantian Li, Dong Wang, "On Investigation of Unsupervised Speech Factorization Based in Normalization Flow", 2019. [http://cslt.riit.tsinghua.edu.cn/mediawiki/images/e/e2/Flow_00.pdf link]
 ===Introduction===
 We present a preliminary investigation on unsupervised speech factorization based on the normalization flow model. This model constructs a complex invertible transform, by which we can project speech segments into a latent code space where the distribution is a simple diagonal Gaussian. Our preliminary investigation on the TIMIT database shows that this code space exhibits favorable properties such as denseness and pseudo linearity, and perceptually important factors such as phonetic content and speaker trait can be represented as particular directions within the code space.

“Flow-based Speech Analysis”版本间的差异

2019年10月29日 (二) 02:02的版本

Flow-based Speech Analysis

Introduction

导航菜单

个人工具

名字空间

变种

查看

操作

搜索

导航

工具