Research

研究プロジェクト・論文・書籍等

Share

  • 論文

The use of articulatory movement data in speech synthesis applications: an overview –Application of articulatory movements using machine learning algorithms–

Author:Korin Richmond, Zhenhua Ling, Junichi Yamagishi

  • #音声処理
  • #音声合成

Acoustical Science and Technology

This paper describes speech processing work in which articulator movements are used in conjunction with the acoustic speech signal and/or linguistic information. By “articulator movements,” we mean the changing positions of human speech articulators such as the tongue and lips, which may be recorded by electromagnetic articulography (EMA), amongst other articulography techniques. Specifically, we provide an overview of: i) inversion mapping techniques, where we estimate articulator movements from a given new speech waveform automatically; ii) statistical voice conversion and speech synthesis techniques which use articulator movements as part of the process to generate synthetic speech, and also make it intuitively controllable via articulation; and iii) automatic prediction (or synthesis) of articulator movements from any given new text input.