Technical Program

SP-P17: Speech Synthesis

Session Type: Poster
Time: Friday, May 31, 10:30 - 12:30
Location: Poster Area C
Session Chair: Antonio Bonafonte, Technical University of Catalonia (UPC)
 
SP-P17.1: ARTICULATORY INVERSION AND SYNTHESIS: TOWARDS ARTICULATORY-BASED MODIFICATION OF SPEECH
         Sandesh Aryal; Texas A&M University
         Ricardo Gutierrez-Osuna; Texas A&M University
 
SP-P17.2: A FAST TABLE LOOKUP BASED, STATISTICAL MODEL DRIVEN NON-UNIFORM UNIT SELECTION TTS
         Yao Qian; Microsoft Research Asia
         Frank Soong; Microsoft Research Asia
         Xiaobo Zhou; Microsoft Research Asia
         Yundi Qian; Microsoft Research Asia
         Xiaotian Zhang; Microsoft Research Asia
 
SP-P17.3: STATISTICAL PARAMETRIC SPEECH SYNTHESIS USING DEEP NEURAL NETWORKS
         Heiga Zen; Google Inc.
         Andrew Senior; Google Inc.
         Mike Schuster; Google Inc.
 
SP-P17.4: PREDICTION OF CREAKY VOICE FROM CONTEXTUAL FACTORS
         Thomas Drugman; University of Mons
         John Kane; Trinity College Dublin
         Tuomo Raitio; Aalto University
         Christer Gobl; Trinity College Dublin
 
SP-P17.5: COMPLEX CEPSTRUM ANALYSIS BASED ON THE MINIMUM MEAN SQUARED ERROR
         Ranniery Maia; Toshiba Research Europe Ltd.
         Masami Akamine; Toshiba Corporation
         Mark J.F. Gales; Toshiba Research Europe Ltd.
 
SP-P17.6: INTEGRATED AUTOMATIC EXPRESSION PREDICTION AND SPEECH SYNTHESIS FROM TEXT
         Langzhou Chen; Toshiba Research Europe Ltd.
         Mark J.F. Gales; Toshiba Research Europe Ltd.
         Norbert Braunschweiler; Toshiba Research Europe Ltd.
         Masami Akamine; Corporate Research and Development Center
         Kate Knill; Engineering Department
 
SP-P17.7: SPEAKER AND LANGUAGE INDEPENDENT VOICE QUALITY CLASSIFICATION APPLIED TO UNLABELLED CORPORA OF EXPRESSIVE SPEECH
         John Kane; Trinity College Dublin
         Scherer Stefan; University of Southern California
         Matthew Aylett; CereProc Ltd.
         Louis-Philippe Morency; University of Southern California
         Christer Gobl; Trinity College Dublin
 
SP-P17.8: LIGHTLY SUPERVISED GMM VAD TO USE AUDIOBOOK FOR SPEECH SYNTHESISER
         Yoshitaka Mamiya; University of Edinburgh
         Junichi Yamagishi; University of Edinburgh
         Oliver Watts; University of Edinburgh
         Robert Clark; University of Edinburgh
         Simon King; University of Edinburgh
         Adriana Stan; Technical University of Cluj-Napoca
 
SP-P17.9: BOOTSTRAPPING TEXT-TO-SPEECH FOR SPEECH PROCESSING IN LANGUAGES WITHOUT AN ORTHOGRAPHY
         Sunayana Sitaram; Carnegie Mellon University
         Sukhada Palkar; Carnegie Mellon University
         Yun-Nung Chen; Carnegie Mellon University
         Alok Parlikar; Carnegie Mellon University
         Alan W. Black; Carnegie Mellon University
 
SP-P17.10: MAXIMUM INTELLIGIBILITY-BASED CLOSE-LOOP SPEECH SYNTHESIS FRAMEWORK FOR NOISY ENVIRONMENTS
         Yuan-Fu Liao; National Taipei University of Technology
         Ming-Long Wu; National Taipei University of Technology
         Jia-Chi Lin; National Taipei University of Technology
 
SP-P17.11: SPEECH SYNTHESIS USING SUBBAND-CODED MULTIBAND SOURCE COMPONENTS AND SINUSOIDS
         Nobuyuki Nishizawa; KDDI R&D Laboratories, Inc.
         Tsuneo Kato; KDDI R&D Laboratories, Inc.
 
SP-P17.12: FRAME-LEVEL ACOUSTIC MODELING BASED ON GAUSSIAN PROCESS REGRESSION FOR STATISTICAL NONPARAMETRIC SPEECH SYNTHESIS
         Tomoki Koriyama; Tokyo Institute of Technology
         Takashi Nose; Tokyo Institute of Technology
         Takao Kobayashi; Tokyo Institute of Technology
 
SP-P17.13: MULTI-DISTRIBUTION DEEP BELIEF NETWORK FOR SPEECH SYNTHESIS
         Shiyin Kang; The Chinese University of Hong Kong
         Xiaojun Qian; The Chinese University of Hong Kong
         Helen Meng; The Chinese University of Hong Kong