Technical Program
SP-P17: Speech Synthesis |
| Session Type: Poster |
| Time: Friday, May 31, 10:30 - 12:30 |
| Location: Poster Area C |
| Session Chair: Antonio Bonafonte, Technical University of Catalonia (UPC) |
| SP-P17.1: ARTICULATORY INVERSION AND SYNTHESIS: TOWARDS ARTICULATORY-BASED MODIFICATION OF SPEECH |
| Sandesh Aryal; Texas A&M University |
| Ricardo Gutierrez-Osuna; Texas A&M University |
| SP-P17.2: A FAST TABLE LOOKUP BASED, STATISTICAL MODEL DRIVEN NON-UNIFORM UNIT SELECTION TTS |
| Yao Qian; Microsoft Research Asia |
| Frank Soong; Microsoft Research Asia |
| Xiaobo Zhou; Microsoft Research Asia |
| Yundi Qian; Microsoft Research Asia |
| Xiaotian Zhang; Microsoft Research Asia |
| SP-P17.3: STATISTICAL PARAMETRIC SPEECH SYNTHESIS USING DEEP NEURAL NETWORKS |
| Heiga Zen; Google Inc. |
| Andrew Senior; Google Inc. |
| Mike Schuster; Google Inc. |
| SP-P17.4: PREDICTION OF CREAKY VOICE FROM CONTEXTUAL FACTORS |
| Thomas Drugman; University of Mons |
| John Kane; Trinity College Dublin |
| Tuomo Raitio; Aalto University |
| Christer Gobl; Trinity College Dublin |
| SP-P17.5: COMPLEX CEPSTRUM ANALYSIS BASED ON THE MINIMUM MEAN SQUARED ERROR |
| Ranniery Maia; Toshiba Research Europe Ltd. |
| Masami Akamine; Toshiba Corporation |
| Mark J.F. Gales; Toshiba Research Europe Ltd. |
| SP-P17.6: INTEGRATED AUTOMATIC EXPRESSION PREDICTION AND SPEECH SYNTHESIS FROM TEXT |
| Langzhou Chen; Toshiba Research Europe Ltd. |
| Mark J.F. Gales; Toshiba Research Europe Ltd. |
| Norbert Braunschweiler; Toshiba Research Europe Ltd. |
| Masami Akamine; Corporate Research and Development Center |
| Kate Knill; Engineering Department |
| SP-P17.7: SPEAKER AND LANGUAGE INDEPENDENT VOICE QUALITY CLASSIFICATION APPLIED TO UNLABELLED CORPORA OF EXPRESSIVE SPEECH |
| John Kane; Trinity College Dublin |
| Scherer Stefan; University of Southern California |
| Matthew Aylett; CereProc Ltd. |
| Louis-Philippe Morency; University of Southern California |
| Christer Gobl; Trinity College Dublin |
| SP-P17.8: LIGHTLY SUPERVISED GMM VAD TO USE AUDIOBOOK FOR SPEECH SYNTHESISER |
| Yoshitaka Mamiya; University of Edinburgh |
| Junichi Yamagishi; University of Edinburgh |
| Oliver Watts; University of Edinburgh |
| Robert Clark; University of Edinburgh |
| Simon King; University of Edinburgh |
| Adriana Stan; Technical University of Cluj-Napoca |
| SP-P17.9: BOOTSTRAPPING TEXT-TO-SPEECH FOR SPEECH PROCESSING IN LANGUAGES WITHOUT AN ORTHOGRAPHY |
| Sunayana Sitaram; Carnegie Mellon University |
| Sukhada Palkar; Carnegie Mellon University |
| Yun-Nung Chen; Carnegie Mellon University |
| Alok Parlikar; Carnegie Mellon University |
| Alan W. Black; Carnegie Mellon University |
| SP-P17.10: MAXIMUM INTELLIGIBILITY-BASED CLOSE-LOOP SPEECH SYNTHESIS FRAMEWORK FOR NOISY ENVIRONMENTS |
| Yuan-Fu Liao; National Taipei University of Technology |
| Ming-Long Wu; National Taipei University of Technology |
| Jia-Chi Lin; National Taipei University of Technology |
| SP-P17.11: SPEECH SYNTHESIS USING SUBBAND-CODED MULTIBAND SOURCE COMPONENTS AND SINUSOIDS |
| Nobuyuki Nishizawa; KDDI R&D Laboratories, Inc. |
| Tsuneo Kato; KDDI R&D Laboratories, Inc. |
| SP-P17.12: FRAME-LEVEL ACOUSTIC MODELING BASED ON GAUSSIAN PROCESS REGRESSION FOR STATISTICAL NONPARAMETRIC SPEECH SYNTHESIS |
| Tomoki Koriyama; Tokyo Institute of Technology |
| Takashi Nose; Tokyo Institute of Technology |
| Takao Kobayashi; Tokyo Institute of Technology |
| SP-P17.13: MULTI-DISTRIBUTION DEEP BELIEF NETWORK FOR SPEECH SYNTHESIS |
| Shiyin Kang; The Chinese University of Hong Kong |
| Xiaojun Qian; The Chinese University of Hong Kong |
| Helen Meng; The Chinese University of Hong Kong |
