Technical Program
SP-P8: Robust Automatic Speech Recognition: General Topics
| Session Type: Poster |
| Time: Wednesday, May 29, 10:30 - 12:30 |
| Location: Poster Area C |
| Session Chair: Peder Olsen, IBM |
| SP-P8.1: VOICE ACTIVITY DETECTION USING CONVOLUTIVE NON-NEGATIVE SPARSE CODING |
| Peng Teng; Beijing Institute of Technology |
| Yunde Jia; Beijing Institute of Technology |
| SP-P8.2: RECURRENT NEURAL NETWORKS FOR VOICE ACTIVITY DETECTION |
| Thad Hughes; Google Inc. |
| Keir Mierle; Google Inc. |
| SP-P8.3: APPROXIMATED PARALLEL MODEL COMBINATION FOR EFFICIENT NOISE-ROBUST SPEECH RECOGNITION |
| Khe Chai Sim; National University of Singapore |
| SP-P8.4: AN UNCERTAINTY DECODING APPROACH TO NOISE- AND REVERBERATION-ROBUST SPEECH RECOGNITION |
| Roland Maas; University of Erlangen-Nuremberg |
| Akshaya Thippur; KTH Royal Institute of Technology |
| Armin Sehr; Beuth University of Applied Sciences Berlin |
| Walter Kellermann; University of Erlangen-Nuremberg |
| SP-P8.5: BAYESIAN LATENT VARIABLE MODELS FOR SPEECH RECOGNITION |
| Jen-Tzung Chien; National Chiao Tung University |
| Peng Liu; Sohu.com Inc. |
| SP-P8.6: AN INVESTIGATION OF DEEP NEURAL NETWORKS FOR NOISE ROBUST SPEECH RECOGNITION |
| Michael Seltzer; Microsoft Research |
| Dong Yu; Microsoft Research |
| Yongqiang Wang; Cambridge University |
| SP-P8.7: MODELING HETEROGENEOUS DATA SOURCES FOR SPEECH RECOGNITION USING SYNCHRONOUS HIDDEN MARKOV MODELS |
| Yong Zhao; Georgia Institute of Technology |
| Biing-Hwang (Fred) Juang; Georgia Institute of Technology |
| SP-P8.8: NOISE ADAPTIVE FRONT-END NORMALIZATION BASED ON VECTOR TAYLOR SERIES FOR DEEP NEURAL NETWORKS IN ROBUST SPEECH RECOGNITION |
| Bo Li; National University of Singapore |
| Khe Chai Sim; National University of Singapore |
| SP-P8.9: PREDICTING SPEECH RECOGNITION CONFIDENCE USING DEEP LEARNING WITH WORD IDENTITY AND SCORE FEATURES |
| Po-Sen Huang; University of Illinois at Urbana-Champaign |
| Kshitiz Kumar; Microsoft Corporation |
| Chaojun Liu; Microsoft Corporation |
| Yifan Gong; Microsoft Corporation |
| Li Deng; Microsoft Research |
| SP-P8.10: ASR ERROR DETECTION IN A CONVERSATIONAL SPOKEN LANGUAGE TRANSLATION SYSTEM |
| Wei Chen; Raytheon BBN Technologies |
| Sankaranarayanan Ananthakrishnan; Raytheon BBN Technologies |
| Rohit Kumar; Raytheon BBN Technologies |
| Rohit Prasad; Raytheon BBN Technologies |
| Prem Natarajan; Raytheon BBN Technologies |
| SP-P8.11: MEAN TEMPORAL DISTANCE: PREDICTING ASR ERROR FROM TEMPORAL PROPERTIES OF SPEECH SIGNAL |
| Hynek Hermansky; Johns Hopkins University |
| Ehsan Variani; Johns Hopkins University |
| Vijayaditya Peddinti; Johns Hopkins University |
| SP-P8.12: FEATURE EXTRACTION WITH A MULTISCALE MODULATION ANALYSIS FOR ROBUST AUTOMATIC SPEECH RECOGNITION |
| Florian Mueller; University of Luebeck |
| Alfred Mertins; University of Luebeck |
| SP-P8.13: JOINT ANALYSIS OF VOCAL TRACT LENGTH AND TEMPORAL INFORMATION FOR ROBUST SPEECH RECOGNITION |
| Chien-Lin Huang; National Institute of Information and Communications Technology |
| Chiori Hori; National Institute of Information and Communications Technology |
| Hideki Kashioka; National Institute of Information and Communications Technology |
| Bin Ma; Institute for Infocomm Research |
| SP-P8.14: DOUBLE PITCH MARKS IN DIPLOPHONIC VOICE |
| Philipp Aichinger; Medical University of Vienna |
| Berit Schneider-Stickler; Medical University of Vienna |
| Wolfgang Bigenzahn; Medical University of Vienna |
| Anna Katharina Fuchs; Graz University of Technology |
| Bernhard Geiger; Graz University of Technology |
| Martin Hagmüller; Graz University of Technology |
| Gernot Kubin; Graz University of Technology |
