Technical Program

SP-P8: Robust Automatic Speech Recognition: General Topics

Session Type: Poster
Time: Wednesday, May 29, 10:30 - 12:30
Location: Poster Area C
Session Chair: Peder Olsen, IBM
 
SP-P8.1: VOICE ACTIVITY DETECTION USING CONVOLUTIVE NON-NEGATIVE SPARSE CODING
         Peng Teng; Beijing Institute of Technology
         Yunde Jia; Beijing Institute of Technology
 
SP-P8.2: RECURRENT NEURAL NETWORKS FOR VOICE ACTIVITY DETECTION
         Thad Hughes; Google Inc.
         Keir Mierle; Google Inc.
 
SP-P8.3: APPROXIMATED PARALLEL MODEL COMBINATION FOR EFFICIENT NOISE-ROBUST SPEECH RECOGNITION
         Khe Chai Sim; National University of Singapore
 
SP-P8.4: AN UNCERTAINTY DECODING APPROACH TO NOISE- AND REVERBERATION-ROBUST SPEECH RECOGNITION
         Roland Maas; University of Erlangen-Nuremberg
         Akshaya Thippur; KTH Royal Institute of Technology
         Armin Sehr; Beuth University of Applied Sciences Berlin
         Walter Kellermann; University of Erlangen-Nuremberg
 
SP-P8.5: BAYESIAN LATENT VARIABLE MODELS FOR SPEECH RECOGNITION
         Jen-Tzung Chien; National Chiao Tung University
         Peng Liu; Sohu.com Inc.
 
SP-P8.6: AN INVESTIGATION OF DEEP NEURAL NETWORKS FOR NOISE ROBUST SPEECH RECOGNITION
         Michael Seltzer; Microsoft Research
         Dong Yu; Microsoft Research
         Yongqiang Wang; Cambridge University
 
SP-P8.7: MODELING HETEROGENEOUS DATA SOURCES FOR SPEECH RECOGNITION USING SYNCHRONOUS HIDDEN MARKOV MODELS
         Yong Zhao; Georgia Institute of Technology
         Biing-Hwang (Fred) Juang; Georgia Institute of Technology
 
SP-P8.8: NOISE ADAPTIVE FRONT-END NORMALIZATION BASED ON VECTOR TAYLOR SERIES FOR DEEP NEURAL NETWORKS IN ROBUST SPEECH RECOGNITION
         Bo Li; National University of Singapore
         Khe Chai Sim; National University of Singapore
 
SP-P8.9: PREDICTING SPEECH RECOGNITION CONFIDENCE USING DEEP LEARNING WITH WORD IDENTITY AND SCORE FEATURES
         Po-Sen Huang; University of Illinois at Urbana-Champaign
         Kshitiz Kumar; Microsoft Corporation
         Chaojun Liu; Microsoft Corporation
         Yifan Gong; Microsoft Corporation
         Li Deng; Microsoft Research
 
SP-P8.10: ASR ERROR DETECTION IN A CONVERSATIONAL SPOKEN LANGUAGE TRANSLATION SYSTEM
         Wei Chen; Raytheon BBN Technologies
         Sankaranarayanan Ananthakrishnan; Raytheon BBN Technologies
         Rohit Kumar; Raytheon BBN Technologies
         Rohit Prasad; Raytheon BBN Technologies
         Prem Natarajan; Raytheon BBN Technologies
 
SP-P8.11: MEAN TEMPORAL DISTANCE: PREDICTING ASR ERROR FROM TEMPORAL PROPERTIES OF SPEECH SIGNAL
         Hynek Hermansky; Johns Hopkins University
         Ehsan Variani; Johns Hopkins University
         Vijayaditya Peddinti; Johns Hopkins University
 
SP-P8.12: FEATURE EXTRACTION WITH A MULTISCALE MODULATION ANALYSIS FOR ROBUST AUTOMATIC SPEECH RECOGNITION
         Florian Mueller; University of Luebeck
         Alfred Mertins; University of Luebeck
 
SP-P8.13: JOINT ANALYSIS OF VOCAL TRACT LENGTH AND TEMPORAL INFORMATION FOR ROBUST SPEECH RECOGNITION
         Chien-Lin Huang; National Institute of Information and Communications Technology
         Chiori Hori; National Institute of Information and Communications Technology
         Hideki Kashioka; National Institute of Information and Communications Technology
         Bin Ma; Institute for Infocomm Research
 
SP-P8.14: DOUBLE PITCH MARKS IN DIPLOPHONIC VOICE
         Philipp Aichinger; Medical University of Vienna
         Berit Schneider-Stickler; Medical University of Vienna
         Wolfgang Bigenzahn; Medical University of Vienna
         Anna Katharina Fuchs; Graz University of Technology
         Bernhard Geiger; Graz University of Technology
         Martin Hagmüller; Graz University of Technology
         Gernot Kubin; Graz University of Technology