Technical Program
SP-P11: Acoustic Modeling: Novel Methods for Automatic Speech Recognition |
| Session Type: Poster |
| Time: Wednesday, May 29, 15:20 - 17:20 |
| Location: Poster Area D |
| Session Chair: Hermann Ney, RWTH-Aachen |
| SP-P11.1: MULTIFRAME DEEP NEURAL NETWORKS FOR ACOUSTIC MODELING |
| Vincent Vanhoucke; Google Inc. |
| Matthieu Devin; Google Inc. |
| Georg Heigold; Google Inc. |
| SP-P11.2: INVESTIGATION OF TANDEM DEEP BELIEF NETWORK APPROACH FOR PHONEME RECOGNITION |
| Xin Zheng; Tsinghua University |
| Zhiyong Wu; Tsinghua University |
| Binbin Shen; Tsinghua University |
| Helen Meng; The Chinese University of Hong Kong |
| Lianhong Cai; Tsinghua University |
| SP-P11.3: USING MULTIPLE VERSIONS OF SPEECH INPUT IN PHONE RECOGNITION |
| Mark Liberman; University of Pennsylvania |
| Jiahong Yuan; University of Pennsylvania |
| Andreas Stolcke; Microsoft Research |
| Wen Wang; SRI International |
| Vikramjit Mitra; SRI International |
| SP-P11.4: AUDIO-VISUAL DEEP LEARNING FOR NOISE ROBUST SPEECH RECOGNITION |
| Jing Huang; IBM |
| Brian Kingsbury; IBM |
| SP-P11.5: INVESTIGATION OF DEEP BOLTZMANN MACHINES FOR PHONE RECOGNITION |
| Zhao You; Institute of Automation, Chinese Academy of Sciences |
| Xiaorui Wang; Institute of Automation, Chinese Academy of Sciences |
| Bo Xu; Institute of Automation, Chinese Academy of Sciences |
| SP-P11.6: FEATURE AND SCORE LEVEL COMBINATION OF SUBSPACE GAUSSINAS IN LVCSR TASK |
| Petr Motlicek; Idiap Research Institute |
| Daniel Povey; Johns Hopkins University |
| Martin Karafiat; Brno University of Technology |
| SP-P11.7: UPPER AND LOWER BOUNDS FOR APPROXIMATION OF THE KULLBACK-LEIBLER DIVERGENCE BETWEEN HIDDEN MARKOV MODELS |
| Haiyang Li; Harbin Institute of Technology |
| Jiqing Han; Harbin Institute of Technology |
| Tieran Zheng; Harbin Institute of Technology |
| Guibin Zheng; Harbin Institute of Technology |
| SP-P11.8: UNDERSTANDING THE DROPOUT STRATEGY AND ANALYZING ITS EFFECTIVENESS ON LVCSR |
| Jie Li; Institute of Automation, Chinese Academy of Sciences |
| Xiaorui Wang; Institute of Automation, Chinese Academy of Sciences |
| Bo Xu; Institute of Automation, Chinese Academy of Sciences |
| SP-P11.9: EFFICIENT DECODING WITH GENERATIVE SCORE-SPACES USING THE EXPECTATION SEMIRING |
| Rogier van Dalen; University of Cambridge |
| Anton Ragni; University of Cambridge |
| Mark J.F. Gales; University of Cambridge |
| SP-P11.10: IDENTIFICATION AND MODELING OF WORD FRAGMENTS IN SPONTANEOUS SPEECH |
| Yulia Tsvetkov; Carnegie Mellon University |
| Zaid Sheikh; Carnegie Mellon University |
| Florian Metze; Carnegie Mellon University |
| SP-P11.11: LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION BASED ON WFST STRUCTURED CLASSIFIERS AND DEEP BOTTLENECK FEATURES |
| Yotaro Kubo; Nippon Telegraph and Telephone Corporation |
| Takaaki Hori; Nippon Telegraph and Telephone Corporation |
| Atsushi Nakamura; Nippon Telegraph and Telephone Corporation |
| SP-P11.12: DEEP NEURAL NETWORKS WITH AUXILIARY GAUSSIAN MIXTURE MODELS FOR REAL-TIME SPEECH RECOGNITION |
| Xin Lei; Google Inc. |
| Hui Lin; Google Inc. |
| Georg Heigold; Google Inc. |
