SPEECH@SV

Publications

2017

  • Akshay Chandrashekaran, and Ian Lane, "Hierarchical Constrained Bayesian Optimization for Joint Feature, Acoustic Model and Decoder Parameter Optimization", in Interspeech 2017
  • Bing Liu and Ian Lane, "An End-to-End Trainable Neural Network Model with Belief Tracking for Task-Oriented Dialog", in INTERSPEECH 2017.
  • Anurag Kumar, Benjamin Elizalde, Bhiksha Raj. "Audio Content based Geotagging in Multimedia", in INTERSPEECH 2017.
  • Benjamin Elizalde, Ankit Shah, Rohan Badlani, Siddarth Dalmia, Min Lee, Bhiksha Raj, Ian Lane. "An Approach for Self-Training Audio Event Detectors Using Web Data", in EUSIPCO 2017.
  • Suyoun Kim, Takaaki Hori and Shinji Watanabe, "Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning", in ICASSP 2017.
  • Bing Liu and Ian Lane, "Dialog Context Language Modeling with Recurrent Neural Networks", in ICASSP 2017.

2016

2015

2014

2013

  • Jonas Gehring, Wonkyum Lee, Kevin Kilgour, Ian Lane, Yaije Miao and Alex Waibel, "Modular Combination of Deep Neural Networks for Acoustic Modeling," in INTERSPEECH 2013.
  • Ankur Gandhe, Long Qin, Florian Metze, Alexander Rudnicky, Ian Lane, and Matthias Eck, "Using web text to improve keyword spotting in speech," in ASRU 2013.

2012