-
Research Areas:
Automatic Speech Recognition
Spoken Language Technology
Natural Language Processing
Machine Learning
Human-Computer Interface
Computational Biology
Speech Generation
Conference Papers, Journals, Book Chapters
2007
Mukund Narasimhan, Jeff Bilmes, "Local Search for Balanced Submodular Clusterings", Twentieth International Joint Conference on Artificial Intelligence (IJCAI07), Hyderabad, India, January 6-12, 2007
Danny Wyatt, Tanzeem Choudhury, Jeff Bilmes, Henry Kautz, "A Privacy Sensitive Approach to Modeling Multi-Person Conversations", Twentieth International Joint Conference on Artificial Intelligence (IJCAI07), Hyderabad, India, January 6-12, 2007
2006
Xiao Li, Jonathan Malkin, and Jeff Bilmes, "A High-speed, Low-Resource ASR Back-end Based on Custom Arithmetic", IEEE Trans. on Audio, Speech and Language Processing, 14(5), pp. 1694-1703, September, 2006
pdf
Yang Liu, Elizabeth Shriberg, Andreas Stolcke, Dustin Hillard, Mari
Ostendorf, and Mary Harper, "Enriching Speech Recognition with Sentence
Boundaries and Disfluencies," IEEE Transactions on Audio, Speech, and Language Processing, 14(5), pp. 1526-1540, September, 2006
pdf
Andreas Stolcke, Barry Chen, Horacio Franco, Ramana Gadde, Martin
Graciarena, Mei-Yuh Hwang, Katrin Kirchhoff, Xin Lei, Arindam Mandal,
Nelson Morgan, Tim Ng, Mari Ostendorf, Kemal Sonmez, Anand
Venkataraman, Dimitra Vergyri,Wen Wang, Jing Zheng, Qifeng Zhu, "Recent
Innovations in Speech-to-Text Transcription at SRI-ICSI-UW", IEEE Transactions on Audio, Speech and Language Processing, 14(5), pp. 1729-1744, 2006
Katrin Kirchhoff, Dimitra Vergyri, Jeff Bilmes, Kevin Duh, and Andreas
Stolcke, "Morphology-based Language Modeling for Arabic Speech
Recognition," Computer Speech and Language, vol. 20(4), pp.589-608, 2006
Mei-Yuh Hwang, "Acoustic Modeling for Mandarin Chinese," in Advances in Chinese Spoken Language Processing, (editors: Chin-Hui Lee, Haizhou Li, Lin-shan Lee, Ren-Hua Wang, Qiang Huo), World Scientific Publishing Co., Dec 2006
William P. McNeill, Jeremy G. Kahn, Dustin L. Hillard, and Mari
Ostendorf. "Parse Structure and Segmentation for Improving Speech
Recognition", IEEE 2006 Workshop on Spoken Language Technology, Aruba. December, 2006
pdf
Dustin Hillard, Zhongqiang Huang, Heng Ji, Ralph Grishman, Dilek
Hakkani-Tur, Mary Harper, Mari Ostendorf, Wen Wang, "Impact of
Automatic Comma Prediction on POS/Name Tagging of Speech," IEEE 2006 Workshop on Spoken Language Technology, Aruba. December, 2006
pdf
G. Ji, J. Bilmes, J. Michels, K. Kirchhoff, C. Manning, "Graphical Model Representations of Word Lattices", IEEE 2006 Workshop on Spoken Language Technology, Aruba. December, 2006
Karim Filali and Jeff Bilmes, "Multi-dynamic Bayesian Networks," Advances in Neural Information Processing Systems (NIPS), Vancouver, Dec 2006
Xiao Li, Gang Ji and Jeff Bilmes, "A Factored Language Model for Quantized Pitch and Duration," Intl. Computer Music Conf. (ICMC), New Orleans, Nov 2006
pdf
Katrin Kirchhoff, Kevin Duh, and Chris Lim, "The University of Washington Machine Translation System for IWSLT 2006," Proc. of the International Workshop on Spoken Language Translation, Kyoto, Japan, 2006.
Amarnag Subramanya, Alvin Raj, Jeff Bilmes and Dieter Fox, "Hierarchical Models for Activity Recognition", IEEE International Workshop in Multimedia Signal Processing (MMSP), Victoria, BC, Canada, October, 2006.
pdf
S. Harada, J.A. Landay, J. Malkin, X. Li, and J.A. Bilmes, "The Vocal
Joystick: Evaluation of Voice-based Cursor Control Techniques," Proceedings of the 8th International ACM SIGACCESS Conference on Computers and Accessibility, Portland, Oregon, October 2006.
Arindam Mandal, Mari Ostendorf and Andreas Stolcke, "Speaker Clustered Regression-Class Trees", Interspeech (ICSLP), September 2006, pp. 1133-1136, Pittsburgh, US.
Xin Lei, Manhung Siu, Mei-Yuh Hwang, Mari Ostendorf and Tan Lee,
"Improved Tone Modeling for Mandarin Broadcast News Speech
Recognition", Interspeech (ICSLP), September 2006, Pittsburgh, US.
pdf
Mei-Yuh Hwang, Xin Lei, Wen Wang and Takahiro Shinozaki, "Investigation on Mandarin Broadcast News Speech Recognition", Interspeech (ICSLP), September 2006, Pittsburgh, US.
pdf
Xin Lei, Jon Hamaker and Xiaodong He, "Robust Feature Space Adaptation for Telephony Speech Recognition", Interspeech (ICSLP), September 2006, Pittsburgh, US.
pdf
Xiao Li, Jonathan Malkin, Jeff Bilmes, Susumu Harada and James Landay,
"An Online Adaptive Filtering Algorithm for the Vocal Joystick,"
Interspeech, Pittsburgh, September 2006
pdf
Kelly Kilanski, Jonathan Malkin, Xiao Li, Richard Wright and Jeff
Bilmes, "The Vocal Joystick Data Collection Effort and Vowel Corpus,"
Interspeech, Pittsburgh, Sep. 2006
pdf
Sarah E. Petersen and Mari Ostendorf. "Assessing the Reading Level of Web Pages." In Proceedings of Interspeech, 2006.
M. Sammer, K. Reiter, S. Soderland, K. Kirchhoff and O. Etzioni,
"Ambiguity Reduction for Machine Translation: Human-Computer
Collaboration", Proceedings of AMTA (Machine Translation in the
Americas), Boston, MA, August, 2006
pdf
Amarnag Subramanya, Alvin Raj, Jeff Bilmes and Dieter Fox, "Recognizing
Activities and Spatial Context Using Wearable Sensors", 22nd Conference on Uncertainty in Artificial Intelligence (UAI06), Cambridge, MA, July, 2006
pdf
Chris Bartels and Jeff Bilmes. "Non-Minimal Triangulations for Mixed Stochastic/Deterministic Graphical Models", 22nd Conference on Uncertainty in Artificial Intelligence (UAI06), Cambridge, MA, July 2006
Alvin Raj, Amarnag Subramanya, Dieter Fox and Jeff Bilmes,
"Rao-Blackwellized Particle Filters for Recognizing Activities and
Spatial Context Using Wearable Sensors", International Symposium on Experimental Robotics (ISER), Rio de Janeiro, Brazil, July 2006.
pdf
Brian Roark, Mary Harper, Eugene Charniak, Bonnie Dorr, Mark Johnson,
Jeremy G. Kahn, Yang Liu, Mari Ostendorf, John Hale, Anna
Krasnyanskaya, Matt Lease, Izhak Shafran, Matt Snover, Robin Stewart,
Lisa Yung. "SParseval: Evaluation Metrics for Parsing Speech", LREC, 2006.
Kevin Duh and Katrin Kirchhoff, "Lexicon Acquisition for Dialectal Arabic Using Transductive Learning," Proc. of EMNLP (Empirical Methods in Natural Language Processing), Sydney, Australia, 2006.
pdf
Gang Ji and Jeff Bilmes, "Backoff Model Training using Partially Observed Data: Application to Dialog Act Tagging", Proc. Human Language Technology/ American chapter of the Association for Computational Linguistics(HLT/NAACL'06), New York, NY. June, 2006
pdf
Sangyun Hahn, Richard E. Ladner and Mari Ostendorf.
"Agreement/disagreement classification: exploiting unlabeled data using
contrast classifiers." Proc. Human Language Technology/ American chapter of the Association for Computational Linguistics(HLT/NAACL'06), New York, NY. June 2006.
pdf
S. Corston-Oliver, A. Aue, K. Duh, E. Ringger, "Multilingual Dependency Parsing using Bayes Point Machines", Proc. Human Language Technology/ American chapter of the Association for Computational Linguistics(HLT/NAACL'06), New York, NY. June 2006
pdf
Andrei Alexandrescu and Katrin Kirchhoff. "Factored Neural Language Models", Proc. Human Language Technology/ American chapter of the Association for Computational Linguistics(HLT/NAACL'06), New York, NY. June, 2006
pdf
BibTeX
Katrin Kirchhoff, Mei Yang, and Kevin Duh, "Statistical Machine
Translation of Parliamentary Proceedings Using Morpho-Syntactic
Knowledge". Proc. of the TC-STAR Workshop on Speech to Speech Translation, Barcelona, Spain, 2006
pdf
Stephen Purpura and Dustin Hillard, "Automated Classification of Congressional Legislation," Proceedings of the Sixth International Conference on Digital Government Research, 2006.
pdf
M. Yang and K. Kirchhoff, "Phrase-Based Backoff Models for Machine Translation of Highly Inflected Languages," Proc. European Chapter of the Association for Computational Linguistics (EACL), Torento, Italy. April, 2006
pdf
Xiao Li and Jeff Bilmes, "Regularized Adaptation of Discriminative Classifiers," Proc. ICASSP, Toulouse, France, May, 2006
pdf
Dustin Hillard and Mari Ostendorf, "Compensating for Word Posterior Estimation Bias in Confusion Networks," Proc. ICASSP, 2006.
pdf
Andreas Stolcke, Frantisek Grezl, Mei-Yuh Hwang, Xin Lei, Nelson Morgan
and Dimitra Vergyri, "Cross-domain and Cross-language Portability of
Acoustic Features Estimated by Multilayer Perceptrons", Proc. ICASSP, 2006, Toulouse, France.
pdf
Takahiro Shinozaki, "HMM State Clustering based on Efficient Cross-Validation", ICASSP, pp.1157-1160, Toulouse, France, 2006.
pdf
Jeff Bilmes, Jonathan Malkin, Xiao Li, Susumu Harada, Kelley Kilanski,
Katrin Kirchhoff, Richard Wright, Amarnag Subramanya, James Landay,
Patricia Dowden and Howard Chizeck, "The Vocal Joystick," Proc. ICASSP, Toulouse, France, May 2006
pdf
2005
K. Kirchhoff and S. Schimmel, "Statistical properties of infant-directed vs. adult-directed speech: insights from speech recognition", Journal of the Acoustical Society of America 117(4), pp. 2224-2237, 2005
I. Bulyko, K. Kirchhoff, M. Ostendorf and J. Goldberg, "Error
Correction Detection and Response Generation in a Spoken Langugage
Dialogue System", Speech Communication 45(3) (Special Issue on Error
Handling in Spoken Dialogue Systems) 2005, pp. 271-288
Jeff Bilmes and Chris Bartels. "Graphical Model Architectures for Speech Recognition," IEEE Signal Processing Magazine, September 2005, vol. 22, no. 5, pp. 89-100.
K. Kirchhoff and D. Vergyri,"Cross-Dialectal Data Sharing for Acoustic
Modeling in Arabic Speech Recognition", Speech Communication 46, pp.
37-51, 2005
Karim Filali and Jeff Bilmes, "Algorithms for Data-driven ASR Parameter Quantization," Computer Speech & Language, Volume 20, Issue 4, Oct 2005, Pages 625-643.
Xiao Li and Jeff Bilmes, "Feature Pruning for Low-Power ASR Systems in
Clean and Noisy Environments," IEEE Signal Processing Letters, June,
2005
pdf
Kevin Duh and Katrin Kirchhoff, "Structured Multi-label Transductive Learning: a Case Study in Lexicon Acquisition". NIPS 2005, Advances in Structured Learning for Text and Speech Processing Workshop, Whistler, Canada, Dec 2005.
pdf
Mukund Narasimhan, Nebojsa Jojic, Jeff Bilmes, "Q-Clustering" Neural Information Processing Systems (NIPS), Vancover, Canada, Dec 2005
pdf
Chris Bartels and Jeff Bilmes. "Focused State Transition Information in ASR", IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), San Juan, Puerto Rico, Nov/Dec, 2005.
pdf
Karim Filali and Jeff Bilmes, "Leveraging Multiple Languages to Improve Statistical MT Word Alignments," IEEE Automatic Speech Recognition and Understanding (ASRU), Cancun, Mexico, Nov/Dec 2005.
Jon Malkin, Xiao Li and Jeff Bilmes, "Energy and Loudness for Speed
Control in the Vocal Joystick," IEEE Automatic Speech Recognition and
Understanding Workshop (ASRU), San Juan, Puerto Rico, Nov/Dec, 2005
pdf
Jeremy G. Kahn, Matt Lease, Eugene Charniak, Mark Johnson and Mari
Ostendorf. "Effective Use of Prosody in Parsing Conversational Speech",
HLT/EMNLP. 2005
pdf
Jeff A. Bilmes, Xiao Li, Jonathan Malkin, Kelley Kilanski, Richard
Wright, Katrin Kirchhoff, Amarnag Subramanya, Susumu Harada, James A.
Landay, Patricia Dowden and Howard Chizeck, "The Vocal Joystick: A
Voice-Based Human-Computer Interface for Individuals with Motor
Impairments," HLT/EMNLP, Vancouver, Canada, Oct, 2005
pdf
Sheila Reynolds, Jeff Bilmes, "Part-of-Speech Tagging using Virtual
Evidence and Negative Training," HLT/EMNLP, Vancouver, Canada, Oct,
2005
pdf
Chia-Ping Chen, Jeff Bilmes, Dan Ellis, "Speech Feature Smoothing for Robust ASR, IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, Philadelphia, PA, March 2005
pdf
Gang Ji and Jeff Bilmes, "Dialog Act Tagging using Graphical Models," IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, Philadelphia, PA, March 2005
pdf
M. Hasegawa-Johnson, J. Baker, S. Borys, K. Chen, E. Coogan, S.
Greenberg, A. Juneja, K. Kirchhoff, K. Livescu, S. Mohan, J. Muller, K.
Sonmez and T. Wang, "Landmark-based Speech Recognition: Report of the
2004 Johns Hopkins Summer Workshop", Proceedings of ICASSP, 2005
Yang Liu, Elizabeth Shriberg, Andreas Stolcke, Barbara Peskin, Jeremy
Ang, Dustin Hillard, Mari Ostendorf, Marcus Tomalin, Phil Woodland, and
Mary Harper, "Structural Metatada Research in the EARS Program,"
Invited paper. ICASSP, 2005.
pdf
Xin Lei, Gang Ji, Tim Ng, Jeff Bilmes and Mari Ostendorf, "DBN-based Multi-Stream Mandarin Toneme Recogntion", ICASSP, March 2005, Philadelphia, US.
pdf
Tim Ng, Mari Ostendorf, Mei-Yuh Hwang, Ivan Bulyko, Manhung Siu and Xin
Lei, "Web-data Augmented Language Model for Mandarin Speech
Recognition", ICASSP, March 2005, Philadelphia, US.
pdf
Somsak Sukittanon, Les E. Atlas, James W. Pitton, and Karim Filali, "Improved Modulation Spectrum Through Multi-scale Modulation Frequency Decomposition," IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), Philadelphia, Pennsylvania, March 2005.
Xiao Li, Asela Gunawardana, and Alex Acero, "Unsupervised Semantic
Intent Discovery from Call Log Acoustics", ICASSP, philadelphia, March,
2005
pdf
Jon Malkin, Xiao Li and Jeff Bilmes, "A Graphical Model Approach to Formant Tracking," ICASSP, Philadelphia March 2005
pdf
D. Vergyri, K. Kirchhoff, R. Gadde, A. Stolcke and J. Zheng,
"Development of a Conversational Telephone Speech Recognizer for
Levantine Arabic", Proceedings of Interspeech, Lisboa, Portugal, 2005
Amarnag Subramanya, Jeff Bilmes and Chia-Ping Chen, "Focused Word
Segmentation for ASR", European Conference on Speech Communication and
Technology (EUROSPEECH), September 2005.
pdf
Arindam Mandal, Mari Ostendorf and Andreas Stolcke, "Leveraging Speaker-Dependent Variation of Adaptation", Interspeech (Eurospeech), September 2005, pp. 1793-1796, Lisbon, Portugal.
Chris Bartels, Kevin Duh, Jeff Bilmes, Katrin Kirchhoff, and Simon
King. "Genetic Triangulation of Graphical Models for Speech and
Language Processing," 9th European Conference on Speech Communication and Technology (Eurospeech), Lisbon, Portugal, September 2005.
pdf
Takahiro Shinozaki, Mari Ostendorf, and Les Atlas, "Data Sampling for Improved Speech Recognizer Training", 9th European Conference on Speech Communication and Technology (Eurospeech), pp.1693-1696, Lisbon, Portugal, September 2005.
pdf
Simon King, Chris Bartels, and Jeff Bilmes. "SVitchboard 1: Small Vocabulary Tasks from Switchboard 1," 9th European Conference on Speech Communication and Technology (Eurospeech), Lisbon, Portugal, September 2005.
pdf
Xin Lei, Mei-Yuh Hwang and Mari Ostendorf, "Incorporating Tone-related
MLP Posteriors in the Feature Representation for Mandarin ASR", Interspeech (Eurospeech), September 2005, Lisboa, Portugal.
pdf
Xiao Li, Jeff Bilmes, and Jon Malkin, "Maximum Margin Learning and
Adaptation of MLP Classifers," Interspeech, Lisbon, Portugal, 2005
pdf
K. Kirchhoff and M. Yang, "Improved Language Modeling for Statistical Machine Translation", Proceedings of the ACL Workshop on Building and Using Parallel Texts, 2005
pdf
Kevin Duh and Katrin Kirchhoff, "POS Tagging of Dialectal Arabic: A Minimally Supervised Approach," 43rd
Annual Meeting of the Assoc. for Computational Linguistics (ACL2005),
Workshop on Computational Approaches to Semitic Languages, Ann Arbor, Michigan, USA, June 2005.
pdf
Kevin Duh, "Jointly Labeling Multiple Sequences: A Factorial HMM Approach," 43rd Annual Meeting of the Assoc. for Computational Linguistics (ACL 2005), Student Research Workshop, Ann Arbor, Michigan, USA, June 2005.
pdf
Karim Filali and Jeff Bilmes, "A Dynamic Bayesian Framework to Model
Context and Memory in Edit Distance Learning: An Application to
Pronunciation Classification," Proceedings of the Association for Computational Linguistics (ACL), Ann-Arbor, Michigan, June 2005.
Sarah E. Schwarm and Mari Ostendorf. "Reading Level Assessment Using
Support Vector Machines and Statistical Language Models." In
Proceedings of the Association for Computational Linguistics, 2005.
pdf
A. Stolcke, X. Anguera, K. Boakye, O. Cetin, F. Grezl, A. Janin, A.
Mandal, B. Peskin, C. Wooters and J. Zheng, "Further Progress in
Meeting Recognition: The ICSI-SRI Spring 2005 Meeting Recogntion
System", Proceedings of NIST MLMI Meeting Recognition Workshop, 2005.
Constantinos Boulis, Jeremy G. Kahn, and Mari Ostendorf. "The Role of
Disfluencies in Topic Classification of Human-Human Conversations", AAAI-05 workshop on Spoken Language Understanding, 2005.
pdf
Yi Li, Linda Shapiro, and Jeff Bilmes, "A Generative/Discriminative Learning Algorithm for Object Recognition", 10th IEEE Conference on Computer Vision (ICCV) , Beijing, China 2005
pdf
Franz Pernkopf and Jeff Bilmes, "Discriminative versus Generative
Parameter and Structure Learning of Bayesian Network Classifiers", International Conference on Machine Learning, Bonn, Germany, 2005
pdf
Mukund Narasimhan and Jeff Bilmes, "A Supermodular-Submodular Procedure
with Applications to Discriminative Structure Learning", 21st Conference on Uncertainty in Artificial Intelligence (UAI05), Edinburgh, Scotland, 2005
pdf
2004
J. Bilmes, Graphical Models and Automatic Speech Recognition, in
"Mathematical Foundations of Speech and Language Processing", ed. M.
Johnson, S. Khudanpur, M. Ostendorf and R. Rosenfeld, Institute of
Mathematical Analysis Volumes in Mathematics Series, vol. 138,
Springer-Verlag, 2004.
M. Ostendorf and I. Bulyko, "The Use of Speech Recognition Technology in Speech Synthesis," in Text-to-Speech Synthesis: New Paradigms and Advances, ed. S. Narayanan and A. Alwan, Prentice Hall, 2004.
K. Kirchhoff, "Machine Translation", in Encyclopedia of Human-Computer Interaction, Berkshire, 2004
S. Schwarm, I. Bulyko and M. Ostendorf. "Adaptive Language Modeling with Varied Sources to Cover New Vocabulary Items," IEEE Transaction on Speech and Audio Processing, 12(3), pp. 334-342. May 2004.
R. Vuduc, J. Demmel, J. Bilmes. Statistical Models for Empirical Search-Based Performance Tuning. , International Journal of High Performance Computing Applications, 18(1), pp.65-94, Feb 2004.
Mukund Narasimhan, Jeff Bilmes, "Optimal Sub-graphical Models", Neural Information Processing Systems (NIPS), 2004 , Vancouver, Canada, Dec 2004
pdf
Jeff Bilmes, "What HMMs Can't do", Invited paper and lecture, ATR Workshop "Beyond HMMs", Kyoto, Japan, December 2004
pdf
Mei-Yuh Hwang, Xin Lei, Tim Ng, Ivan Bulyko, Mari Ostendorf, Andreas
Stolcke, Wen Wang, Jing Zheng, Venkata Ramana Rao Gadde, Martin
Graciarena, Man-Hung Siu and Yan Huang, "Progress on Mandarin
Conversational Telephone Speech Recognition", Proc. International Symposium on Chinese Spoken Language Processing (ISCSLP) 2004, December 2004, Hong Kong.
pdf
D. Vergyri, K. Kirchhoff, A. Stolcke, R. Gadde and J. Zheng, "Models for dialectal Arabic CTS," Proceedings of the NIST RT-04 Workshop, 2004.
M. Ostendorf and D. Hillard, "Scoring Structural MDE: Towards More Meaningful Error Rates," Proceedings of the NIST RT-04 Workshop, 2004.
Xiao Li, Jonathan Malkin, and Jeff A. Bilmes. "A Graphical Model Approach to Pitch Tracking". Proc. ICSLP, 2004.
ps.gz
pdf
Y. Liu, E. Shriberg, A. Stolcke, D. Hillard, M. Ostendorf, B. Peskin, M. Harper. "The ICSI-SRI-UW Metadata Extraction System". Proc. ICSLP, 2004.
D. Vergyri, K. Kirchhoff, K. Duh, A. Stolcke. "Morphology-Based Language Modeling for Arabic Speech Recognition". Proc. ICSLP, 2004.
pdf
Y. Liu, E. Shriberg, A. Stolcke, D. Hillard, M. Ostendorf, B. Peskin,
& M. Harper, "The ICSI-SRI-UW Metadata Extraction System," Proc. ICSLP, vol. I, pp. 577-580, 2004.
N. Mirghafori et al., "From Switchboard to Meetings: Development of the 2004 ICSI-SRI-UW Meeting Recognition System," in Proc. ICSLP, vol. III, pp. 1957-1960, October 2004.
Kevin Duh and Katrin Kirchhoff. "Automatic Learning of Language Model Structure". Proc. COLING, 2004, Geneva, Switzerland.
pdf
Dimitra Vergyri and Katrin Kirchhoff. "Automatic Diacritization of Arabic for Acoustic Modeling in Speech Recognition". Proc. COLING 2004 Workshop on Arabic-script Based Languages, 2004, Geneva, Switzerland.
Yi Li, Jeff A. Bilmes, and Linda G. Shapiro, "Object Class Recognition using Images of Abstract Regions," International Conference on Pattern Recognition, August 2004. Cambridge, UK
pdf
Mukund Narasimhan and Jeff A. Bilmes. "PAC-learning bounded tree-width Graphical Models". Proc. 20th Conference on Uncertainty in Artificial Intelligence (UAI), July 2004.
ps.gz
pdf
A. Stolcke et al., "Progress in Meeting Recognition: The ICSI-SRI-UW
Spring 2004 Evaluation System" Proc. NIST RT-04 Workshop, 2004.
John N. Gowdy, Amarnag Subramanya, Chris Bartels, and Jeff Bilmes.
"DBN-Based Multi-Stream Models for Audio-Visual Speech Recognition". Proc. ICASSP, May 2004. Montreal, Canada
ps.gz
pdf
Xiao Li, Jonathan Malkin, and Jeff Bilmes. "Codebook Design for ASR Systems using Custom Arithmetic units". Proc. ICASSP, May 2004. Montreal, Canada
ps.gz
pdf
Jonathan Malkin, Xiao Li, and Jeff Bilmes. "Custom Arithmetic for High-Speed, Low-Resource ASR Systems", Proc. ICASSP, May 2004. Montreal, Canada
ps.gz
pdf
K. Kirchhoff and D. Vergyri, "Cross-dialectal acoustic data sharing for Arabic speech recognition", Proc. ICASSP, 2004, Montreal, Canada
pdf
O. Cetin and M. Ostendorf, "Mult-rate hidden Markov models and their application to machining tool-wear classification," in Proc. ICASSP, vol. V, pp. 837-840, May 2004.
Constantinos Boulis, "Speaker Recognition with Mixtures of Gaussians with Sparse Regression Matrices," Proc. HLT-NAACL Student Research Workshop, 2004.
pdf
ps
bib
D. Hillard, M. Ostendorf, A. Stolcke, Y. Liu and E. Shriberg.
"Improving Automatic Sentence Boundary Detection with Confusion
Networks," Proc. HLT-NAACL, 2004.
pdf
ps
bib
Gang Ji and Jeffrey Bilmes. "Multi-Speaker Language Modeling". Proc. HLT-NAACL, 2004.
pdf
ps
bib
Jeremy G. Kahn, Mari Ostendorf and Ciprian Chelba. "Parsing Conversational Speech Using Enhanced Segmentation". Proc. HLT-NAACL, 2004.
pdf
ps
bib
Joungbum Kim, Sarah E Schwarm and Mari Ostendorf, "Detecting Structural
Metadata with Decision Trees and Transformation-Based Learning," in Proc. HLT-NAACL, 2004.
pdf
ps
bib
C. Boulis and M. Ostendorf, "Combining multiple clustering systems," in Proc. European Conference on Principles of Knowledge Discovery in Databases, pp. 63-74, 2004.
2003
M. Richardson, J. Bilmes, and C. Diorio Hidden-Articulator Markov Models for Speech Recognition, Speech Communications, 41(2), October 2003.
J. Bilmes, "Buried Markov Models: A Graphical-Modeling approach to Automatic Speech Recognition", Computer, Speech and Language, Volume 17, No 2-3, April-July, 2003
J. Bilmes and K. Kirchhoff, "Generalized rules for combination and joint training of classifiers", Pattern Analysis and Applications, 6(3), pp. 201-211, 2003
I. Shafran and M. Ostendorf, "Acoustic Model Clustering Based on Syllable Structure," Computer Speech and Language, vol. 17, no. 4, pp. 311-328, 2003.
Nock and M. Ostendorf, "Parameter reduction schemes for loosely coupled HMMs," Computer Speech and Language, vol. 17, no. 2-3, pp. 233-262, 2003.
R. Fish, M. Ostendorf, G. D. Bernard and D. Castanon, "Multilevel Classification of Milling Tool Wear with Confidence Estimation," IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 25, no. 1, pp. 75-85, 2003.
K. Kirchhoff and S. Schimmel, "Statistical Modelling of Infant-Directed vs. Adult-Directed Speech: Insights from Speech Recognition", Proceedings of the 146th Meeting of the ASA, 2003.
Xiao Li and Jeff Bilmes. "Feature Pruning in Likelihood Evaluation of HMM-based Speech Recognition", IEEE Automatic Speech Recognition and Understanding (ASRU), 2003. St. Thomas, U.S. Virgin Islands, Dec 2003.
pdf
Gang Ji and Jeff Bilmes. "Necessary Intransitive Likelihood-Ratio Classifiers", Neural Information Processing Systems (NIPS), Vancouver, Canada, Dec 2003
pdf
Karen Livescu, James Glass, and Jeff Bilmes. "Hidden Feature Models for Speech Recognition Using Dynamic Bayesian Networks", Proc. 8th European Conference on Speech Communication and Technology (Eurospeech), 2003. Geneva, Switzerland.
pdf
O. Cetin and M. Ostendorf, "Cross-stream Observation Dependencies for Multi-stream Speech Recognition," Proc. Eurospeech, pp. 2517-2520, September 2003.
J. Goldberg, M. Ostendorf and Katrin Kirchhoff, "The Impact of Response Wording on Error Correction Subdialogues", ISCA Workshop on Error Handling in Spoken Dialogue Systems, 2003
J. Bilmes and K. Kirchhoff, "Factored Language Models and Generalized Parallel Backoff", Proceedings of HLT/NAACL, Edmonton, Canada, May 2003.
pdf
Dustin Hillard, Mari Ostendorf, and Elizabeth Shriberg "Detection of
Agreement vs. Disagreement in Meetings: Training with Unlabeled Data". Proceedings of HLT/NAACL, pp. 34-36, Edmonton, Canada, May 2003.
pdf
I. Bulyko, M. Ostendorf and A. Stolcke. "Getting More Mileage from Web
Text Sources for Conversational Speech Language Modeling using
Class-Dependent Mixtures", Proceedings of HLT/NAACL, pp. 7-9, 2003.
pdf
S. Parandekar and K. Kirchhoff, "Multi-Stream Language Identification Using Data-driven Dependency Selection", Proceedings of ICASSP 2003 , Hong Kong
pdf
K. Kirchhoff, J. Bilmes, S. Das, N. Duta, M. Egan, G. Ji, F. He, J.
Henderson, D. Liu, M. Noamany, P. Schone, R. Schwartz and D. Vergyri,
"Novel Approaches to Arabic Speech Recognition: Report from the 2002
Johns-Hopkins Workshop", Proceedings of the International Conference on Acoustics, Speech and Signal Processing, Hong Kong, April 2003.
pdf
Yimin Zhang, Qian Diao, Shan Huang, Wei Hu, Chris Bartels, and Jeff Bilmes. "DBN Based Multi-Stream Models for Speech", IEEE Int. Conference on Acoustics, Speech, and Signal Processing, April 2003. Hong Kong, China
gzipped ps
pdf
Jeff Bilmes and Chris Bartels "On Triangulating Dynamic Graphical Models", 19th Conference on Uncertainty in Artificial Intelligence (UAI), 2003. Acapulco, Mexico.
gzipped ps
pdf
M. Ostendorf, I. Shafran and R. Bates, "Prosody models for conversational speech recognition," Proc. of the 2nd Plenary Meeting and Symposium on Prosody and Speech Processing, pp. 147-154, Feb. 2003.
2002
C. Boulis, M. Ostendorf, E. Riskin, and S. Otterson "Graceful
Degradation of Speech Recognition Performance over Packet-Erasure
Networks," IEEE Transactions on Speech and Audio Processing, vol. 10, no. 8, pp. 580-590, 2002.
I. Bulyko and M. Ostendorf. "Efficient Integrated Response Generation
from Multiple Targets using Weighted Finite-State Transducers", Computer Speech and Language, Vol. 16, No. 3/4, pp. 533-550, July 2002.
K. Kirchhoff, G.A. Fink and G. Sagerer. "Combining acoustic and
articulatory feature information for robust speech recognition", Speech Communication, May, 2002
Ozgur Cetin, Harriet Nock, Katrin Kirchhoff, Jeff Bilmes, and Mari Ostendorf. "The 2001 GMTK-Based SPINE ASR System", International Conference on Spoken Language Processing (ICSLP), vol. 2, 1037-1040, 2002.
pdf
Karim Filali, Xiao Li, and Jeff Bilmes. "Data-Driven Vector Clustering for Low-Memory Footprint ASR", International Conference on Spoken Language Processing (ICSLP) 2002, Denver, Colorado
pdf
Chia-Ping Chen, Karim Filali, and Jeff Bilmes. "Frontend
Post-Processing and Backend Model Enhancement on the Aurora 2.0/3.0
Databases", International Conference on Spoken Language Processing (ICSLP) 2002, Denver, Colorado
pdf
Chia-Ping Chen, Jeff Bilmes, and Katrin Kirchhoff. "Low-Resource Noise-Robust Feature Post-Processing on Aurora 2.0", International Conference on Spoken Language Processing (ICSLP) 2002, Denver, Colorado
pdf
R. Bates and M. Ostendorf, "Modeling Pronunciation Variation in Conversational Speech Using Prosody," Proc. ISCA Tutorial and Research Workshop on Pronunciation Modeling and Lexicon Adaptation for Spoken Language, September 2002.
M. Ostendorf and I. Bulyko. "The Impact of Speech Recognition on Speech Synthesis", invited paper, in Proceedings of the IEEE Workshop on Speech Synthesis, 2002.
pdf
I. Bulyko and M. Ostendorf. "A Bootstrapping Approach to Automating Prosodic Annotation for Constrained Domain Synthesis", in Proceedings of the IEEE Workshop on Speech Synthesis, 2002.
pdf
K. Kirchhoff, S. Parandekar and J. Bilmes, "Mixed-memory Markov Models for Automatic Language Identification", Proceedings of ICASSP 2002, Orlando, Florida
pdf
J. Bilmes and G. Zweig. "The Graphical Models Toolkit: An Open Source Software System for Speech and Time-Series Processing", IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, June 2002. Orlando Florida.
pdf
G. Zweig, J. Bilmes, T. Richardson, K. Filali, K. Livescu, P. Xu, K.
Jackson, Y. Brandman, E. Sandness, E. Holtz, J. Torres, B. Byrne.
"Structurally Discriminative Graphical Models for Automatic Speech
Recognition: Results from the 2001 Johns Hopkins Summer Workshop", IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, June 2002. Orlando Florida
pdf
I. Bulyko, M. Ostendorf, and J. Bilmes "Robust Splicing Costs and
Efficient Search with BMM Models for Concatenative Speech Synthesis". Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, 1:461-464, May 2002.
pdf
Sarah Schwarm and Mari Ostendorf, "Text Normalization with Varied Data
Sources for Conversational Speech Language Modeling," In Proceedings of the International Conference on Acoustic, Speech and Signal Processing, vol. I, pp. 789-792, May 2002.
pdf
2001
R. Sproat, A. Black, S. Chen, S. Kumar, M. Ostendorf and C. Richards, "Normalization of Non-Standard Words," Computer Speech and Language, vol. 15, no. 3, pp. 287-333, July 2001.
J. Bilmes, G. Ji, and M. Meila "Intransitive Likelihood-Radio Classifiers", NIPS'2001, Vancouver Canada, Dec 2001.
pdf
I. Shafran, M. Ostendorf, and R. Wright, "Prosody and phonetic
variability: lessons learned from acoustic model clustering," in Proc. of the ISCA Workshop on Prosody in Speech Recognition and Understanding, pp. 127-131, October 2001.
R. Bates and M. Ostendorf, "Reducing the Effects of Pronunciation
Variability on Spontaneous Speech Recognition using Prosody and
Discourse," in Proc. of the ISCA Workshop on Prosody in Speech Recognition and Understanding, pp. 17-22, October 2001.
M. Ostendorf, I. Shafran, S. Shattuck-Hufnagel, B. Byrne and L.
Carmichael, "A prosodically labeled database of spontaneous speech," in
Proc. of the ISCA Workshop on Prosody in Speech Recognition and Understanding, pp. 119-121, October 2001.
K. Kirchhoff and S. Parandekar, "Multi-stream statistical N-gram
modeling with application to automatic language identification", Proceedings of Eurospeech 2001, Aalborg, Denmark, September 2001
pdf
I. Bulyko and M. Ostendorf. "Unit Selection for Speech Synthesis Using
Splicing Costs with Weighted Finite State Transducers", In Proceedings of Eurospeech, 2:987-990, 2001.
pdf
D. Palmer and M. Ostendorf, "Improved Word Confidence Estimation using Long Range Features," in Proc. of Eurospeech, pp. 2117-2120, September 2001.
E. Riskin, C. Boulis, S. Otterson and M. Ostendorf, "Graceful
Degradation of Speech Recognition Performance Over Lossy Packet
Networks," in Proc. of Eurospeech, pp. 2715-2718, September 2001.
K. Kirchhoff, "A Comparison of Classification Techniques for the
Automatic Detection of Error Corrections in Human-Computer Dialogues", Proceedings of the NAACL Workshop on Adaptation in Dialogue Systems, Pittsburgh, PA, June 2001.
I. Bulyko and M. Ostendorf. "Joint Prosody Prediction and Unit Selection for Concatenative Speech Synthesis", In Proc. of the International Conference on Acoustics, Speech and Signal Processing, 2:781-784, 2001.
pdf
M. Ostendorf, L. Atlas, R. Fish, O. Cetin, S. Sukittanon, and G. D. Bernard, "Joint Use of Dynamical Classifiers and Ambiguity Plane Features," in Proc. of the International Conference on Acoustics, Speech and Signal Processing, vol. VI, pp. 3589-3592, 2001.
D. Palmer and M. Ostendorf, "Improving Information Extraction by Modeling Errors in ASR Output," in Proc. of the Human Language Technology Workshop, pp. 156-160, March 2001.
2000
Mari Ostendorf, "Incorporating linguistic theories of phonological variation into speech recognition models," Phil. Trans. Royal Society, vol. 358, no. 1769, pp. 1325-1338, 2000.
David Palmer and Mari Ostendorf, "Robust information extraction from automatically generated speech transcriptions," Speech Communication, vol. 32, pp. 95-109, 2000.
M. H. Siu and M. Ostendorf, "Variable N-grams and Extensions for Conversational Speech Language Modeling," in IEEE Transactions on Speech and Audio Processing, vol. 8, no. 1, pp. 63-75, 2000.
K. Kirchhoff and J. Bilmes. "Combination and Joint Training of Acoustic
Classifiers for Speech Recognition", Proceedings of ASR 2000, Paris,
France, 2000
J. Bilmes and K. Kirchhoff. "Directed Graphical Models of Classifier Combination: Application to Phone Recognition", Proceedings of ICSLP, Beijing, China, 2000
D. Ellis and J. Bilmes, "Using Mutual Information to Design Feature Combinations," Proc. International Conference on Spoken Language Processing, Beijing, October 2000.
pdf
K. Kirchhoff. "Speech Analysis by Rule Extraction from Trained Artificial Neural Networks", Proceedings of International Conference on Spoken Language Processing, Beijing, 2000
J. Bilmes. "Dynamic Bayesian Multi-Networks", The 16th Conference on Uncertainty in Artificial Intelligence, Stanford, July 2000.
pdf
K. Kirchhoff, G.A. Fink and G. Sagerer, "Conversational Speech
Recognition Using Acoustic and Articulatory Input", ICASSP 2000,
Istanbul, Turkey, June 2000
ps
J. Bilmes, "Factored Sparse Inverse Covariance Matrices," Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, June 2000.
pdf
Izhak Shafran and Mari Ostendorf, "Use of higher level linguistic structure in acoustic modeling for speech recognition," in Proc. ICASSP, vol. III, pp. 1643-1646, 2000.
Manhung Siu and Mari Ostendorf, "Integrating a context-dependent phrase grammar in the variable n-gram framework," in Proc. ICASSP, vol. II, pp. 1021-1024, 2000.
Les Atlas, Mari Ostendorf, and Gary Bernard, "Hidden Markov models for monitoring machining tool-wear," in Proc. ICASSP, vol. VI, pp. 3887-3890, 2000.
K. Kirchhoff, "Integrating Articulatory Features into Acoustic Models
for Speech Recognition", Workshop PhonASR, Saarbruecken, Germany, May
2000
1999
Mari Ostendorf, "Moving beyond the `beads-on-a-string' model of speech," in Proc. IEEE ASRU Workshop, 1999.
David Palmer, Mari Ostendorf and John Burger, "Robust information extraction from spoken language data," in Proc. Eurospeech, 1999, pp. 1035-1038.
Ivan Bulyko and Mari Ostendorf, "Predicting Gradient F0 Variation: Pitch Range and Accent Prominence," in Proc. Eurospeech, 1999, pp. 1819-1822.
K. Kirchhoff and J. Bilmes. "Statistical Acoustic Indications of Coarticulation", Proceedings 14th International Congress of Phonetic Sciences, San Francisco, USA, August 1999
ps
I. Bulyko, M. Ostendorf and P. J. Price, "On the Relative Importance of
Different Prosodic Factors for Improving Speech Synthesis," in XIVth International Congress of Phonetic Sciences, 1999, pp. 81-84.
K. Kirchhoff and J. Bilmes. "Dynamic Classifier Combination in Hybrid
Speech Recognition Systems using Utterance-Level Confidence Values",
Proceedings IEEE International Conference on Acoustics, Speech, and
Signal Processing, Phoenix, USA, March 1999
ps
J. Bilmes. "Buried Markov Models for Speech Recognition," Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, March 1999.
pdf
Edited Books
T. Schultz and K. Kirchhoff (eds.), Multilingual Speech Processing , Elsevier, 2006
M. Johnson, S. Khudanpur, M. Ostendorf and R. Rosenfeld (eds.),
"Mathematical Foundations of Speech and Language Processing", Institute
of Mathematical Analysis Volumes in Mathematics Series, vol. 138,
Springer-Verlag, 2004.
Graduate Student Theses
Jeremy G. Kahn, Linguistics M.A. 2005,
Constantinos Boulis, EE Ph.D. 2005,
Özgür Çetin, EE Ph.D. 2004,
Chia-Ping Chen, EE Ph.D. 2004,
Joungbum Kim, EE M.S. 2004,
-
Automatic Detection of Sentence Boundaries, Disfluencies, and Conversational Fillers in Spontaneous Speech
Sonia Parandekar, EE M.S. 2003,
-
Feature-based Language Identification Using Data-Driven Model Selection
Rebecca Bates, EE Ph.D. 2003,
-
Speaker Dynamics as a Source of Pronunciation Variability for Continuous Speech Recognition Models
Ivan Bulyko, EE Ph.D. 2002,
Izhak Shafran, EE Ph.D. 2001,
Randall K. Fish, EE Ph.D. 2001,
David Palmer, EE Ph.D. 2001,
Technical Reports
2006
Jeff Bilmes, Marina Meila, "Intransitive Classification and Choice", UW EE Technical Report UWEETR-2006-0021, 2006
pdf
Mei-Yuh Hwang, Xin Lei, Tim Ng, Mari Ostendorf, Andreas Stolcke, Wen
Wang, Jing Zheng and Venkata Ramana Rao Gadde, UW EE Technical Report
UWEETR-2006-0013, 2006.
pdf
Kevin Duh and Katrin Kirchhoff, "Lexicon Acquisition for Resource-Poor
Languages using Transductive Learning", UWEETR-2006-0012,
pdf
Andrei Alexandrescu and Katrin Kirchhoff. "Factored Neural Language Models", UW EE Technical Report UWEETR-2006-0014, 2006.
abstract
pdf
ps.gz
BibTeX
Jonathan Malkin, Neil Lawrence, Jeff Bilmes, "The GP-LVM for Vocal
Joystick Control," University of Washington, Department of Electrical
Engineering, Technical Report UWEETR-2006-0016, Oct. 2006
pdf
Sarah E. Petersen and Mari Ostendorf. "A Machine Learning Approach to
Reading Level Assessment." University of Washington CSE Technical
Report 2006-06-06.
pdf
2005
Darby Wong, Mari Ostendorf and Jeremy G. Kahn, "Using Weakly Supervised
Learning to Improve Prosody Labeling", UWEETR-2005-0003, January 2005
pdf
2004
Xiao Li, Jonathan Malkin, Jeff Bilmes, "High-speed, Low-Resource ASR
Back-end Based on Custom Arithmetic", UWEETR-2004-0019, June 2004
pdf
Mukund Narasimhan, Jeff Bilmes, "Optimization on Seperator Trees", UWEETR-2004-0018
pdf
Chia-ping Chen, Jeff Bilmes, Dan Ellis, "Blind MVA Speech Feature Processing on Aurora", UWEETR-2004-0017, June 2004
pdf
Kevin Duh, Katrin Kirchhoff, "Automatic Learning of Language Model", UWEETR-2004-0014, April 2004
pdf
O. Cetin, M. Ostendorf and G. Bernard, "Multi-rate hidden Markov
models for monitoring of machining toolwear", UWEETR-2004-0011, April
2004
pdf
Chris Bartels and Jeff Bilmes. "Elimination is Not Enough: Non-Minimal
Triangulations for Graphical Models", UWEETR-2004-0010, June 2004
pdf
Mukund Narasimhan, Jeff Bilmes. "Efficient PAC-learning bounded tree-width Graphical Models", UWEETR-2004-0009, March 2004
pdf
2003
Chia-ping Chen, Jeff Bilmes, "MVA Processing of Speech Features", UWEETR-2003-0024, November 2003
pdf
Karim Filali, Xiao Li and Jeff Bilmes, "Algorithms for Data-Driven ASR Parameter Quantization", UWEETR-2003-0010, June 2003
pdf
Xiao Li, Jeff Bilmes, "Selectively Computing Dynamic Features in the
Likelihood Computation of ASR Systems", UWEETR-2003-0009, May 2003
pdf
Jeff Bilmes, Chris Bartels, "On Triangulating Dynamic Graphical Models", UWEETR-2003-0007, May 2003
pdf
David Palmer, Mari Ostendorf, "Improving Out-of-Vocabulary Name", UWEETR-2003-0005, March 2003
pdf
Ivan Bulyko, Mari Ostendorf, Andreas Stolcke, "Class-dependent
Interpolation for Estimating Language Models from Multiple Text
Sources", UWEETR-2003-0003, March 2003
pdf
2002
Constantinos Boulis, Jeffrey Bilmes, "Mixtures of Gaussians with Sparse Regression Matrices", UWEETR-2002-0017
pdf
Gang Ji, Jeff Bilmes, "Necessary Intransitive Likelihood Ratio Classifiers", UWEETR-2002-0014
pdf
Jeff Bilmes, "What HMMs Can Do", UWEETR-2002-0003
pdf
Chia-Ping Chen, Katrin Kirchhoff, Jeff Bilmes, "Towards Simple Methods of Noise-Robustness", UWEETR-2002-0002
pdf
2001
Jeff Bilmes, Geoff Zweig, Thomas Richardson, Karim Filali, Karen
Livescu, Peng Xu, Kirk Jackson, Yigal Brandman, Eric Sandness, Eva
Holtz, Jerry Torres, Bill Byrne, "Discriminatively Structured Graphical
Models for Speech Recognition", UWEETR-2001-0006
pdf
Jeff Bilmes, "Graphical Models and Automatic Speech Recognition", UWEETR-2001-0005
pdf
Costas Boulis, Mari Ostendorf, Eve A. Riskin, and Scott Otterson,
"Graceful Degradation of Speech Recognition Performance over Lossy
Packet Networks", UWEETR-2001-0003
pdf