University of Washington
Department of Electrical Engineering
Speech and Signal Processing Seminar
Spring Quarter, 2002
RM EE403
New EE Bldg
(unless otherwise noted)
University of Washington, Seattle
11 April 2002 (EE1 403, 1PM)
Language Modeling for Spontaneous Speech
-- Giuseppe (Beppe) Riccardi, AT&T Labs-Research
Abstract
Spontaneous speech poses many challenges to the speech
recognition and understanding problem.
In this talk we will address the language modeling issues for speech
recognition and understanding for large vocabulary spoken dialog
systems.
We will show how finite state representation and stochastic modeling
provide rich tools to model different language models: from n-grams,
to word phrases, phrase grammars and head-dependecy structures.
Phrase-based models are powerful in that they enhance
the traditional n-gram model and allow for a tight integration with the
understanding features ("Recognition for Understanding").
We will also present our latest results on automatically learned
head-dependency grammars and speech disfluency-based language models.
We show that people's responses to computer prompts vary over time and
(dialog) state and we propose a framework to track time-varying
statistical parameters of a spoken dialog system.
The set of algorithms presented will be evaluated over the 100K
human-machine dialogs for different ( in time and topic ) "How May I
Help You?" speech databases.
2 April 2002 (EE1 403, 11AM)
Modelling Sequential Data with Dynamic Bayesian Networks:
New models, applications and algorithms
-- Kevin Murphy, U.C. Berkeley
Abstract
Modelling sequential data is important in many areas of science and
engineering. Hidden Markov models (HMMs) are popular for this because
they are simple and flexible. Dynamic Bayesian Networks (DBNs)
generalize HMMs by allowing the state space to be represented in
factored form. This gives us the ability to represent much richer
models. In this talk, I will discuss the following topics, which
illustrate the versatility and difficulties involved with using DBNs.
- We show how to represent hierarchical HMMs as DBNs, and thereby
speedup inference from O(T^3) to O(T) time, where T is the length of the
sequence. We then apply HHMMs to the problem of learning hierarchical
models of behavior from some people-tracking data.
- Time-space tradeoffs for exact inference in DBNs. We show how to
reduce the space requirements from O(T) to O(log T), if we increase
running time by a O(log T) factor. This allows us to learn DBNs from
very long sequences of data.
- Rao-Blackwellised particle filtering (RBPF) for approximate
inference in DBNs. RBPF combines exact inference with sequential
importance sampling. We illustrate RBPF by applying it to the problem
of simultaneous localization and mapping (SLAM) for a simulated mobile
robot.
Past Quarter's Seminars
Last updated ($Date: 2002/08/26 22:38:43 $)