EE512 - Graphical Models - Fall Quarter 2009
Instructor:
Prof. Jeff A. Bilmes --- Email meOffice: 418 EE/CS Bldg., +1 206 221 5236
Office hours: Mondays 11:00am-12:00pm, EEB-418
Announcements
- (Dec 17th) Reminder: final project presentations tomorrow (2:30) in our normal room. Each presentation should be no more than 10 minutes long.
- (Dec 11th) Don't forget about project status reports, due Dec 10th.
- (Dec 4th) lecture 18 slides now ready.
- (Dec 1st) lecture 17 slides now ready.
- (Nov 24th) lecture 16 slides now ready.
- (Nov 24th) Read chapters 4 and 5 in Wainwright/Jordan book
- (Nov 22nd) lecture 15 slides now ready.
- (Nov 17th) lecture 14 slides now ready.
- (Nov 15th) lecture 13 slides now ready.
- (Nov 11th) Web page for EE595A Dynamic Graphical Models is available.
- (Nov 11th) lecture 12 slides now ready.
- (Nov 9th) lecture 11 slides now ready.
- (Nov 4th) lecture 10 slides now ready.
- (Nov 4th) lecture 9 slides now ready.
- (Oct 28nd) lecture 8 slides now ready.
- (Oct 26th) 8:19pm. Typo on problem 5 in HW1 is now fixed (please look at latest version).
- (Oct 22nd) New readings on evidence now ready, location on the bboard.
- (Oct 22nd) lecture 7 slides now ready.
- (Oct 22nd) Problem set 1 is now ready. Due Friday, 10/30 5:00pm by sending PDF to me by email.
- (Oct 21st) slides from L6 now available below.
- (Oct 15th) slides from L5 now available below.
- (Oct 14th) slides from L4 now available below.
- (Oct 11th) Next chapter readings are available (see the discussion board)
- (Oct 11th) This wednesday's CSS seminar at 12:30pm is about Bayesian networks, see here
- (Oct 6th) Lecture slides for lecture 2 are available below.
- (Oct 4th) The first readings are availabe. Sign on to the discussion board to find out where they are.
- (Oct 1st) Where the the reading is, and how to get it, will appear here either late tonight or early tomorrow morning.
- (Sep 18th) Welcome to the class!
Information
Course Format: Four hours of lecture per week (TTh 1:30-3:20 EEB-054) per week.
Description: This course will serve as a thorough overview of graphical models including Bayesian networks, undirected graphical models, and factor graphs.
- Introduction and background to statistical inference. Independence, conditional independence, and factorization properties. Axioms of independence and independence systems. Consequences of independence in computations. Families of models and their characterization and operations on families.
- Undirected graphical models, their families and semantics. corresponding conditional independence properties. Markov properties on undirected graphs, factorization, Hammersley-Clifford theorems, inclusion-exclusion, Markov Random fields, Gibbs distributions, Boltzman distributions, multi-variate Gaussians as undirected models. Static (non-dynamic) conditional random fields, and graphs for representing conditional distributions in general.
- Chordal Graph Theory: moralization; triangulated, decomposable, and intersection graphs, $k$-trees, hyper-graphs. Tree-width and path-width parameters of a graph. Graph separators and their crossings. Relational data-base systems and joins, and phylogenetic tree construction.
- Factor graphs, their families and semantics. Corresponding properties and algorithms
- directed graphical models and Bayesian networks and their families, properties, factorization and conditional independence (e.g., d-separation). Markov properties for directed models. Markov property equivalences.
- Directed models: linear Gaussian models, mixture models, factor analysis, probabilistic decision trees, multivariate Gaussians as directed graphical models.
- Exact Probabilistic Inference: inference on trees, junction trees, belief propagation in its various forms (including Pearl's formulation, Hugin, Shafer-Shenoy, Bucket-elimination, etc.); join-trees and data-structures; optimal triangulations. The elimination family of algorithms. Generalized distributed law (GDL). Relation to dynamic programming. Generality (such as Viterbi, MPE, the fast Fourier transform). NP hardness results, and NP hardness results for finding the best form.
- Inference as search: Search-based approaches to inference and how it relates to junction trees, message passing, and BP. Results from the constraint satisfaction (CSP) and SAT literatures, constraint solvers, and heuristics used in these domains. Cutset and recursive conditioning, comparison with elimination. Branch-and-bound.
- Approximate Probabilistic Inference: loopy belief propagation (BP), expectation propagation (EP), generalized belief propagation (GBP), cluster GBP, comparisons with mean-field and variational approaches, structured mean-field, statistical physics, Bethe and Kikuchi free-energies, fixed points, recent theoretical results and open questions.
- sampling approaches to inference, such as MCMC and particle algorithms (such as Rao-Blackwellized particle filtering). Pruning based approaches to inference. How the above approximate inference procedures relate
- Learning graphical models: learning Bayesian networks, hardness results for learning networks in the maximum likelihood sense, learning in the PAC (probably approximately correct) framework, EM for parameter and structure learning, alternating minimization, discriminative parameter and structure learning, conditional (CRF) training vs. discriminative training methods (discriminative objective functions), other learning methods. Information theory and graphical models.
- Models in practice: Real-world static graphs. turbo-coding, low-density parity check codes, other codes on graphs, belief propagation algorithms on these graphs. Practical issues such as data structures and algorithms useful for performing inference.
Note that Fall 2009 will concentrate on static graphical models, while in Winter 2009, there will be a new course (EE599) offered Winter 2010 on dynamic graphical models (which includes Hidden Markov models, dynamic Bayesian networks, conditional random fields, Kalman filtering, linear dynamic systems, and dynamic graphical models).
Prerequisites: basic probability, statistics, and random processes (e.g., EE505 or a Stat 5xx class or consent of the instructor). Knowledge of matlab. The course is open to students in all UW departments. If you are in doubt about taking this course, please
Texts: We will mainly use written material that will be made available on the web page, as well as other printed and online material. Please take a look at the web page for more information. This will include a text entitled "An Introduction to Graphical Models", by Mike Jordan. We will also use a recent text by Daphne Koller and Nir Friedman.
Grades and Assignments: Grades will be based on a combination of a final project (35%), homeworks (35%), and the take home midterm exam (30%). There will be a new homework assignment due about every 1-2n weeks (for a total of about 4-5 homeworks).
Final project: The final project will consist of a 4-page paper (conference style) and a 10 minute final project presentation. The project must involve using or dealing mathematically with graphical models in some way or another. For their application to data, there are a number of software tools (such as BNT or GMTK) which can be used for this purpose, or you may wish to develop your own system. Projects can be done in groups of no more than two people (i.e., projects may be either 1 or two people, but no more).
Homework
There will be 4-5 homeworks for the quarter. Each will be due about one week after it is assigned. You can either hand the homework to me in person, or email me the PDF version.- Problem set 1 Due Friday, 10/30 5:00pm by sending PDF to me by email.
Lecture Slides
Lecture slides will be made available as they are being prepared --- they will probably appear here right before a given lecture, and they will be in PDF format (original source is latex). You might also find it useful to look at slides from the lectures the previous time this course was offered, although this year these slides are beeing completely revised, to account for new material that is being given this year.- Lecture 1 from 10/1/09. (last updated 10/4)
- Lecture 2 from 10/6/09. (last updated 10/7)
- Lecture 3 from 10/8/09. (last updated 10/13)
- Lecture 4 from 10/13/09. (last updated 10/14)
- Lecture 5 from 10/15/09. (last updated 10/15)
- Lecture 6 from 10/20/09. (last updated 10/21)
- Lecture 7 from 10/22/09. (last updated 10/22)
- Lecture 8 from 10/27/09. (last updated 10/28)
- Lecture 9 from 10/29/09. (last updated 11/04)
- Lecture 10 from 11/03/09. (last updated 11/03)
- Lecture 11 from 11/05/09. (last updated 11/09)
- Lecture 12 from 11/10/09. (last updated 11/11)
- Lecture 13 from 11/12/09. (last updated 11/17)
- Lecture 14 from 11/17/09. (last updated 11/17)
- Lecture 15 from 11/19/09. (last updated 11/22)
- Lecture 16 from 11/24/09. (last updated 11/24)
- Lecture 17 from 12/01/09. (last updated 12/01)
- Lecture 18 from 12/03/09. (last updated 12/04)
Discussion Board
You can post questions, discussion topics, or general information at this link.Books and Links
There are many books available that discuss the material that we are covering in this course. A few of the ones that I use regularly, and a few other web links that I think are useful, are included below.- "An Introduction to Bayesian Networks", F.V. Jensen, 1996. A good general introduction to Bayesian networks (out of print, but available in the library).
- "Bayesian Networks and Decision Graphs", F.V. Jensen, 2001. Another good general introduction to Bayesian networks (not out of print).
- "Graphical Models", S.L. Lauritzen. Oxford, 1996. A very complete, theoretically precise, but dense text, authored by one of the field's leading authorities.
- "Probabilistic Networks and Expert Systems", R.G. Cowell, A.P. Dawid, S.L. Lauritzen, and D.J. Spiegelhalter. 1999. Similar to the previous text, but includes more material on inference, applications, and other general problems.
- "Artificial Intelligence: A Modern Approach: 2nd Edition", S. Russel and P. Norvig, 2003. Has a nice introductory chapter on Bayesian networks.
- "Learning in Graphical Models", Ed. by M.I. Jordan. An excellent collection of recent research papers compiled by Mike Jordan, one of the leading experts in this field.
- "Probabilistic Reasoning in Intelligent Systems", J. Pearl. 1998. A classic early text by one of the founders of the field. Pearl is credited with inventing the term "Bayesian networks".
- "Causality", J. Pearl. 2000. A relatively newer text by Pearl specifically on causality, and causal modeling. A second edition of this book is out in 2009.
- "Pattern Classification", R. Duda, P. Hart and D. Stork (the text used for 596I). The original text (published in 1973) is still widely read.
- "The Elements of Statistical Learning: Data Mining, Inference, and Prediction", Hastie, Tibshirani, and Friedman. There is a 2nd edition that came out in 2008.
- "Neural Networks for Pattern Recognition", by C. Bishop, 1996. (available now in the UW bookstore). This book mainly contains background material, but has become a classic text in pattern recognition even though "neural networks" is in the title, and is worth reading if you plan to do any work at all in pattern recognition.
- Another book by C. Bishop that came out in 2006 is "Pattern Recognition and Machine Learning", which is here. This book also contains a nice overview chapter on graphical models and Bayesian networks.
Important Dates
- Wednesday, November 11th, Veterans Day
- Thursday/Friday, November 26-27th, Thanksgiving
- TBD, Final Project Presentations