This group continues our quest to become experts in machine translation (MT) by reviewing and discussing both classic (and sometimes out-of-date) and recent techniques for MT, statistical and non-statistical alike. We will also cover other relevant papers from computational linguistics and machine learning. Meetings are informal, and any time remaining after a paper review is open for creative discussion. See below for upcoming calls for papers and related links.
The group is part of the SSLI Lab. The Signal, Speech, and Language Interpretation (SSLI) Laboratory at the University of Washington, Department of Electrical Engineering, conducts research on all methods of working with time signals, in particular speech and language, but also other forms of such signals.
To receive announcements for this group, send mail to
katrin@ee.washington.edu and/or
If you would like to lead a discussion (and we encourage you to volunteer), please email. While the list of papers below will give you something to choose from, you are encouraged to suggest other relevant papers in this area.
In the Fall 2005 quarter, we will meet every week in room Sieg-424 on Wednesdays from 3:30 to 5:30pm.
| Topic | Readings | Date/Time/Location | Discussion leader | Slides | Notes |
| Word Alignment | B. Taskar, S. Lacoste-Julien and D. Klein, A Discriminative Matching Approach to Word Alignment | Wed, Oct 19th, 3:30-5:30 Sieg-424 | Takahiro Shinozaki | slides | notes |
| Translation Model | D. Chiang, A Hierarchical Phrase-Based Model for Statistical Machine Translation | Wed, Oct 26th, 3:30-5:30 Sieg-424 | Karim Filali | - | - |
| Reordering | S. Kanthak et al. Novel reordering approaches in phrase-based statistical machine translation | Wed, Nov 2nd, 3:30-5:30 Sieg-424 | Katrin Kirchhoff | slides | notes |
| Example-based MT | Lavie et al. A Trainable Transfer-based Machine Translation Approach for Languages with Limited Resources | Wed, Nov 9th, 3:30-5:30 Sieg-424 | Marcus Sammer and Ethan Phelps | slides | notes |
| Reordering | Michael Collins, Philipp Koehn, and Ivona Kucerova. Clause Restructuring for Statistical Machine Translation | Wed, Nov 16th, 3:30-5:30 Sieg-424 | Sarah Schwarm | - | - |
| - | - | Wed, Nov 23rd, 3:30-5:30 Sieg-424 | - | - | - |
| - | - | Wed, Nov 30th, 3:30-5:30 Sieg-424 | - | - | - |
| - | - | Wed, Dec 7th, 3:30-5:30 Sieg-424 | - | - | - |
| Topic | Readings | Date/Time | Location | Discussion leader | Notes |
| Example based MT | this (from ACL91) and that (from ACL96) (paper 1 and 2 below) | Thursday, April 21st, 2005 9:00am | Sieg-424 | Kevin Duh | slides for today. |
| Algorithms for Syntax-Aware Statistical Machine Translation | link (or see Melamed's #14 below) | Thursday, April 28th, 2005 9:00am | Sieg-424 | Jeremy Kahn | slides |
| PCFGs and the Inside/Outside Algorithm | Charniak, "Statistical Language Learning", Chapters 5-7. (see email for reading). | Thursday, May 12th, 2005 9:00am | Sieg-424 | Jeff Bilmes | slides |
| Continuation of last week | - | Thursday, May 19th, 2005 9:00am | Sieg-424 | Jeff Bilmes | - |
| UW/ISI meeting | UW/ISI meeting | Thursday, May 26th, 2005 9:00am | Sieg-424 | UW/ISI meeting | - |
| Machine Translation with Inferred Stochastic Finite-State Transducers | off-campus link and on-campus link | Thursday, June 2nd, 2005 9:00am | Sieg-424 | Sarah Schwarm | PDF slides |
| Topic | Readings | Date/Time | Location | Discussion leader | Notes |
| The Web as a Parallel Corpus, by Resnik and Smith | link | Thursday, March 3rd, 2005 12:00pm | Sieg-424 | Kevin Duh | - |
| CANCELLED | CANCELLED | Thursday, Feb 24th, 2005 12:00pm | Sieg-424 | Mari Ostendorf/Sarah Schwarm | - |
| Phrase pair rescoring with term weightings for statistical machine translation, by B. Zhao, S. Vogel and A. Waibel | link | Thursday, Feb 17th, 2005 12:00pm | Sieg-424 | Katrin Kirchhoff | notes from today. |
| Towards MRS-Based Norwegian-English MT, Oepen et al. from TMI 2004. | link | Thursday, Feb 10th, 2005 12:00pm | Sieg-424 | Emily Bender | notes from today. |
| "Improving IBM Word-alignment Model 1" by Robert Moore | link | Thursday, Feb 3rd, 2005 12:00pm | Sieg-424 | Karim Filali | - |
| "Statistical Machine Translation with Scarce Resources Using Morpho-syntactic Information" by Sonja Niessen and Hermann Ney | link (you might need to be on campus to get this, or see here or here or here for alternatives). | Thursday, Jan 27, 2005 12:00pm | Sieg-424 | Jeremy Kahn | slides |
| Topic | Readings | Date/Time | Location | Discussion leader | Notes |
| ORANGE: a Method for Evaluating Automatic Evaluation Metrics for Machine Translation | link paper from Coling'04 (# 7 below) | Wed, Oct 20, 2004 3:30pm | Sieg-424 | Jeff Bilmes | PPT slides from today. notes from today. |
| Reordering Constraints for Phrase-based Statistical MT | link (# 12 below) | Wed, Oct 27, 2004 3:30pm | Sieg-424 | Karim Filali | notes from today. slides from today. |
| Improving a Statistical MT System with Automatically Learned Rewrite Patterns (# 13 below) | ssli-local link, external link | Wed, Nov 3rd, 2004 3:30pm | Sieg-424 | Kevin Duh | ppt slides from today's group. |
| Language Model Adaptation for Statistical Machine Translation via Structured Query Models | link or (# 2 below) | Wed, Nov 10th, 2004 3:30pm | Sieg-424 | Sarah Schwarm | pdf slides from today. |
| Confidence Estimation for Machine Translation, Blatz et al. Coling'04. (# 10 below) | link | Wed, Nov 17th, 2004 3:30pm | Sieg-424 | Takahiro Shinozaki | ppt slides from today. |
| MT as object in image recognition | Object Recognition as Machine Translation, ... pdf link or here (# 11 below) | Wed, Nov 24th, 2004 3:30pm | Sieg-424 | Katrin Kirchhoff | notes and slides from today. |
| POSTPONED | - | Wed, Dec 1st, 2004 3:30pm | Sieg-424 | - | - |
| Review of the July 2004 DARPA TIDES meeting. | reading material for today. | Wed, Dec 8th, 2004 3:30pm | Sieg-424 | Mari Ostendorf | - |
| Topic | Readings | Date/Time | Location | Discussion leader | Notes |
| Translation Templates for MT | link (# 11 below) | April 12th, 2004 3:00pm | AE-108 | Jeremy Kahn | notes |
| - | Postponed (thesis conflicts) | April 19th, 2004 3:00pm | - | - | - |
| - | Postponed (more thesis conflicts) | April 26th, 2004 3:00pm | - | - | - |
| - | - | May 3rd, 2004 3:00pm | No meeting HLT-NAACL'04 | - | - |
| Minimal Recursion Semantics | link1 and link2 | May 10th, 2004 3:00pm | MEB 251 | Emily Bender | notes |
| TBD | - | May 17th, 2004 3:00pm | No meeting ICASSP | - | - |
| "Improved machine translation performance via parallel sentence extraction from comparable corpora," by D. Munteanu, A. Fraser and D. Marcu, from HLT-NAACL04 | link | May 24th, 2004 3:00pm | MEB 251 | Mari Ostendorf | - |
| TBD | - | May 31st, 2004 3:00pm | MEB 251 | Katrin Kirchhoff | - |
| TBD | - | June 7th, 2004 3:00pm | MEB 251 | Gang Ji | Last day of quarter |
| Next in order: | - | - | - | Karim Filali, Kevin, Taka, Sarah | - |
| Topic | Readings | Date/Time | Location | Discussion leader | Notes |
| Word clustering | brown class-based | Jan 15th, 2004 1:00pm | AE-108 | Jeff Bilmes | notes |
| Two papers on phrase-based translation | Koehn et al. (#3) and Marcu and Wong (#4) below | Jan 22nd, 2004 1:00pm | AE-108 | Sarah Schwarm | discussion notes (updated Jan 23, 2004) |
| Syntax-based translation | Yamada and Knight, paper #1 below | Jan 29th, 2004 1:00pm | AE-108 | Kevin Duh | discussion notes (updated Jan 30, 2004) |
| Tree-based alignment | D. Gildea (paper #2 below) | Feb 5th, 2004 1:00pm | AE-108 | Katrin Kirchhoff | discussion notes (updated Feb 9, 2004) |
| More on MT Evaluation using string-to-string distance | Paper by Leusch, Ueffing, and Ney link (#6 below) | Feb 12th, 2004 1:00pm | AE-108 | Franz Pernkopf | discussion notes (updated Feb 12, 2004) |
| Two papers on dynamic programming based search. | For Statistical Machine Translation and Using Monotone Alignments in Statistical Translation (links #7 and #8 below) | Feb 19th, 2004 1:00pm | AE-108 | Kwong Tim Ng | - |
| No Meeting due to UW-ISI Kickoff meeting in LA. | - | Feb 26th, 2004 1:00pm | AE-108 | - | - |
| Learning Dependency Transduction Models | Paper by Alshawi and Douglas, pdf link (Paper #8 below) | March 4th, 2004 1:00pm | AE-108 | Karim Filali | notes and pdf_notes |
| Language Modeling Day | "Statistical Language Modeling based on Variable-Length Sequences", Computer Speech and Language, (17)27-41, 2003. link | March 11th, 2004 1:00pm | AE-108 | Ivan Bulyko | Last meeting of the quarter. |
| TBD | - | March 18th, 2004 1:00pm | AE-108 | Jeremy Kahn | - |
| TBD | - | March 25th, 2004 1:00pm | AE-108 | Mari Ostendorf | - |
| TBD | - | April 1st, 2004 1:00pm | AE-108 | Emily Bender | - |
| Topic | Readings | Date/Time | Location | Discussion leader | Notes |
| Introduction, Overview | K. Knight tutorial | Oct 13, 2003, 3:30pm | EE1-M306 | Jeff Bilmes | - |
| The mathematics of statistical MT, by Brown et al. | link | Oct 20, 2003, 3:30pm | EE1-M306 | Luca Giacinto Cazzanti and Jeff Bilmes | (finished up to model 3, models 4 & 5 next week) |
| Finish math of stat MT Models 4 & 5 (Luca), and Katrin will discuss 2 papers on alignment (papers 4 and 5 below) and one on higher level structure (number 10 below) | link1, link2, and link3. | Oct 27, 2003, 3:30pm | AE-108 | Luca Giacinto Cazzanti and Katrin Kirchhoff | finish up last week, and continue on. Note, Katrin's papers could be classified as either higher-level structure or alignment. |
| Och et al.'s "Improved Alignment Models.." paper, in addition to paper 3 from last week. | link1(pdf), link2 | Nov 3rd, 2003, 3:30pm | AE-108 | Katrin Kirchhoff | - |
| Evaluation methods (BLEU point/counterpoint), papers 1, 2, and 3 (and we also did 5) below | link1, link2, link3, and link4 | Nov 10th, 2003, 3:30pm | AE-108 | Mari Ostendorf and Sarah Schwarm | discussion notes (updated Nov 14, 2003) |
| Two papers on search methods for MT (2 and 3 below), one by Garcia-Varea et al. and another by Germann et al. | link1 and link2 | Nov 17th, 2003, 3:30pm | AE-108 | Karim Filali | paper 6 below (pdf link) is prerequisite reading. discussion notes (updated Nov 17, 2003) |
| Grammars of standard and more obscure languages, implications for MT. | morphology, languages, and aspect. | Nov 24th, 2003, 3:30pm | AE-108 | Emily M. Bender and Jeremy G Kahn | notes and handout |
| No meeting due to ASRU'2003 | - | Dec 1st, 2003, 3:30pm | AE-108 | - | - |
| Papers 1 and 2 below in other knowledge sources. | link1 and link2 | Dec 8th, 2003, 3:30pm | AE-105 | Katrin Kirchhoff | discussion notes (updated Dec 9, 2003). This is the last meeting of the quarter. |
The following sites list recent and upcoming calls for papers (CFPs) in the area of machine translation: