The Machine Translation Reading Group
at SSLI-LAB, UW-EE.
The ongoing saga of our quest to become experts in machine translation
(MT)!
Here we review and discuss both classic (and out-of-date)
and recent statistical (and some non-statistical) techniques for
MT. We occasionally also cover other relevant papers from
computational linguistics and machine learning.
Group meetings are informal, encouraging random (yet MT-related)
discussion after reviewing the weekly paper.
Also, see below for upcoming calls and
related links.
This reading group is part of SSLI-Lab,
the Signal, Speech, and
Language Interpretation Laboratory in the department
of Electrical Engineering
at the University of
Washington. We conduct research on statistical speech and natural
language processing, including machine translation.
Administrivia:
Send mail to username katrin
at ee.washington.edu
if you would like to receive email announcements and reminders
regarding upcoming meetings, or if you would like to lead a discussion
(regardless of your department!).
While the list of papers below will give you something to choose from,
it is far from exhaustive and you are encouraged to suggest other
machine translation papers.
Discussions, Spring 2008
Time and meeting place for Spring Quarter 2008: Mon 1600-1700h, EE-M306.
The next meeting is highlighted in red.
| Topic | Readings | Date/Time/Location | Discussion leader | Slides / Notes |
| . | Yajuan Lu + Jin Huang + Qun Liu (Chinese Academy of Sciences), Improving Statistical Machine Translation Performance by Training Data Selection and Optimization (EMNLP 2007) | Mon, 02 Jun 4-5pm, EE-M306 | Mei Yang | slides |
| . | Holiday, no MTRG | Mon, 26 May | . | . |
| Morphology | K. Toutanova + H. Suzuki + A. Ruopp (MSR), Applying Morphology Generation Models to Machine Translation (ACL 2008) | Mon, 19 May 4-5pm, EE-M306 | amittai | slides |
| . | cancelled | Mon, 12 May | . | . |
| Text processing | Unicode and its discontents. Discussion on character encodings, and the practicalities (and hassles) of working with text. | Mon, 05 May 4-5pm, EE-M306 | Jeremy Kahn | slides |
| Parsing | Y. W. Wong + R. J. Mooney (UT Austin), Generation by Inverting a Semantic Parser That Uses Statistical Machine Translation (HLT/NAACL 2007) | Mon, 28 Apr 4-5pm, EE-M306 | Katrin Kirchhoff | . |
| Language Modeling | T. Brants + A. C. Popat + P. Xu + F. J. Och + J. Dean (Google), Large Language Models in Machine Translation (EMNLP 2007) | Mon, 21 Apr 4-5pm, EE-M306 | Kevin Duh | . |
Discussions, Winter 2008
Time and meeting place for Winter Quarter 2008: Wed 1:30-2:30, EE-M406.
No meetings on Wednesday, March 19 or 26, due to finals week and break.
MTRG will resume the week of April 1-4.
| Topic | Readings | Date/Time/Location | Discussion leader | Slides / Notes |
| Reordering | Y. Zhang + R. Zens + H. Ney, Chunk-Level Reordering of Source Language Sentences with Automatically Learned Rules for Statistical Machine Translation (NAACL SSST-AMTA workshop 2007) | Wed, 12 Mar 1:30-2:30, EE-M406 | Jeremy Kahn | slides |
| . | cancelled | Wed, 05 Mar | . | . |
| Reranking | L. Shen + A. Sarkar + F.J. Och, Discriminative Reranking for Machine Translation (NAACL 2004) | Wed, 27 Feb 1:30-2:30, EE-M406 | Andrei Alexandrescu | . |
| Word alignment | X. He (MSR), Using Word Dependent Transition Models in HMM based Word Alignment for Statistical Machine Translation (ACL WMT 2007) | Wed, 20 Feb 1:30-2:30, EE-M406 | Mei Yang | slides |
| Morphology in MT | E. Minkov + K. Toutanova + H. Suzuki (MSR), Generating Complex Morphology for Machine Translation (ACL 2007) | Wed, 13 Feb 1:30-2:30, EE-M406 | amittai | notes (slides hidden) |
| Domain adaptation | C. Lim + K. Kirchhoff (UW-EE), "Domain Adaptation through Phrase Generalization for Improved SMT Quality" | Wed, 06 Feb 1:30-2:30, EE-M406 | Chris Lim | not linked (yet) |
| Multilingual lexical translation | Mausam + S. Soderland (UW Computer Science), "Probabilistic Inference in Translation Graphs" | Wed, 30 Jan 1:30-2:30, EE-M406 | Mausam | . |
| Pivot languages | H. Wu + H. Wang (Toshiba, Beijing), Pivot Language Approach for Phrase-Based Statistical Machine Translation (ACL 2007) | Wed, 23 Jan 1:30-2:30, EE-M406 | Katrin Kirchhoff | . |
Relevant Calls for papers
The following sites list recent and future conference announcements in the area of
machine translation.
Partial (and growing) Lists of Suggested Papers
(please send email with any others that you think should
be added to the list).
Introductory/Classical Papers in Statistical MT
-
Chapter from Russell & Norvig's book
"AI: A Modern Approach, 2nd Edition".
The 2nd edition's chapter on MT is very good.
-
Chapter on MT from Jurafsky & Martin's book "Speech
and Language Processing: An Introduction to Natural
Language Processing, Computational Linguistics,
and Speech Recognition"
-
A Statistical MT Tutorial Workbook, Kevin Knight, unpublished, August
1999 pdf link
- P.F. Brown et al.,
A statistical approach to machine translation,
Computational Linguistics,
16, 1990, 79-85
link
-
S. Della Pietra and M. Epstein and S. Roukos and T. Ward,
Fertility models for statistical natural language understanding,
Proceedings of ACL,
1997, 168-173
link
-
The Candide System for Machine Translation (1994), Adam L. Berger,
Peter F. Brown, Stephen A. Della Pietra, Vincent J. Della Pietra, John
R. Gillett, John D. Lafferty, Robert L. Mercer, Harry Printz, Lubos
Ures
link
-
P. F. Brown, S. A. Della Pietra, V. J. Della Pietra, and
R. L. Mercer. The mathematics of statistical machine translation:
Parameter estimation. Computational linguistics, 19:263--312, June
1993. link
- Kenji Yamada's Intro to Statistical Machine Translation Notes
link
-
Knight and Koehn HLT tutorial: "What's New in Statistical Machine Translation"
link
Evaluation of MT quality
-
K. Papineni and S. Roukos and T. Ward and W. Zhu,
Bleu: a method for automatic evaluation of machine translation,
2001, IBM Research Report RC22176
link
- Hovy E., King M. & Popescu-Belis A. (2002) - An Introduction to MT
Evaluation. LREC 2002 Workshop "Machine Translation Evaluation: Human
Evaluators Meet Automated Metrics", Las Palmas de Gran Canaria, Spain,
p.1-7. link
-
Culy, Christopher and Riehemann, Susanne. 2003.
The Limits of N-Gram Translation Evaluation Metrics.
Proceedings of MT Summit IX.
link
-
FEMTI, the Framework for Machine Translation Evaluation in ISLE, a
structured repertoire of methods used to evaluate MT systems. link1 and link2
- Automatic Evaluation of Machine Translation Quality
Using N-gram Co-Occurrence Statistics, NIST document.
link
- A Novel String-to-String Distance Measure With Applications
to Machine Translation Evaluation, by Leusch, Ueffing, and Ney.
link
- ORANGE: a Method for
Evaluating Automatic Evaluation Metrics for Machine
Translation, by Lin and Och, Coling04,
link
Using higher level structure
-
A Syntax-based Statistical Translation Model (2001), Kenji Yamada,
Kevin Knight. Meeting of the Association for Computational Linguistics
link
-
Modeling with Structures in Statistical Machine Translation (1997)
Ye-Yi Wang, Alex Waibel
COLING-ACL
link
-
Statistical Phrase-Based Translation, Philipp Koehn, Franz Josef Och,
and Daniel Marcu, HLT/NAACL 2003
link
-
A phrase-based joint probability model for statistical machine translation,
D. Marcu and W. Wong,
Proceedings of EMNLP, 2002
link
-
Marcu, D. and Wong, W. (2002). A phrase-based, joint probability
model for statistical machine translation. In Proceedings
of the Conference on Empirical Methods in Natural Language
Processing, EMNLP.
-
Och, F. J., Tillmann, C., and Ney, H. (1999). Improved alignment
models for statistical machine translation. In Proc. of the Joint
Conf. of Empirical Methods in Natural Language Processing and Very
Large Corpora, pages 20-28.
link
-
"A Syntax-Based Statistical Translation Model" (K. Yamada and
K. Knight), Proc. of the Conference of the Association for
Computational Linguistics (ACL), 2001.
link
-
"A Decoder for Syntax-Based Statistical MT" (K. Yamada and K. Knight), Proc. of the Conference of the Association for Computational Linguistics (ACL), 2002.
link
-
Syntax-based Language Models for Machine Translation,
E. Charniak, K. Knight, and K. Yamada, Proc. MT Summit IX, 2003.
link
-
Wang and Waibel, "Modeling with structures in statistical machine translation",
Proceedings of COLING-ACL, 1998
link
-
Inducing Translation Templates for Example-Based Machine Translation,
by Michael Carl. link
-
Reordering Constraints for Phrase-based
Statistical MT, by Richard Zens, Hermann Ney, Taro Watanabe, and
Eiichiro Sumita, from Coling'04
link
-
Improving a Statistical MT System with Automatically Learned Rewrite
Patterns, by Fei Xia and Michael McCord, COLING-2004. link
-
Dan Melamed (2004). Algorithms for Syntax-Aware Statistical Machine
Translation. Proceedings
of the Conference on Theoretical and Methodological Issues in Machine
Translation (TMI'04), Baltimore, MD. PS PDF
Search/Decoding/Alignment
-
Decoding Algorithm in Statistical Machine Translation, Ye-Yi Wang and
Alex Waibel, Language Technology Institute, School of Computer.
Proceedings of the Thirty-Fifth Annual Meeting of the Association for
Computational Linguistics and Eighth Conference of the European
Chapter of the Association for Computational Linguistics
link
-
Search algorithms for statistical machine translation based on dynamic
programming and pruning techniques, I. Garcia-Varea and
F. Casacuberta, Proceedings of the MT Summit VIII Workshop, 2001, link
-
Germann, U., Jahr, M., Knight, K., Marcu, D., and Yamada,
K. (2001). Fast decoding and optimal decoding for machine
translation. In Proceedings of ACL 39.
link
-
Vogel, Ney and Tillmann, "HMM-based word alignment in statistical translation",
Proceedings of COLING 1996
link
-
Och and Ney, "A comparison of alignment models for statistical machine translation",
Proceedings of COLING 2000
link
-
An Iterative, DP-based Search Algorithm for Statistical Machine
Translation, Ismael Garcia-Varea, Francisco Casacuberta, and Hermann
Ney, ICSLP'98 link
-
A DP based Search Using Monotone Alignments in Statistical Translation
(1997)
C. Tillmann, S. Vogel, H. Ney, A. Zubiaga
link
-
A DP based Search Algorithm for Statistical Machine Translation (1998)
S. Niessen, S. Vogel, H. Ney, C. Tillmann
link
-
Accelerated DP Based Search for Statistical Translation (1997)
C. Tillmann, S. Vogel, H. Ney, A. Zubiaga, H. Sawaf
link
-
A Polynomial-Time Algorithm for Statistical Machine Translation
Dekai Wu
link
Other Knowledge Sources
-
Knowledge Sources for Word-Level Translation Models,
P. Koehn and K. Knight,
link
-
Morpho-Syntactic Analysis for Reordering in Statistical Machine
Translation (2001),
Sonja Nießen, Hermann Ney
link
-
Ismael Garcia Varea, Franz Josef Och, Hermann Ney, Francisco Casacuberta. "Refined Lexicon Models for Statistical Machine Translation using a Maximum Entropy Approach". In: "ACL 2001: Proc. of the 39th Annual Meeting of the Association for Computational Linguistics", pp. 204-211, Toulouse, France, July 2001.
link
-
Hearne, Mary and Way, Andy. 2003. Seeing the Wood for the Trees:
Data-Oriented Translation. Proceedings of MT Summit IX. link
Alignment
-
Improved Alignment Models for Statistical Machine Translation (1999),
Franz Josef Och, Christoph Tillmann, Hermann Ney
link
-
Loosely Tree-Based Alignment for Machine Translation, by Dan Gildea link
(also see here
for a JHU'03 presentation on the subject).
Maxent MT models
-
A Maximum Entropy/Minimum Divergence Translation Model (2000)
George Foster,
link
Statistical Language Models
-
Class-based n-gram models of natural language
by P. Brown, V. Della Pietra, P. deSouza, and J. Lai
link
-
Language Model Adaptation for Statistical Machine Translation via
Structured Query Models, by Bing Zhao, Matthias Eck and Stephan Vogel
link
Other/All of the above/General/Specific
-
Statistical Multi-Source Translation,
Franz Josef Och and Hermann Ney
link
-
Discriminative Training and Maximum Entropy Models for Statistical
Machine Translation,
Franz Josef Och and Hermann Ney,
Proc. 40th Annual Meeting of the ACL, Philadelphia, 2002
link
-
Minimum Error Rate Training in Statistical Machine Translation,
Franz Josef Och, ACL-2003, pp. 160-167. link
-
Franz Josef Och, Hermann Ney.
What Can Machine Translation Learn from Speech Recognition?
In: "Workshop: MT 2010 - Towards a Road Map for MT", pp. 26-31, Santiago de Compostela, Spain, September 2001.
link
-
Models of Translational Equivalence
among Words,
I. Dan Melamed
link
-
Towards a Unified Approach to Memory- and Statistical-Based
Machine Translation,
Daniel Marcu
link
-
Machine Transliteration, (K. Knight and J. Graehl), Computational
Linguistics, 24(4), 1998. link
-
Alshawi, H. and Douglas, S., "Learning Dependency Transduction Models from
Unannotated Examples," Phil. Trans. of the Royal Society, Series A:
Mathematical, Physical and Engineering Sciences, vol. 358, 2000, pp. 1357-1372.
link,
pdf link
-
"Machine Transliteration," (K. Knight and J. Graehl), Proc. of the
Conference of the Association for Computational Linguistics (ACL),
1997. link
-
Confidence Estimation for Machine Translation,
John Blatz, Erin Fitzgerald, George Foster, Simona Gandrabur, Cyril Goutte, Alex Kulesza, Alberto Sanchis and Nicola Ueffing, pp. 315-321, Coling 04,
link
-
Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary,
P. Duygulu, K. Barnard, J.F.G. de Freitas and D.A. Forsyth,
7th European Conference on Computer Vision (Best paper award winner).
link
Traditional Symbolic AI papers on MT (Example & Interlingual based MT)
- Examples and Prospects of Example Based MT
E. Sumita, H. Iida, H. Kohyama (ATR). Proc. ACL 1991
link
- Example Based MT in the Pangloss system
RD Brown (CMU). Proc. ACL 1996
link
Related links
maintained by amittai.
i last remembered to update this footer on:
Mon May 19 20:15:00 PDT 2008