[slides and audio] Statistical Phrase-Based Translation

This paper presents a new phrase-based translation model and decoding algorithm that allows for the evaluation and comparison of various phrase-based translation models. The authors conducted extensive experiments to understand why phrase-based models outperform word-based models. Their results show that the highest performance can be achieved through simple methods: heuristic learning of phrase translations from word-based alignments and lexical weighting of phrase translations. Surprisingly, learning longer phrases or phrases from high-accuracy word-level alignment models does not significantly improve performance. Learning only syntactically motivated phrases actually degrades performance. The paper introduces a uniform evaluation framework for comparing different phrase extraction methods. The phrase translation model is based on the noisy channel model, and the decoder uses a beam search algorithm. The model uses a language model and a translation model to generate the best English output sentence given a foreign input sentence. The model also includes a distortion probability distribution and a factor to calibrate output length. The authors tested three methods for learning phrase translation probability tables: one based on word alignments, one based on syntactic phrases, and one based on phrase alignments. They found that the method based on word alignments performed best. They also tested the effect of lexical weighting on translation performance and found that it improved performance slightly. The authors also tested the effect of different phrase lengths on performance and found that phrases of up to three words were sufficient for high accuracy. They also tested the effect of different alignment heuristics and found that the best heuristic varied depending on the language pair and the size of the training corpus. The authors concluded that phrase-based translation models outperform word-based models. They also found that simple methods can achieve high performance, and that syntactic models do not necessarily lead to better translations. The paper provides a comprehensive evaluation of phrase-based translation models and their performance.This paper presents a new phrase-based translation model and decoding algorithm that allows for the evaluation and comparison of various phrase-based translation models. The authors conducted extensive experiments to understand why phrase-based models outperform word-based models. Their results show that the highest performance can be achieved through simple methods: heuristic learning of phrase translations from word-based alignments and lexical weighting of phrase translations. Surprisingly, learning longer phrases or phrases from high-accuracy word-level alignment models does not significantly improve performance. Learning only syntactically motivated phrases actually degrades performance. The paper introduces a uniform evaluation framework for comparing different phrase extraction methods. The phrase translation model is based on the noisy channel model, and the decoder uses a beam search algorithm. The model uses a language model and a translation model to generate the best English output sentence given a foreign input sentence. The model also includes a distortion probability distribution and a factor to calibrate output length. The authors tested three methods for learning phrase translation probability tables: one based on word alignments, one based on syntactic phrases, and one based on phrase alignments. They found that the method based on word alignments performed best. They also tested the effect of lexical weighting on translation performance and found that it improved performance slightly. The authors also tested the effect of different phrase lengths on performance and found that phrases of up to three words were sufficient for high accuracy. They also tested the effect of different alignment heuristics and found that the best heuristic varied depending on the language pair and the size of the training corpus. The authors concluded that phrase-based translation models outperform word-based models. They also found that simple methods can achieve high performance, and that syntactic models do not necessarily lead to better translations. The paper provides a comprehensive evaluation of phrase-based translation models and their performance.

Statistical Phrase-Based Translation

May-June 2003 | Philipp Koehn, Franz Josef Och, Daniel Marcu