Available under a Creative Commons Attribution Non-Commercial Share Alike 4.0 International Licence
1.2 COMPUTER AND INFORMATION SCIENCE
This paper describes an experiment to evaluate the impact of idioms on the Statistical Machine Translation (SMT) process using the language pair English/Brazilian-Portuguese. Our results show that on sentences containing idioms a standard SMT system achieves about half the BLEU score of the same system when applied to sentences that do not contain idioms. We also provide a short error analysis and outline our planned work to overcome this limitation.
Salton, G., Ross, R. & Kelleher, J. (2014) An Empirical Study of the Impact of Idioms on Phrase Based Statistical Machine Translation of English to Brazilian-Portuguese. Third Workshop on Hybrid Approaches to Translation (HyTra), 14th Conference of the European Chapter of the Association for Computational Linguistics, Gothenburg, Sweden, 26-30 April. doi:10.21427/D7GG6N