Please use this identifier to cite or link to this item: http://hdl.handle.net/10609/151574
Title: An n-gram approach to exploiting a monolingual corpus for machine translation
Author: Badia, Toni  
Boleda, Gemma  
Melero, Maite  
Oliver, Antoni  
Citation: Badia, T.[Toni], Boleda, G.[Gemma], Melero, [Maite] & Oliver, A. [Antoni]. (2005). An n-gram approach to exploiting a monolingual corpus for machine translation. A Proceedings of the Second Workshop on Example-Base Machine Translation. MT Summit X
Abstract: In this paper we present an approach to Statistical Machine Translation that uses a bilingual dictionary and a target language model based on n-grams extracted from a monolingual corpus. This approach is still in an experimental stage and is being developed in the context of Metis-II, a UE project that aims at constructing free text translations by retrieving the basic stock for translations from large monolingual corpora. The architecture described in this paper is being applied to translation from Spanish to English and is designed so as to depend as little as possible on complex linguistic processing tools. The only required tools are a POS tagger and lemmatizer for the source language an another for the target language.
Document type: info:eu-repo/semantics/conferenceObject
Issue Date: Sep-2005
Publication license: http://creativecommons.org/licenses/by-nc-sa/3.0/es/  
Appears in Collections:Conferencias

Files in This Item:
File Description SizeFormat 
2005-n-gram-approach-Badia-Boleda-Melero-Oliver.pdf2,14 MBAdobe PDFThumbnail
View/Open
Share:
Export:
View statistics

This item is licensed under aCreative Commons License Creative Commons