Please use this identifier to cite or link to this item:
http://hdl.handle.net/10609/151574
Title: | An n-gram approach to exploiting a monolingual corpus for machine translation |
Author: | Badia, Toni ![]() Boleda, Gemma ![]() Melero, Maite ![]() Oliver, Antoni ![]() |
Citation: | Badia, T.[Toni], Boleda, G.[Gemma], Melero, [Maite] & Oliver, A. [Antoni]. (2005). An n-gram approach to exploiting a monolingual corpus for machine translation. A Proceedings of the Second Workshop on Example-Base Machine Translation. MT Summit X |
Abstract: | In this paper we present an approach to Statistical Machine Translation that uses a bilingual dictionary and a target language model based on n-grams extracted from a monolingual corpus. This approach is still in an experimental stage and is being developed in the context of Metis-II, a UE project that aims at constructing free text translations by retrieving the basic stock for translations from large monolingual corpora. The architecture described in this paper is being applied to translation from Spanish to English and is designed so as to depend as little as possible on complex linguistic processing tools. The only required tools are a POS tagger and lemmatizer for the source language an another for the target language. |
Document type: | info:eu-repo/semantics/conferenceObject |
Issue Date: | Sep-2005 |
Publication license: | http://creativecommons.org/licenses/by-nc-sa/3.0/es/ ![]() |
Appears in Collections: | Conferencias |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
2005-n-gram-approach-Badia-Boleda-Melero-Oliver.pdf | 2,14 MB | Adobe PDF | ![]() View/Open |
Share:


This item is licensed under aCreative Commons License