Empreu aquest identificador per citar o enllaçar aquest ítem: http://hdl.handle.net/10609/77285
Títol: Parallel corpora for WordNet construction: machine translation vs. automatic sense tagging
Autoria: Oliver González, Antoni
Climent, Salvador  
Resum: In this paper we present a methodology for WordNet construction based on the exploitation of parallel corpora with semantic annotation of the English source text. We are using this methodology for the enlargement of the Spanish and Catalan versions of WordNet 3.0, but the methodology can also be used for other languages. As big parallel corpora with semantic annotation are not usually available, we explore two strategies to overcome this problem: to use monolingual sense tagged corpora and machine translation, on the one hand; and to use parallel corpora and automatic sense tagging on the source text, on the other. With these resources, the problem of acquiring a WordNet from parallel corpora can be seen as a word alignment task. Fortunately, this task is well known, and some aligning algorithms are freely available.
Paraules clau: lexical resources
wordnet
parallel corpora
machine translation
automatic sense tagging
Tipus de document: info:eu-repo/semantics/article
Data de publicació: 2012
Llicència de publicació: http://creativecommons.org/licenses/by-nc-nd/3.0/es/  
Apareix a les col·leccions:Articles
Articles cientÍfics

Arxius per aquest ítem:
Arxiu Descripció MidaFormat 
cicling2012_26_aoliver.pdfArticle Lecture Notes in Computer Science351,4 kBAdobe PDFThumbnail
Veure/Obrir
Comparteix:
Exporta:
Consulta les estadístiques

Aquest ítem està subjecte a una llicència de Creative Commons Llicència Creative Commons Creative Commons