Please use this identifier to cite or link to this item: http://hdl.handle.net/10609/151522
Title: Building WordNets by machine translation of sense tagged corpora
Author: Oliver, Antoni  
Climent, Salvador  
Citation: Oliver, A. [Antoni] & Climent Roca, S. [Salvador](2012). Building WordNets by machine translation of sense tagged corpora. Proceedings of the Global WordNet Conference 2012. Matsue (Japan) (p. 232-239). Global WordNet Association
Abstract: This paper describes a methodology for the construction of WordNets based on machine translation of an English sense tagged corpus. For the construction of such a corpus we used two freely available resources: the Semcor Corpus and the Princeton WordNet Gloss Corpus. This methodology is applied on the construction of Spanish and Catalan WordNet 3.0. In our first experiments we used a simple word alignment algorithm obtaining precision results comparable to those obtained by methods based on bilingual dictionaries. After that, we used a freely available statistical word alignment algorithm (Berkeley Aligner) obtaining better results. This methodology can be suitable for those languages with an available statistical machine translation system and can be used for either constructing WordNets from the scratch or for enlarging existing WordNets.
Document type: info:eu-repo/semantics/conferenceObject
Issue Date: Jan-2012
Publication license: http://creativecommons.org/licenses/by-nc-nd/3.0/es/  
Appears in Collections:Conferencias

Files in This Item:
File Description SizeFormat 
2012-Building_WordNets_by_machine_translation.pdf1,13 MBAdobe PDFThumbnail
View/Open
Share:
Export:
View statistics

Items in repository are protected by copyright, with all rights reserved, unless otherwise indicated.