Use this identifier to cite or link to this item: http://hdl.handle.net/10609/151508
Title: Automatic creation of WordNets from parallel corpora
Authors: Oliver, Antoni  
Climent, Salvador  
Citation: Oliver, A. [Antoni] & Climent Roca, S. [Salvador]. (2014). Automatic creation of WordNets from parallel corpora. Proceedings of the 9th Language Resources and Evaluation Conference (pp. 1112-1116). Reykjavik: European Language Resources Association (ELRA).
Abstract: In this paper we present the evaluation results for the creation of WordNets for five languages (Spanish, French, German, Italian and Portuguese) using an approach based on parallel corpora. We have used three very large parallel corpora for our experiments: DGT-TM, EMEA and ECB. The English part of each corpus is semantically tagged using Freeling and UKB. After this step, the process of WordNet creation is converted into a word alignment problem, where we want to align WordNet synsets in the English part of the corpus with lemmata in the target-language part of the corpus. The word alignment algorithm used in these experiments is a simple most frequent translation algorithm implemented in the WN-Toolkit. The obtained precision values are quite satisfactory, but the overall number of extracted synset-variant pairs is too low, leading to very poor recall values. In the conclusions, the use of more advanced word alignment algorithms, such as Giza++, Fast Align or Berkeley Aligner, is suggested.
Keywords: WordNet
expand model
parallel corpus
Document type: info:eu-repo/semantics/conferenceObject
Publication date: May 2014
Publication license: http://creativecommons.org/licenses/by-nc-nd/3.0/es/  
Appears in collections: Conferencias
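
The abstract describes the alignment step as a "simple most frequent translation algorithm". As a rough illustration only, the idea can be sketched in Python as below. This is not the WN-Toolkit code: the function name, the input format (pairs of English synset tags and target-language lemmas per aligned sentence), the `min_count` threshold, the alphabetical tie-break and the synset labels are all assumptions made for this sketch.

```python
from collections import Counter, defaultdict


def most_frequent_translation(aligned_pairs, min_count=1):
    """Pick, for each synset, the target lemma it co-occurs with most often.

    aligned_pairs: iterable of (synsets, lemmas) tuples, one per aligned
    sentence pair. `synsets` are the sense tags found on the English side,
    `lemmas` are the lemmata found on the target-language side.
    (Hypothetical input format, assumed for this sketch.)
    """
    cooccurrences = defaultdict(Counter)
    for synsets, lemmas in aligned_pairs:
        unique_lemmas = set(lemmas)  # count each lemma once per sentence
        for synset in set(synsets):
            cooccurrences[synset].update(unique_lemmas)

    variants = {}
    for synset, counts in cooccurrences.items():
        # Highest count wins; ties are broken alphabetically (assumption).
        lemma, count = max(counts.items(), key=lambda kv: (kv[1], kv[0]))
        if count >= min_count:
            variants[synset] = lemma
    return variants


# Toy data with made-up synset labels (not real WordNet offsets).
pairs = [
    (["synset-entity"], ["entidad"]),
    (["synset-entity", "synset-dog"], ["entidad", "perro"]),
    (["synset-dog"], ["perro", "grande"]),
]
print(most_frequent_translation(pairs))
```

A plain co-occurrence count like this tends to favour very frequent target words, so a practical version would presumably need filtering (for example by part of speech or a stop list). Raising `min_count` trades recall for precision, which relates to the precision/recall trade-off the abstract reports.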

Files in this item:
File: 2014-AutomaticCreation-Oliver.pdf
Size: 120.35 kB
Format: Adobe PDF

This item is subject to a Creative Commons license.