Please use this identifier to cite or link to this item: http://hdl.handle.net/10609/151497
Title: Use of Internet for augmenting coverage in a lexical acquisition system from raw corpora: application to Russian
Authors: Oliver, Antoni  
Castellón Masalles, Irene  
Màrquez, Lluís
Citation: Oliver, A. [Antoni], Castellón, I. [Irene] & Màrquez, L. [Lluís]. (2003). Use of Internet for augmenting coverage in a lexical acquisition system from raw corpora: application to Russian. Proceedings of the Workshop IESL 2003. International Workshop on Information Extraction for Slavonic and other Central and Eastern European Languages, 8-9 September 2003, Borovets, Bulgaria
Abstract: This paper presents a methodology for the automatic acquisition of lexical resources from raw corpora. This methodology has proved to be efficient for languages that, like Russian, have a rich and mainly concatenative morphology. The method can be applied to the creation of new resources as well as to the enrichment of existing ones. We also present an extension of the system that automatically queries the Internet to acquire those entries for which there is not enough information in our corpus. The new basic acquisition methodology achieves results similar to those of previous methods, but the use of Internet queries increases recall with only a slight decrease in precision, yielding significantly better overall results.
Document type: info:eu-repo/semantics/conferenceObject
Publication date: Sep-2003
Appears in collections: Conferencias

Files in this item:
Oliver_Use.pdf (14.47 MB, Adobe PDF)

Items in the Repository are protected by copyright, with all rights reserved, unless otherwise indicated.