Use this identifier to cite or link to this item:
http://hdl.handle.net/10609/151499
Title: | Enlarging the Croatian Wordnet with WN-Toolkit and CroDeriV |
Authors: | Oliver, Antoni; Sojat, Kresimir; Srebacic, Matea |
Citation: | Oliver, A. [Antoni], Sojat, K. [Kresimir] & Srebacic, M. [Matea] (2015). Enlarging the Croatian Wordnet with WN-Toolkit and CroDeriV. In R. [Ruslan] Mitkov, K. [Kalina] Bontcheva & G. [Galia] Angelova (eds.). Proceedings of the International Conference Recent Advances in Natural Language Processing (RANLP 2015) (pp. 480-487). Hissar: INCOMA Ltd. Shoumen |
Abstract: | Wordnet is a standard semantic resource for several Natural Language Processing tasks, and it is available for an increasing number of languages. The Croatian Wordnet (CroWN) was a relatively small resource with 10,026 synsets and 31,367 synset-variant pairs, covering only 45.91% of the so-called Core WordNet. Comparing these figures with the size of the Princeton WordNet for English version 3.0, which has 117,659 synsets and 206,975 synset-variant pairs, it is clear that the CroWN should be expanded. The first experiments for the expansion of the CroWN were performed using the WN-Toolkit, a set of Python programs for wordnet creation and expansion using dictionary-, BabelNet- and parallel-corpora-based strategies. The WN-Toolkit had previously been applied successfully to other languages such as Spanish, Catalan and Galician. After this first expansion, CroWN reached 70.63% of the Core WordNet. In the second step we used CroDeriV, a derivational database for Croatian, together with the manual creation of 1,457 synset-variant pairs, until reaching 100% of the Core WordNet. After the second step was completed, CroWN reached 23,137 synsets and 47,931 synset-lemma pairs. |
Document type: | info:eu-repo/semantics/conferenceObject |
Publication date: | Sep-2015 |
Publication license: | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ |
Appears in collections: | Conferencias |
Files in this item:
File | Description | Size | Format | |
---|---|---|---|---|
2015-EnlargingCroatianWordnet-Oliver-Sojat-Srebabic.pdf | | 110.21 kB | Adobe PDF | View/Open |
This item is licensed under a Creative Commons license.