Empreu aquest identificador per citar o enllaçar aquest ítem: http://hdl.handle.net/10609/151237
Títol: Training an NMT system for legal texts of a low-resource language variety (South Tyrolean German – Italian)
Autoria: Oliver, Antoni  
Alvarez Vidal, Sergi  
stemle, egon  
Chiocchetti, Elena  
Citació: Oliver, A. [Antoni], Álvarez. S. [Sergi], Stemle, E. [Egon] & Chiocchetti, E. [Elena](2024). Training an NMT system for legal texts of a low-resource language variety (South Tyrolean German – Italian). Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1)
Resum: This paper illustrates the process of training and evaluating NMT systems for a language pair that includes a low-resource language variety. A parallel corpus of legal texts for Italian and South Tyrolean German has been compiled, with South Tyrolean German being the low-resourced language variety. As the size of the compiled corpus is insufficient for the training, we have combined the corpus with several parallel corpora using data weighting at sentence level. We then performed an evaluation of each combination and of two popular commercial systems.
Tipus de document: info:eu-repo/semantics/conferenceObject
Data de publicació: jun-2024
Llicència de publicació: http://creativecommons.org/licenses/by-nd/3.0/es/  
Apareix a les col·leccions:Conferencias

Arxius per aquest ítem:
Arxiu Descripció MidaFormat 
EAMT2024-Oliver-Alvarez.pdf242,87 kBAdobe PDFThumbnail
Veure/Obrir
Comparteix:
Exporta:
Consulta les estadístiques

Aquest ítem està subjecte a una llicència de Creative CommonsLlicència Creative Commons Creative Commons