Please use this identifier to cite or link to this item: http://hdl.handle.net/10609/151235
Title: LitPC: a set of tools for building parallel corpora from literary works
Author: Oliver, Antoni  
Alvarez Vidal, Sergi  
Citation: Oliver, A. [Antoni] & Álvarez. S. [Sergi]. (2024). LitPC: a set of tools for building parallel corpora from literary works. Proceedings of the 1st Workshop on Creative-text Translation and Technology, p. 25–35, Sheffield, United Kingdom
Abstract: In this paper, we describe the LitPC toolkit, a variety of tools and methods designed for the quick and effective creation of parallel corpora derived from literary works. This toolkit can be a useful resource due to the scarcity of curated parallel texts for this domain. We also feature a case study describing the creation of a Russian-English parallel corpus based on the literary works by Leo Tolstoy. Furthermore, an augmented version of this corpus is used to both train and assess neural machine translation systems specifically adapted to the author’s style.
Document type: info:eu-repo/semantics/conferenceObject
Issue Date: Jun-2024
Publication license: http://creativecommons.org/licenses/by-nd/4.0/es/  
Appears in Collections:Conferencias

Files in This Item:
File Description SizeFormat 
CTT2024-Oliver-Alvarez.pdf260,75 kBAdobe PDFThumbnail
View/Open
Share:
Export:
View statistics

This item is licensed under aCreative Commons License Creative Commons