Please use this identifier to cite or link to this item: http://hdl.handle.net/10609/136991
Full metadata record
DC FieldValueLanguage
dc.contributor.authorSolé-Mauri, Francina-
dc.contributor.authorSánchez-Gijón, Pilar-
dc.contributor.authorOliver González, Antoni-
dc.contributor.otherUniversitat Oberta de Catalunya-
dc.contributor.otherUniversitat Autònoma de Barcelona (UAB)-
dc.date.accessioned2022-01-03T16:44:19Z-
dc.date.available2022-01-03T16:44:19Z-
dc.date.issued2021-07-13-
dc.identifier.citationSole-Mauri, F., Sánchez-Gijón, P., & Oliver, A. (2021). Cadlaws - An English-French Parallel Corpus of Legally Equivalent Documents. Mutatis Mutandis. Revista Latinoamericana De Traducción, 14(2), 494-508. https://doi.org/10.17533/udea.mut.v14n2a10-
dc.identifier.issn2011-799XMIAR
-
dc.identifier.urihttp://hdl.handle.net/10609/136991-
dc.description.abstractThis article presents Cadlaws, a new English-French corpus built from Canadian legal documents, and describes the corpus construction process and preliminary statistics obtained from it. The corpus contains over 16 million words in each language and includes unique features since it is composed of documents that are legally equivalent in both languages but not the result of a translation. The corpus is built upon enactments co-drafted by two jurists to ensure legal equality of each version and to re­flect the concepts, terms and institutions of two legal traditions. In this article the corpus definition as a parallel corpus instead of a comparable one is also discussed. Cadlaws has been pre-processed for machine translation and baseline Bilingual Evaluation Understudy (bleu), a score for comparing a candidate translation of text to a gold-standard translation of a neural machine translation system. To the best of our knowledge, this is the largest parallel corpus of texts which convey the same meaning in this language pair and is freely available for non-commercial use.en
dc.language.isoeng-
dc.publisherMutatis Mutandis. Revista Latinoamericana de Traducción-
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/-
dc.subjectcorpus constructionen
dc.subjectparallel corpusen
dc.subjectNeural Machine Translation (NMT)en
dc.subjectEnglish-French translationen
dc.subjectCadlawsen
dc.titleCadlaws - An Enlgish-French parallel corpus of legally equivalent documents-
dc.typeinfo:eu-repo/semantics/article-
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess-
dc.identifier.doi10.17533/udea.mut.v14n2a10-
dc.gir.idAR/0000008729-
Appears in Collections:Articles
Articles cientÍfics

Files in This Item:
File Description SizeFormat 
Cadlaws - An English-French Parallel Corpus of Legally Equivalent Documents.pdf271,36 kBAdobe PDFThumbnail
View/Open