Please use this identifier to cite or link to this item: http://hdl.handle.net/10609/109820
Full metadata record
DC Field | Value | Language
dc.contributor.author | Allende, César | -
dc.contributor.author | Jorba, Josep | -
dc.contributor.author | Sikora, Anna | -
dc.contributor.author | César Galobardes, Eduardo | -
dc.contributor.other | Universitat Oberta de Catalunya. Internet Interdisciplinary Institute (IN3) | -
dc.contributor.other | Universitat Autònoma de Barcelona (UAB) | -
dc.date.accessioned | 2020-02-18T08:23:54Z | -
dc.date.available | 2020-02-18T08:23:54Z | -
dc.date.issued | 2014-06-06 | -
dc.identifier.citation | Allende, C., Jorba, J., Sikora, A. & César, E. (2014). A Performance Model for OpenMP Memory Bound Applications in Multisocket Systems. Procedia Computer Science, 29, 2208-2218. doi: 10.1016/j.procs.2014.05.206 | es
dc.identifier.issn | 1877-0509 | -
dc.identifier.uri | http://hdl.handle.net/10609/109820 | -
dc.description.abstract | The performance of OpenMP applications executed in multisocket multicore processors can be limited by the memory interface. In a multisocket environment, each multicore processor can present a performance degradation in memory-bound parallel regions when sharing the same Last Level Cache (LLC). We propose a characterization of the performance of parallel regions to estimate cache misses and execution time. This model is used to select the number of threads and affinity distribution for each parallel region. The model is applied to the SP and MG benchmarks from the NAS Parallel Benchmark Suite using different workloads on two different multicore, multisocket systems. The results show that the estimation preserves the behavior shown in measured executions for the affinity configurations evaluated. Estimated execution time is used to select a set of configurations in order to minimize the impact of memory contention, achieving significant improvements compared with a default configuration using all threads. | en
dc.format.mimetype | application/pdf | -
dc.language.iso | eng | -
dc.publisher | Procedia Computer Science | -
dc.relation.ispartof | Procedia Computer Science, 2014, 29 | -
dc.relation.ispartofseries | 14th International Conference on Computational Science, Guimarães, Portugal, June 30-July 3, 2014 | -
dc.relation.uri | https://doi.org/10.1016/j.procs.2014.05.206 | -
dc.rights | CC BY-NC-ND | -
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ | -
dc.subject | performance model | en
dc.subject | multicore | en
dc.subject | multisocket | en
dc.subject | OpenMP | en
dc.subject | memory bound applications | en
dc.subject | model de rendiment | ca
dc.subject | modelo de rendimiento | es
dc.subject | multinucli | ca
dc.subject | multi-núcleo | es
dc.subject | endoll múltiple | ca
dc.subject | toma múltiple | es
dc.subject | OpenMP | ca
dc.subject | OpenMP | es
dc.subject | aplicacions vinculades a la memòria | ca
dc.subject | aplicaciones vinculadas a la memoria | es
dc.subject.lcsh | Computer storage devices | en
dc.title | A performance model for OpenMP memory bound applications in multisocket systems | -
dc.type | info:eu-repo/semantics/conferenceObject | -
dc.subject.lemac | Ordinadors -- Dispositius de memòria | ca
dc.subject.lcshes | Ordenadores -- Dispositivos de memoria | es
dc.rights.accessRights | info:eu-repo/semantics/openAccess | -
dc.identifier.doi | 10.1016/j.procs.2014.05.206 | -
dc.gir.id | AR/0000003872 | -
dc.type.version | info:eu-repo/semantics/publishedVersion | -
Appears in collections: Articles científics
Articles

Files in this item:
File | Description | Size | Format
Jorba_PCS14_Performance.pdf | - | 776.93 kB | Adobe PDF