A performance model for OpenMP memory bound applications in multisocket systems

Allende, César; Jorba, Josep; Sikora, Anna; César Galobardes, Eduardo

Empreu aquest identificador per citar o enllaçar aquest ítem: http://hdl.handle.net/10609/109820

Registre complet de metadades

Camp DC	Valor	Llengua/Idioma
dc.contributor.author	Allende, César	-
dc.contributor.author	Jorba, Josep	-
dc.contributor.author	Sikora, Anna	-
dc.contributor.author	César Galobardes, Eduardo	-
dc.contributor.other	Universitat Oberta de Catalunya. Internet Interdisciplinary Institute (IN3)	-
dc.contributor.other	Universitat Autònoma de Barcelona (UAB)	-
dc.date.accessioned	2020-02-18T08:23:54Z	-
dc.date.available	2020-02-18T08:23:54Z	-
dc.date.issued	2014-06-06	-
dc.identifier.citation	Allende, C., Jorba, J., Sikora, A. & César, E. (2014). A Performance Model for OpenMP Memory Bound Applications in Multisocket Systems. Procedia Computer Science, 29(), 2.208-2.218. doi: 10.1016/j.procs.2014.05.206	es
dc.identifier.issn	1877-0509MIAR	-
dc.identifier.uri	http://hdl.handle.net/10609/109820	-
dc.description.abstract	The performance of OpenMP applications executed in multisocket multicore processors can be limited by the memory interface. In a multisocket environment, each multicore processor can present a performance degradation in memory-bound parallel regions when sharing the same Last Level Cache (LLC). We propose a characterization of the performance of parallel regions to estimate cache misses and execution time. This model is used to select the number of threads and affinity distribution for each parallel region. The model is applied for SP and MG benchmarks from the NAS Parallel Benchmark Suite using different workloads on two different multicore, multisocket systems.The results shown that the estimation preserves the behavior shown in measured executions for the affinity configurations evaluated. Estimated execution time is used to select a set of configurations in order to minimize the impact of memory contention, achieving significant improvements compared with a default configuration using all threads.	en
dc.format.mimetype	application/pdf	-
dc.language.iso	eng	-
dc.publisher	Procedia Computer Science	-
dc.relation.ispartof	Procedia Computer Science, 2014, 29	-
dc.relation.ispartofseries	14th International Conference on Computational Science, Guimarães, Portugal, june 30-july 3, 2014	-
dc.relation.uri	https://doi.org/10.1016/j.procs.2014.05.206	-
dc.rights	CC BY-NC-ND	-
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/es/	-
dc.subject	performance model	en
dc.subject	multicore	en
dc.subject	multisocket	en
dc.subject	OpenMP	en
dc.subject	memory bound applications	en
dc.subject	model de rendiment	ca
dc.subject	modelo de rendimiento	es
dc.subject	multinucli	ca
dc.subject	multi-núcleo	es
dc.subject	endoll múltiple	ca
dc.subject	toma múltiple	es
dc.subject	OpenMP	ca
dc.subject	OpenMP	es
dc.subject	aplicacions vinculades a la memòria	ca
dc.subject	aplicaciones vinculadas a la memoria	es
dc.subject.lcsh	Computer storage devices	en
dc.title	A performance model for OpenMP memory bound applications in multisocket systems	-
dc.type	info:eu-repo/semantics/conferenceObject	-
dc.subject.lemac	Ordinadors -- Dispositius de memòria	ca
dc.subject.lcshes	Ordenadores -- Dispositivos de memoria	es
dc.rights.accessRights	info:eu-repo/semantics/openAccess	-
dc.identifier.doi	10.1016/j.procs.2014.05.206	-
dc.gir.id	AR/0000003872	-
dc.type.version	info:eu-repo/semantics/publishedVersion	-
Apareix a les col·leccions:	Articles cientÍfics Articles