Use this identifier to cite or link to this item:
http://hdl.handle.net/10609/109820
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Allende, César | - |
dc.contributor.author | Jorba, Josep | - |
dc.contributor.author | Sikora, Anna | - |
dc.contributor.author | César Galobardes, Eduardo | - |
dc.contributor.other | Universitat Oberta de Catalunya. Internet Interdisciplinary Institute (IN3) | - |
dc.contributor.other | Universitat Autònoma de Barcelona (UAB) | - |
dc.date.accessioned | 2020-02-18T08:23:54Z | - |
dc.date.available | 2020-02-18T08:23:54Z | - |
dc.date.issued | 2014-06-06 | - |
dc.identifier.citation | Allende, C., Jorba, J., Sikora, A. & César, E. (2014). A Performance Model for OpenMP Memory Bound Applications in Multisocket Systems. Procedia Computer Science, 29, 2208-2218. doi: 10.1016/j.procs.2014.05.206 | es |
dc.identifier.issn | 1877-0509 | - |
dc.identifier.uri | http://hdl.handle.net/10609/109820 | - |
dc.description.abstract | The performance of OpenMP applications executed in multisocket multicore processors can be limited by the memory interface. In a multisocket environment, each multicore processor can present a performance degradation in memory-bound parallel regions when sharing the same Last Level Cache (LLC). We propose a characterization of the performance of parallel regions to estimate cache misses and execution time. This model is used to select the number of threads and affinity distribution for each parallel region. The model is applied to the SP and MG benchmarks from the NAS Parallel Benchmark Suite using different workloads on two different multicore, multisocket systems. The results show that the estimation preserves the behavior shown in measured executions for the affinity configurations evaluated. Estimated execution time is used to select a set of configurations in order to minimize the impact of memory contention, achieving significant improvements compared with a default configuration using all threads. | en |
dc.format.mimetype | application/pdf | - |
dc.language.iso | eng | - |
dc.publisher | Procedia Computer Science | - |
dc.relation.ispartof | Procedia Computer Science, 2014, 29 | - |
dc.relation.ispartofseries | 14th International Conference on Computational Science, Guimarães, Portugal, June 30-July 3, 2014 | - |
dc.relation.uri | https://doi.org/10.1016/j.procs.2014.05.206 | - |
dc.rights | CC BY-NC-ND | - |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ | - |
dc.subject | performance model | en |
dc.subject | multicore | en |
dc.subject | multisocket | en |
dc.subject | OpenMP | en |
dc.subject | memory bound applications | en |
dc.subject | model de rendiment | ca |
dc.subject | modelo de rendimiento | es |
dc.subject | multinucli | ca |
dc.subject | multi-núcleo | es |
dc.subject | endoll múltiple | ca |
dc.subject | toma múltiple | es |
dc.subject | OpenMP | ca |
dc.subject | OpenMP | es |
dc.subject | aplicacions vinculades a la memòria | ca |
dc.subject | aplicaciones vinculadas a la memoria | es |
dc.subject.lcsh | Computer storage devices | en |
dc.title | A performance model for OpenMP memory bound applications in multisocket systems | - |
dc.type | info:eu-repo/semantics/conferenceObject | - |
dc.subject.lemac | Ordinadors -- Dispositius de memòria | ca |
dc.subject.lcshes | Ordenadores -- Dispositivos de memoria | es |
dc.rights.accessRights | info:eu-repo/semantics/openAccess | - |
dc.identifier.doi | 10.1016/j.procs.2014.05.206 | - |
dc.gir.id | AR/0000003872 | - |
dc.type.version | info:eu-repo/semantics/publishedVersion | - |
Appears in collections: | Articles científics; Articles |
Files in this item:
File | Description | Size | Format | |
---|---|---|---|---|
Jorba_PCS14_Performance.pdf | | 776.93 kB | Adobe PDF | View/Open |
This item is licensed under a Creative Commons license.