Analyzing GPU Energy Consumption in Data Movement and Storage

Paul Delestrac; Jonathan Miquel; Debjyoti Bhattacharjee; Diksha Moolchandani; Francky Catthoor; Lionel Torres; David Novo

doi:10.1109/ASAP61560.2024.00038

Communication Dans Un Congrès Année : 2024

Analyzing GPU Energy Consumption in Data Movement and Storage

(1) , (2) , (3) , (3) , (3) , (1) , (1)

1
2
3

Paul Delestrac

Fonction : Auteur
PersonId : 1163872
IdHAL : paul-delestrac
ORCID : 0000-0002-7476-1422
IdRef : 280857977

ADAptive Computing

Jonathan Miquel

Fonction : Auteur
PersonId : 1152726
ORCID : 0009-0000-6429-6152
IdRef : 275866882

Smart Integrated Electronic Systems

Debjyoti Bhattacharjee

Fonction : Auteur
PersonId : 1391406
ORCID : 0000-0001-6561-8934

IMEC

Diksha Moolchandani

Fonction : Auteur
PersonId : 1368576
ORCID : 0000-0001-8110-049X

IMEC

Francky Catthoor

Fonction : Auteur
PersonId : 1086572
ORCID : 0000-0002-3599-8515

IMEC

Lionel Torres

Fonction : Auteur
PersonId : 929667
ORCID : 0000-0001-5807-5070

ADAptive Computing

David Novo

Fonction : Auteur
PersonId : 170933
IdHAL : david-novo
ORCID : 0000-0002-5510-4152
IdRef : 244276455

ADAptive Computing

Résumé

GPUs are the prevailing solution to execute high- performance tasks (e.g., machine learning training). As the peak performance of modern GPUs increases with each generation, so does their thermal design power (TDP). Hence, identifying energy bottlenecks in the GPU architecture is crucial to designing more efficient architectures in the future. However, due to the complex proprietary nature of modern GPU architectures, providing a detailed breakdown of the GPU energy consumption is not trivial. The goal of this work is to estimate a lower bound for the energy consumed by data movement and storage in modern GPU architectures, leveraging internal power sensors. We establish a basic energy model for modern GPUs, focused on data movement to/from the hardware-managed caches and software-managed memories. We propose a methodology to calibrate the energy model using microbenchmarks, performance counters, and the internal power sensor. We experimentally calibrate the model on an A100 NVIDIA GPU. Then, we challenge the consistency of the results by cross-validating with modified microbenchmarks with additional instructions. Finally, we use the calibrated energy model to evaluate breakdowns for workloads of increasing complexity (e.g., a ResNet-50 training iteration with different software optimizations). Our results show that data movement dominates the dynamic energy consumption of the GPU (up to 84%), with DRAM accesses being the main contributor.

Domaines

Architectures Matérielles [cs.AR] Intelligence artificielle [cs.AI] Modélisation et simulation

Fichier principal

delestrac2024analyzing.pdf (715.59 Ko)

Origine	Fichiers produits par l'(les) auteur(s)

Paul Delestrac : Connectez-vous pour contacter le contributeur

https://hal.umontpellier.fr/hal-04604802

Soumis le : vendredi 7 juin 2024-13:50:49

Dernière modification le : jeudi 7 novembre 2024-16:14:03

Archivage à long terme le : dimanche 8 septembre 2024-18:41:52

Dates et versions

hal-04604802 , version 1 (07-06-2024)

Identifiants

HAL Id : hal-04604802 , version 1
DOI : 10.1109/ASAP61560.2024.00038

Citer

Paul Delestrac, Jonathan Miquel, Debjyoti Bhattacharjee, Diksha Moolchandani, Francky Catthoor, et al.. Analyzing GPU Energy Consumption in Data Movement and Storage. ASAP 2024 - IEEE 35th International Conference on Application-specific Systems, Architectures and Processors, Jul 2024, Hong Kong, Hong Kong SAR China. pp.143-151, ⟨10.1109/ASAP61560.2024.00038⟩. ⟨hal-04604802⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS LIRMM TDS-MACS ADAC SMARTIES UNIV-MONTPELLIER

154 Consultations

290 Téléchargements

Analyzing GPU Energy Consumption in Data Movement and Storage

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager