Locality-Aware Scheduling of Independant Tasks for Runtime Systems - Université Claude Bernard Lyon 1 Accéder directement au contenu
Rapport (Rapport De Recherche) Année : 2021

Locality-Aware Scheduling of Independant Tasks for Runtime Systems

Résumé

A now-classical way of meeting the increasing demand for computing speed by HPC applications is the use of GPUs and/or other accelerators. Such accelerators have their own memory, which is usually quite limited, and are connected to the main memory through a bus with bounded bandwidth. Thus, a particular care should be devoted to data locality in order to avoid unnecessary data movements. Task-based runtime schedulers have emerged as a convenient and efficient way to use such heterogeneous platforms. When processing an application, the scheduler has the knowledge of all tasks available for processing on a GPU, as well as their input data dependencies. Hence, it is able to order tasks and prefetch their input data in the GPU memory (after possibly evicting some previously-loaded data), while aiming at minimizing data movements, so as to reduce the total processing time. In this paper, we focus on how to schedule tasks that share some of their input data (but are otherwise independant) on a GPU. We provide a formal model of the problem, exhibit an optimal eviction strategy, and show that ordering tasks to minimize data movement is NP-complete. We review and adapt existing ordering strategies to this problem, and propose a new one based on tasks aggregation. These strategies have been implemented in the StarPU runtime system, which allows to test them on a variety of linear algebra problems. Our experiments demonstrate that using our new strategy together with the optimal eviction policy reduces the amount of data movement as well as the total processing time.
Fichier principal
Vignette du fichier
europar-paper.pdf (435.05 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03144290 , version 1 (17-02-2021)
hal-03144290 , version 2 (19-02-2021)
hal-03144290 , version 3 (23-02-2021)
hal-03144290 , version 4 (09-03-2021)
hal-03144290 , version 5 (20-05-2021)
hal-03144290 , version 6 (19-07-2021)
hal-03144290 , version 7 (31-08-2021)

Identifiants

  • HAL Id : hal-03144290 , version 3

Citer

Maxime Gonthier, Loris Marchal, Samuel Thibault. Locality-Aware Scheduling of Independant Tasks for Runtime Systems. [Research Report] Inria. 2021. ⟨hal-03144290v3⟩
332 Consultations
552 Téléchargements

Partager

Gmail Facebook X LinkedIn More