License: Creative Commons Attribution 3.0 Unported license (CC BY 3.0)
When quoting this document, please refer to the following
DOI: 10.4230/OASIcs.WCET.2014.43
URN: urn:nbn:de:0030-drops-46033
Go to the corresponding OASIcs Volume Portal

Hirvisalo, Vesa

On Static Timing Analysis of GPU Kernels

6.pdf (0.4 MB)


We study static timing analysis of programs running on GPU accelerators. Such programs follow a data parallel programming model that allows massive parallelism on manycore processors. Data parallel programming and GPUs as accelerators have received wide use during the recent years.

The timing analysis of programs running on single core machines is well known and applied also in practice. However for multicore and manycore machines, timing analysis presents a significant but yet not properly solved problem.

In this paper, we present static timing analysis of GPU kernels based on a method that we call abstract CTA simulation. Cooperative Thread Arrays (CTA) are the basic execution structure that GPU devices use in their operation that proceeds in thread groups called warps. Abstract CTA simulation is based on static analysis of thread divergence in warps and their abstract scheduling.

BibTeX - Entry

  author =	{Vesa Hirvisalo},
  title =	{{On Static Timing Analysis of GPU Kernels}},
  booktitle =	{14th International Workshop on Worst-Case Execution Time Analysis},
  pages =	{43--52},
  series =	{OpenAccess Series in Informatics (OASIcs)},
  ISBN =	{978-3-939897-69-9},
  ISSN =	{2190-6807},
  year =	{2014},
  volume =	{39},
  editor =	{Heiko Falk},
  publisher =	{Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{},
  URN =		{urn:nbn:de:0030-drops-46033},
  doi =		{10.4230/OASIcs.WCET.2014.43},
  annote =	{Keywords: Parallelism, WCET}

Keywords: Parallelism, WCET
Collection: 14th International Workshop on Worst-Case Execution Time Analysis
Issue Date: 2014
Date of publication: 08.07.2014

DROPS-Home | Fulltext Search | Imprint | Privacy Published by LZI