License: Creative Commons Attribution 3.0 Unported license (CC BY 3.0)
When quoting this document, please refer to the following
DOI: 10.4230/LIPIcs.ESA.2016.38
URN: urn:nbn:de:0030-drops-63890
URL: http://dagstuhl.sunsite.rwth-aachen.de/volltexte/2016/6389/
Go to the corresponding LIPIcs Volume Portal


Edelkamp, Stefan ; Weiss, Armin

BlockQuicksort: Avoiding Branch Mispredictions in Quicksort

pdf-format:
LIPIcs-ESA-2016-38.pdf (0.6 MB)


Abstract

Since the work of Kaligosi and Sanders (2006), it is well-known that Quicksort - which is commonly considered as one of the fastest in-place sorting algorithms - suffers in an essential way from branch mispredictions. We present a novel approach to address this problem by partially decoupling control from data flow: in order to perform the partitioning, we split the input in blocks of constant size (we propose 128 data elements); then, all elements in one block are compared with the pivot and the outcomes of the comparisons are stored in a buffer. In a second pass, the respective elements are rearranged. By doing so, we avoid conditional branches based on outcomes of comparisons at all (except for the final Insertionsort). Moreover, we prove that for a static branch predictor the average total number of branch mispredictions is at most epsilon n log n + O(n) for some small epsilon depending on the block size when sorting n elements.

Our experimental results are promising: when sorting random integer data, we achieve an increase in speed (number of elements sorted per second) of more than 80% over the GCC implementation of C++ std::sort. Also for many other types of data and non-random inputs, there is still a significant speedup over std::sort. Only in few special cases like sorted or almost sorted inputs, std::sort can beat our implementation. Moreover, even on random input permutations, our implementation is even slightly faster than an implementation of the highly tuned Super Scalar Sample Sort, which uses a linear amount of additional space.

BibTeX - Entry

@InProceedings{edelkamp_et_al:LIPIcs:2016:6389,
  author =	{Stefan Edelkamp and Armin Weiss},
  title =	{{BlockQuicksort: Avoiding Branch Mispredictions in Quicksort}},
  booktitle =	{24th Annual European Symposium on Algorithms (ESA 2016)},
  pages =	{38:1--38:16},
  series =	{Leibniz International Proceedings in Informatics (LIPIcs)},
  ISBN =	{978-3-95977-015-6},
  ISSN =	{1868-8969},
  year =	{2016},
  volume =	{57},
  editor =	{Piotr Sankowski and Christos Zaroliagis},
  publisher =	{Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{http://drops.dagstuhl.de/opus/volltexte/2016/6389},
  URN =		{urn:nbn:de:0030-drops-63890},
  doi =		{10.4230/LIPIcs.ESA.2016.38},
  annote =	{Keywords: in-place sorting, Quicksort, branch mispredictions, lean programs}
}

Keywords: in-place sorting, Quicksort, branch mispredictions, lean programs
Collection: 24th Annual European Symposium on Algorithms (ESA 2016)
Issue Date: 2016
Date of publication: 18.08.2016


DROPS-Home | Fulltext Search | Imprint | Privacy Published by LZI