Improving Sorting Algorithm Performance by Optimising Branch Prediction

This repository contains the benchmarking code for the papers "Improving Quicksort Performance by Optimizing Branch Prediction" and "Improving Mergesort Performance by Optimizing Branch Prediction".

The implementation of the algorithms "Pattern-exploiting quicksort" (PEQS) and "Pattern-exploiting mergesort" (PEMS) can be found in the files quick_peqs.h and merge_pems.h++ respectively.

Publications

This repository contains code for the papers:

Improving Quicksort Performance by Optimizing Branch Prediction

Abstract

By detecting arrays with a low degree of presortedness, an adaptive quicksort algorithm can switch to a branch free implementation in order to reduce the runtime by up to 2.5 times compared to traditional implementations and outperform strictly branch free algorithms on arrays with a high degree of presortedness. By using a fast method of detecting the degree of presortedness, no prior knowledge about the array composition is required for the algorithm.
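
The idea in the abstract can be illustrated with a minimal sketch. This is not the PEQS implementation from quick_peqs.h: the helper names (sampled_order_ratio, partition_branchy, partition_branchless, adaptive_sort), the Lomuto partition scheme, and the 0.1/0.9 thresholds are all assumptions chosen for illustration. It estimates presortedness from adjacent pairs, then picks a branch-free partition for random-looking input and a branching one otherwise.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical helper: fraction of sampled adjacent pairs that are already
// in ascending order. Near 1.0 = ascending, near 0.0 = descending.
inline double sampled_order_ratio(const std::vector<int>& a, std::size_t stride = 1) {
    assert(stride > 0);
    std::size_t in_order = 0, total = 0;
    for (std::size_t i = 0; i + 1 < a.size(); i += stride) {
        in_order += (a[i] <= a[i + 1]);
        ++total;
    }
    return total == 0 ? 1.0 : static_cast<double>(in_order) / static_cast<double>(total);
}

// Lomuto partition with a data-dependent branch (cheap when it is predictable).
inline std::size_t partition_branchy(std::vector<int>& a, std::size_t lo, std::size_t hi) {
    const int pivot = a[hi];
    std::size_t store = lo;
    for (std::size_t i = lo; i < hi; ++i) {
        if (a[i] < pivot) {
            std::swap(a[i], a[store]);
            ++store;
        }
    }
    std::swap(a[store], a[hi]);
    return store;
}

// Lomuto partition without a data-dependent branch: always swap, then
// advance the store index by the comparison result (0 or 1).
inline std::size_t partition_branchless(std::vector<int>& a, std::size_t lo, std::size_t hi) {
    const int pivot = a[hi];
    std::size_t store = lo;
    for (std::size_t i = lo; i < hi; ++i) {
        const bool smaller = a[i] < pivot;
        std::swap(a[i], a[store]);
        store += smaller;
    }
    std::swap(a[store], a[hi]);
    return store;
}

inline void quicksort_range(std::vector<int>& a, std::size_t lo, std::size_t hi, bool branchless) {
    while (lo < hi) {
        // Middle element as pivot so presorted input does not degrade to O(n^2).
        std::swap(a[lo + (hi - lo) / 2], a[hi]);
        const std::size_t p = branchless ? partition_branchless(a, lo, hi)
                                         : partition_branchy(a, lo, hi);
        if (p > lo) quicksort_range(a, lo, p - 1, branchless);
        lo = p + 1;  // iterate on the right part instead of recursing
    }
}

inline void adaptive_sort(std::vector<int>& a) {
    if (a.size() < 2) return;
    const double ratio = sampled_order_ratio(a);
    // Assumed thresholds: anything between them is treated as "random-looking".
    const bool low_presortedness = ratio > 0.1 && ratio < 0.9;
    quicksort_range(a, 0, a.size() - 1, low_presortedness);
}
```

Calling adaptive_sort(v) on a std::vector<int> sorts it in place. The single up-front check is a simplification; an adaptive algorithm could equally re-evaluate the choice per subarray.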

Citation

@INPROCEEDINGS{10089318,
  author={Peeters, Jonas and Haase, Jan},
  booktitle={2022 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)}, 
  title={Improving Quicksort Performance by Optimizing Branch Prediction}, 
  year={2022},
  pages={1-6},
  keywords={Computer science;Runtime;Measurement uncertainty;Adaptive arrays;Switches;Prediction algorithms;Data engineering;in-place sorting;quicksort;branch prediction;lean programs},
  doi={10.1109/CSDE56538.2022.10089318}}

Improving Mergesort Performance by Optimizing Branch Prediction

Abstract

By detecting arrays with a low degree of presortedness, an adaptive mergesort algorithm can switch to a branch free implementation in order to reduce the runtime by up to 30% compared to traditional implementations and almost 50% compared to the mergesort based Timsort, while retaining the great performance of branch based variants on partially presorted arrays. By using a fast method of detecting the degree of presortedness, no prior knowledge about the array composition is required for the algorithm.
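
As a rough illustration of what a branch-free merge step can look like, the sketch below selects the next element with a conditional expression and advances the indices arithmetically. It is not the PEMS code from merge_pems.h++; the function names merge_branchless and mergesort_branchless and the bottom-up structure are assumptions made for this example, and the presortedness detection is left out.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Merge two sorted runs into `out` without a data-dependent branch in the
// main loop. Ties are taken from the left run, which keeps the merge stable.
inline void merge_branchless(const int* left, std::size_t n_left,
                             const int* right, std::size_t n_right, int* out) {
    std::size_t i = 0, j = 0, k = 0;
    while (i < n_left && j < n_right) {
        const bool take_right = right[j] < left[i];
        // Compilers typically turn this select into a conditional move.
        out[k++] = take_right ? right[j] : left[i];
        j += take_right;
        i += !take_right;
    }
    // Copy whatever remains of either run.
    while (i < n_left) out[k++] = left[i++];
    while (j < n_right) out[k++] = right[j++];
}

// Bottom-up mergesort built on the merge above (illustrative only).
inline void mergesort_branchless(std::vector<int>& a) {
    std::vector<int> buf(a.size());
    for (std::size_t width = 1; width < a.size(); width *= 2) {
        for (std::size_t lo = 0; lo < a.size(); lo += 2 * width) {
            const std::size_t mid = std::min(lo + width, a.size());
            const std::size_t hi = std::min(lo + 2 * width, a.size());
            merge_branchless(a.data() + lo, mid - lo,
                             a.data() + mid, hi - mid, buf.data() + lo);
        }
        a.swap(buf);  // the merged pass becomes the input of the next pass
    }
}
```

The loop-termination checks remain branches, but they are highly predictable; only the comparison of the two run heads depends on the data, and that is the part expressed without a branch here.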

Citation

@INPROCEEDINGS{10089293,
  author={Peeters, Jonas and Haase, Jan},
  booktitle={2022 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE)}, 
  title={Improving Mergesort Performance by Optimizing Branch Prediction}, 
  year={2022},
  pages={1-6},
  keywords={Computer science;Runtime;Measurement uncertainty;Adaptive arrays;Switches;Prediction algorithms;Data engineering;sorting;mergesort;branch prediction;lean programs},
  doi={10.1109/CSDE56538.2022.10089293}
}

Benchmarking

The main benchmark program is in benchy.c++.
output-clang-c++-full.csv and output-gcc-c++-full.csv contain the benchmark results for all algorithms, compiled with Clang and GCC respectively.

Compiling benchy.c++ requires the PAPI (Performance Application Programming Interface) library.

With the authors' current configuration, the following commands compile the program successfully:

$ clang++-12 -O3 -mtune=native -march=native -I/usr/include/x86_64-linux-gnu/c++/10/bits -lm -L/usr/local/lib -lpapi -std=c++17 benchy.c++ -o benchy
$ g++-10 -O3 -mtune=native -march=native -I/usr/include/x86_64-linux-gnu/c++/10/bits -lm -L/usr/local/lib -lpapi -std=c++17 benchy.c++ -o benchy

The benchmark program has to be started with admin rights, which are required for access to the CPU performance counters.

The program can be supplied with up to four arguments:

1. Algorithm name: substring-matched against the list in benchy.c++::122-137.
2. "Before sort action": the action performed on the input array before sorting:
   0 = NOTHING
   1 = PRESORTED
   2 = SORT_REVERSE
   3 = END_RANDOM
   4 = LIMIT_VALUES_1000
   5 = LIMIT_VALUES_10000
3. Array size to test.
4. Seed for rand(); if omitted, the seeds 1-5 are used.
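
To make the argument order concrete, here is a minimal sketch of how the four positional arguments could be parsed. It is not taken from benchy.c++: the BenchArgs struct, parse_args, and matches_algorithm are hypothetical names, and representing omitted arguments as empty optionals is an assumption, since the program's actual defaults (other than the seeds) are not stated here. Only the enumerator names and their numeric codes come from the list above.

```cpp
#include <cstddef>
#include <optional>
#include <stdexcept>
#include <string>

// Numeric codes as documented for the "before sort action" argument.
enum class BeforeSortAction {
    NOTHING = 0,
    PRESORTED = 1,
    SORT_REVERSE = 2,
    END_RANDOM = 3,
    LIMIT_VALUES_1000 = 4,
    LIMIT_VALUES_10000 = 5
};

// Hypothetical container for the parsed command line.
struct BenchArgs {
    std::string algorithm_filter;            // empty: no filter given
    std::optional<BeforeSortAction> action;  // empty: argument omitted
    std::optional<std::size_t> array_size;   // empty: argument omitted
    std::optional<unsigned> seed;            // empty: fall back to seeds 1-5
};

// Parse up to four positional arguments in the documented order.
// std::stoi / std::stoul throw on non-numeric input.
inline BenchArgs parse_args(int argc, const char* const* argv) {
    BenchArgs args;
    if (argc > 1) args.algorithm_filter = argv[1];
    if (argc > 2) {
        const int code = std::stoi(argv[2]);
        if (code < 0 || code > 5) {
            throw std::out_of_range("before-sort action must be between 0 and 5");
        }
        args.action = static_cast<BeforeSortAction>(code);
    }
    if (argc > 3) args.array_size = std::stoul(argv[3]);
    if (argc > 4) args.seed = static_cast<unsigned>(std::stoul(argv[4]));
    return args;
}

// Substring match of the filter against an algorithm name.
inline bool matches_algorithm(const std::string& name, const std::string& filter) {
    return name.find(filter) != std::string::npos;
}
```

With this sketch, a command line whose arguments are "quick", "2", "100000" and "42" would select every algorithm whose name contains "quick", reverse-sort the input first, test arrays of 100000 elements, and seed rand() with 42.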

If the application stops with a PAPI error complaining about too many different CPU counters, try removing some from the list at benchy.c++::155-159, as your CPU supports fewer counters than the list contains.

The application outputs the average number of CPU cycles for the largest array tested, for each combination of algorithm and test. The full results are written to output-clang-c++.csv or output-gcc-c++.csv, depending on the compiler used.

