Faster sorting (run formation)

Open Mortal opened this issue 12 years ago • 2 comments

In the merge sorter, we need to overlap internal sorting (highly CPU bound) and writing the sorted runs (highly I/O bound).

We should split the available internal memory in two, using one half for accumulating a sorted run and the other half as an I/O buffer for the sorted elements. Sorting and I/O should be overlapped explicitly (e.g. with two separate threads) to run both in parallel.

Oct 24 '13 12:10 Mortal

An exploratory implementation of the merge sorter has been implemented in the parallel-merge-phase branch. A progress document has been created for this issue in order to keep track of test results. https://gist.github.com/svendcsvendsen/11253182

Apr 24 '14 12:04 svendcs

This branch by Svend seems to have been removed and the gist is also dead. Is this still a desired feature or did it turn out not to improve performance as much as one hoped?

Jul 27 '22 13:07 ssoelvsten