
bpo-38938: Adjusting the heapq.merge algorithm for fewer comparisons #17729

Open · wants to merge 6 commits into base: master
Conversation

@sweeneyde (Contributor) commented Dec 28, 2019

This pull request would:

  • Change heapq.merge to be a class whose instances are iterable objects, rather than a generator function.
  • Change the algorithm for heapq.merge:
    • Before this PR, we repeatedly applied heapreplace on (index, iterator, item) tuples
      • Starting at the root, this would repeatedly replace parents with their smaller child. Upon reaching a leaf, it would replace that leaf using the __next__ method of the iterator corresponding to the item being replaced, and then sift the new leaf down into the appropriate part of the heap.
    • After this PR, we would apply a "tree sift" method (a rough Python sketch of the idea follows this list):
      • General algorithm: the items compete in a tournament, as we maintain a binary "tree of losers" (see Knuth, Volume 3, Section 5.4.1 on "Multiway Merging").
      • Here, we only need to store individual items, or key-item pairs.
      • After popping off the root, we replace it with its smaller child, the child with its smaller child, etc., until reaching a leaf. So far, this is identical to the standard siftup method.
      • However, once reaching a leaf, we refill that leaf with the next item from the iterator corresponding to the leaf's index. That is, the leftmost leaf is always refilled from the first iterable passed, the second-leftmost from the second, ..., and the rightmost leaf from the last iterable passed.
      • This eliminates the need to sift the new leaf value afterward, because we already know it is at least as large as the item it replaces.
      • For tree sizes that aren't a power of two, permuting the iterators so that the first one lands on the leftmost leaf is simply a matter of rotating all of the iterators around the list by a few spots.
      • Heuristically, removing the requirement that there be exactly one item from each iterator in the heap at a time makes the heap more likely to consist of the smallest remaining items (those closest to being yielded next).
      • Storing the tree flatly (in the C implementation, even when a key function is used), rather than having a tuple at each node, helps with cache locality.
      • This is a non-recursive version of recursively applying a function that merges two iterables.
  • Add the corresponding C and Python implementations.
  • Add a test case for error handling in heapq.merge.
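
To make the "tree sift" description above concrete, here is a rough, self-contained Python sketch of the idea. This is not the code in this PR: the names (tree_merge, _Least, _Greatest, refill) are mine, it pads the number of leaves up to a power of two with already-exhausted slots instead of rotating the iterators, and it omits the key=/reverse= handling and all of the C-level work.

```python
def tree_merge(*iterables):
    """Merge already-sorted iterables into a single sorted stream (sketch)."""

    class _Least:
        # Placeholder that sorts before everything; it fills the internal
        # nodes while the tree is being primed and is never yielded.
        def __lt__(self, other):
            return True
        def __gt__(self, other):
            return False

    class _Greatest:
        # Placeholder that sorts after everything; it marks exhausted leaves.
        def __lt__(self, other):
            return False
        def __gt__(self, other):
            return True

    LOW, HIGH = _Least(), _Greatest()
    iterators = [iter(it) for it in iterables]
    n = len(iterators)
    if n == 0:
        return

    # Flat complete binary tree: tree[1] is the root; tree[2*i] and
    # tree[2*i + 1] are the children of tree[i].  The last `size` slots are
    # the leaves, and leaf j is fed only by iterators[j].  (This sketch pads
    # up to a power of two with already-exhausted leaves; the actual patch
    # rotates the iterators instead, as described above.)
    size = 1
    while size < n:
        size *= 2
    leaf_base = size
    tree = [None] + [LOW] * (size - 1)          # index 0 is unused

    def refill(leaf):
        # Draw the next item from the iterator that owns this leaf,
        # or mark the leaf as (permanently) exhausted.
        j = leaf - leaf_base
        if j < n and iterators[j] is not None:
            try:
                return next(iterators[j])
            except StopIteration:
                iterators[j] = None
        return HIGH

    tree.extend(refill(leaf) for leaf in range(leaf_base, 2 * size))

    def pop_and_refill():
        # Walk the hole at the root down to a leaf, promoting the smaller
        # child at each level, then refill that leaf from its own iterator.
        # Sorted input guarantees the new item is >= the item that just left
        # the leaf, so no extra sifting is needed.
        node = 1
        while node < leaf_base:
            left, right = 2 * node, 2 * node + 1
            child = right if tree[right] < tree[left] else left
            tree[node] = tree[child]
            node = child
        tree[node] = refill(node)

    # Prime the tree: discard the size-1 placeholders so that every internal
    # node holds a real item (or HIGH) and the heap invariant is established.
    for _ in range(size - 1):
        pop_and_refill()

    while tree[1] is not HIGH:
        yield tree[1]
        pop_and_refill()
```

For example, list(tree_merge([1, 3, 5], [2, 4], [0, 6])) produces [0, 1, 2, 3, 4, 5, 6]. After the initial priming pass, each output item costs one comparison per level of the tree, i.e. roughly log2(number of iterables) comparisons.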

According to my benchmarking, this is at least 2.5 times as fast as the existing implementation in almost all cases, and especially fast (about 5x) with small numbers of iterables. I tested up to 12,000 iterables, with varying degrees of "balance" among the number of items in each. Also noteworthy is that, on average, the proposed method needed fewer comparisons.
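
Since the claim is about comparison counts as well as raw speed, here is one way to sanity-check the comparison-count part on its own. This harness is my own illustration, not the benchmark that produced the numbers above, and it exercises the pure-Python tree_merge sketch from earlier in this description (so it says nothing about the C timings): it wraps integers in a class that counts __lt__ calls and runs both the stdlib heapq.merge and the sketch over the same sorted runs.

```python
import heapq
import random
# tree_merge: the pure-Python sketch defined earlier in this description.

class Counted:
    """Integer wrapper that counts how many times __lt__ is called."""
    lt_calls = 0

    def __init__(self, value):
        self.value = value

    def __lt__(self, other):
        if not isinstance(other, Counted):
            return NotImplemented   # lets the sketch's sentinels handle it
        Counted.lt_calls += 1
        return self.value < other.value

def count_comparisons(merge_func, runs):
    # Wrap each run freshly and reset the counter before every measurement.
    Counted.lt_calls = 0
    wrapped = [[Counted(x) for x in run] for run in runs]
    for _ in merge_func(*wrapped):
        pass
    return Counted.lt_calls

for k in (2, 4, 16, 100):
    items = sorted(random.randrange(10**6) for _ in range(50_000))
    runs = [items[i::k] for i in range(k)]  # slices of a sorted list stay sorted
    print(f"{k:>4} iterables:",
          "heapq.merge =", count_comparisons(heapq.merge, runs),
          " tree sketch =", count_comparisons(tree_merge, runs))
```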

https://bugs.python.org/issue38938

Dennis Sweeney added 3 commits Dec 28, 2019