
bpo-35780: Fix errors in lru_cache() C code #11623

Merged
merged 30 commits into python:master on Jan 26, 2019

Conversation

@rhettinger (Contributor) commented Jan 20, 2019

rhettinger added 4 commits Jan 19, 2019
…k->next references.

This saves an unnecessary duplicate lookup.

Clang assembly before:

    movq    16(%rax), %rcx      # link->prev
    movq    24(%rax), %rdx      # link->next
    movq    %rdx, 24(%rcx)      # link->prev->next = link->next;
    movq    24(%rax), %rdx      # duplicate fetch of link->next
    movq    %rcx, 16(%rdx)      # link->next->prev = link->prev;

Clang assembly after:

    movq    16(%rax), %rcx
    movq    24(%rax), %rdx
    movq    %rdx, 24(%rcx)
    movq    %rcx, 16(%rdx)
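
At the source level, the change implied by the assembly is to load both neighbor pointers into locals before relinking, so the compiler is not forced to reload link->next after the first store (the store could alias it). A minimal sketch with a generic node type; the names are illustrative rather than the exact CPython code:

    typedef struct lru_list_elem {
        struct lru_list_elem *prev, *next;
    } lru_list_elem;

    /* Before: the store through link->prev may alias link->next,
       so the compiler must reload it for the second statement. */
    static void unlink_before(lru_list_elem *link)
    {
        link->prev->next = link->next;
        link->next->prev = link->prev;
    }

    /* After: both neighbors are read once into locals, producing the
       four-instruction sequence shown above. */
    static void unlink_after(lru_list_elem *link)
    {
        lru_list_elem *link_prev = link->prev;
        lru_list_elem *link_next = link->next;
        link_prev->next = link_next;
        link_next->prev = link_prev;
    }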
rhettinger added 4 commits Jan 20, 2019
was incorrectly moved to the newest position as if the user
had made a recent call with this key.  The fix is to restore
it to the oldest position, keeping the LRU invariant where keys
are tracked by recency of access.
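
For context, the list is circular with a root sentinel: root->next is the oldest link and root->prev is the newest, as in the pure Python implementation's circular list. A minimal sketch of the two placements, with illustrative helper names rather than the exact C code:

    typedef struct lru_list_elem {
        struct lru_list_elem *prev, *next;
    } lru_list_elem;

    /* Mark a link as most recently used: splice it in just before root. */
    static void append_link(lru_list_elem *root, lru_list_elem *link)
    {
        lru_list_elem *last = root->prev;
        last->next = root->prev = link;
        link->prev = last;
        link->next = root;
    }

    /* Restore a link as least recently used: splice it in just after root.
       This is the placement used when recovering from the error described
       above, so the LRU ordering is preserved. */
    static void prepend_link(lru_list_elem *root, lru_list_elem *link)
    {
        lru_list_elem *first = root->next;
        root->next = first->prev = link;
        link->prev = root;
        link->next = first;
    }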
Formerly, the code allowed the cache to fall into an inconsistent
state.  Now, there are no code paths that have a full cache
but no links.
@rhettinger rhettinger removed the performance Performance or resource usage label Jan 20, 2019
rhettinger added 2 commits Jan 20, 2019
Also move decrefs to the end of each path to make it easier to
verify that there are no reentrant calls before the cache
invariants have been restored.
Save the hit update for last (as the pure Python version does).
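
The discipline being described: hold the displaced objects in locals, finish all cache bookkeeping, and only then release the references, because a decref can run arbitrary Python code (e.g. a __del__ method) that re-enters the cache. A hypothetical sketch of the shape of that path, not the exact CPython code:

    #include <Python.h>

    typedef struct lru_list_elem {
        PyObject *key, *result;
        struct lru_list_elem *prev, *next;
    } lru_list_elem;

    /* Reuse an existing link for a new entry.  The old key/result are kept
       alive in locals until the list and dict are consistent again; only
       then are they released. */
    static void reuse_link(lru_list_elem *link, PyObject *key, PyObject *result)
    {
        PyObject *oldkey = link->key;
        PyObject *oldresult = link->result;

        link->key = key;            /* install the new references first */
        link->result = result;
        /* ... dict update and relinking of the list happen here ... */

        Py_DECREF(oldkey);          /* decrefs come last, after invariants hold */
        Py_DECREF(oldresult);
    }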
@tirkarthi (Member) commented Jan 20, 2019

Backporting to 3.6 would require approval from @ned-deily

rhettinger added 2 commits Jan 20, 2019
Limit to just common scalar types to make the space-saving
technique easier to reason about (easier to show correctness).
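
The space-saving technique is to use a sole positional argument directly as the cache key instead of wrapping it in a tuple; this commit restricts that shortcut to exact int and str values. A hedged, illustrative sketch of the check (the real logic lives in the module's key-building code):

    #include <Python.h>

    /* Illustrative only: use the bare argument as the key when it is a
       common scalar type; otherwise fall back to the normal tuple key. */
    static PyObject *make_key_sketch(PyObject *args, PyObject *kwds)
    {
        if (kwds == NULL && PyTuple_GET_SIZE(args) == 1) {
            PyObject *only = PyTuple_GET_ITEM(args, 0);
            if (PyUnicode_CheckExact(only) || PyLong_CheckExact(only)) {
                Py_INCREF(only);
                return only;        /* the scalar is its own key, saving a tuple */
            }
        }
        Py_INCREF(args);            /* simplified fallback: the args tuple */
        return args;
    }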
rhettinger added 9 commits Jan 22, 2019
Since the final setitem is potentially reentrant, we have to reset
the full status prior to the setitem call (we've already removed
a link and its associated cache dict entry).

After the setitem call, we cannot know whether some other thread
has reset the status, so we cannot just restore it without checking
whether the number of dict entries has reached maxsize.
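
A hedged sketch of the ordering this describes (illustrative struct and names): the full flag is cleared before the reentrant call, and afterwards it is only restored by re-measuring the dict.

    #include <Python.h>

    typedef struct {
        PyObject *cache;            /* the underlying dict */
        Py_ssize_t maxsize;
        int full;
    } cache_sketch;

    static int insert_new_entry(cache_sketch *self, PyObject *key, PyObject *result)
    {
        self->full = 0;             /* cleared before the potentially reentrant call */
        if (PyDict_SetItem(self->cache, key, result) < 0) {
            return -1;
        }
        /* do not blindly set full again: re-measure the dict instead */
        self->full = (PyDict_GET_SIZE(self->cache) >= self->maxsize);
        return 0;
    }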
Negative maxsize was being treated as a cache size of 1,
giving an almost 100% miss rate while still incurring
the overhead of cache checking and eviction.

The negative maxsize also showed up in CacheInfo even though
it was nonsensical to have a negative maxsize.

The negative maxsize also made it into the struct for the C version.
This caused erroneous results for the calculation of the "full" flag.
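
A minimal sketch of the fix, assuming (as the pure Python version now does) that a negative maxsize is simply clamped to zero so it behaves as "no caching":

    /* Clamp at construction time so CacheInfo never reports a negative
       maxsize and the "full" calculation never sees one. */
    if (maxsize < 0) {
        maxsize = 0;
    }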
It is cheaper and more reliable to make on-demand checks for
whether the cache is full than it is to recompute and occasionally
toggle a persistent state variable.
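
A hedged sketch of the on-demand check that supersedes the persistent flag used in the earlier sketch; measuring the dict at the point of use cannot drift out of sync the way a cached boolean can:

    #include <Python.h>

    typedef struct {
        PyObject *cache;            /* the underlying dict */
        Py_ssize_t maxsize;
    } cache_sketch;

    /* Computed on demand wherever the "full" decision is needed. */
    static inline int cache_is_full(cache_sketch *self)
    {
        return PyDict_GET_SIZE(self->cache) >= self->maxsize;
    }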
@rhettinger rhettinger merged commit d8080c0 into python:master Jan 26, 2019
@miss-islington (Contributor) commented Jan 26, 2019

Thanks @rhettinger for the PR 🌮🎉.. I'm working now to backport this PR to: 3.7.
🐍🍒🤖

miss-islington pushed a commit to miss-islington/cpython that referenced this pull request Jan 26, 2019
(cherry picked from commit d8080c0)

Co-authored-by: Raymond Hettinger <rhettinger@users.noreply.github.com>
@bedevere-bot commented Jan 26, 2019

GH-11682 is a backport of this pull request to the 3.7 branch.

Labels
type-bug An unexpected behavior, bug, or error
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

5 participants