frame.setlineno has serious flaws. #94438

markshannon · 2022-06-30T11:30:04Z

The frame_setlineno function works in four stages:

1. Determine a set of possible bytecode offsets as targets from the line number.
2. Compute the stack state for these targets and the current position.
3. Determine the best target, that is, the first one that has a compatible stack.
4. Pop values from the stack and jump.
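
The stages above can be sketched as a toy model in Python. This is not the C code in frameobject.c: the names `candidate_offsets` and `pick_target`, the dictionary-based line table, and the tuple representation of stacks are all assumptions made for illustration, and "compatible" is assumed to mean that the target stack is a prefix of the current one.

```python
# Toy model of the four stages; every name and data layout here is hypothetical.

def candidate_offsets(line_table, lineno):
    """Stage 1: offsets of instructions that belong to the requested line."""
    return sorted(off for off, line in line_table.items() if line == lineno)


def pick_target(candidates, stacks, current_offset):
    """Stages 2-4: return (offset, values_to_pop) or None if no target fits.

    `stacks` maps an offset to a tuple describing the stack at that point
    (stage 2 is assumed to have been computed already).
    """
    current = stacks[current_offset]
    for off in candidates:
        target = stacks.get(off)
        if target is None:
            continue  # the stack analysis never reached this offset
        # Stage 3: assumed rule, the target stack must be a prefix of ours.
        if current[:len(target)] == target:
            # Stage 4: the caller would pop this many values, then jump.
            return off, len(current) - len(target)
    return None


line_table = {0: 1, 2: 2, 6: 2, 8: 3}
stacks = {0: (), 2: (), 6: ("iterator",), 8: ("iterator", "object")}

# Jumping back to line 2 from offset 8 pops two values and lands on offset 2.
print(pick_target(candidate_offsets(line_table, 2), stacks, 8))
# Jumping forward to line 3 from offset 2 is refused: values would be missing.
print(pick_target(candidate_offsets(line_table, 3), stacks, 2))
```

In this model a jump can only discard values, never invent them, which is why the forward jump into the deeper stack is rejected.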

The first step is faulty (I think; I haven't demonstrated this), as it might be possible to jump to an instruction involved in frame creation. This should be easy to fix using the new _co_firsttraceable field.

The second step has (at least) three flaws:

It does not account for NULLs on the stack, making it possible to jump from a stack with NULLs to one that cannot handle NULLs.
It does not skip over caches, so could produce incorrect stacks by misinterpreting cache entries as normal instructions.
It is out of date. For example it thinks that PUSH_EXC_INFO pushes three values. It only pushes one.
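
To illustrate the first flaw, here is a small Python sketch of a compatibility check that cannot tell NULL from an ordinary object. The kind names, `erase_nulls`, and `compatible` are hypothetical stand-ins, not the representation used in frameobject.c.

```python
# Hypothetical stack "kinds"; the real analysis encodes kinds differently.
OBJECT, NULL = "Object", "Null"


def compatible(from_stack, to_stack):
    """Assumed rule: the target must be no deeper, and kinds must match."""
    if len(to_stack) > len(from_stack):
        return False
    return all(f == t for f, t in zip(from_stack, to_stack))


def erase_nulls(stack):
    """Model an analysis that records NULL entries as plain objects."""
    return tuple(OBJECT if kind == NULL else kind for kind in stack)


current = (NULL, OBJECT)   # e.g. part-way through setting up a call
target = (OBJECT,)         # the target expects a real object at the bottom

# Without NULL tracking the jump looks fine; with it, the jump is refused.
print(compatible(erase_nulls(current), erase_nulls(target)))  # True
print(compatible(current, target))                            # False
```

The second check is the behaviour one would want: the bottom entry of the current stack is a NULL, so code at the target would misuse it.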

Setting the line number of a frame is only possible in the debugger, so this isn't as terrible as it might appear, but it definitely needs fixing.
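
For context, this is roughly how a debugger reaches this code path: a trace function installed with `sys.settrace` assigns to `frame.f_lineno` while handling a `line` event. The sketch below is a minimal, harmless jump between two statements with an empty evaluation stack; the helper names `target`, `make_tracer`, and `run_with_jump` are made up for the example.

```python
import sys


def target():
    out = []
    out.append(1)  # co_firstlineno + 2
    out.append(2)  # co_firstlineno + 3: skipped by the jump
    out.append(3)  # co_firstlineno + 4
    return out


def make_tracer(code, jump_from, jump_to):
    done = False

    def tracer(frame, event, arg):
        nonlocal done
        if frame.f_code is not code:
            return None  # do not trace unrelated frames
        if event == "line" and not done and frame.f_lineno == jump_from:
            frame.f_lineno = jump_to  # the assignment that triggers the jump
            done = True
        return tracer

    return tracer


def run_with_jump():
    first = target.__code__.co_firstlineno
    previous = sys.gettrace()
    sys.settrace(make_tracer(target.__code__, first + 3, first + 4))
    try:
        return target()
    finally:
        sys.settrace(previous)  # always restore the previous trace function


print(run_with_jump())  # the second append is skipped
```

Outside a trace function the same assignment raises an error, which is why ordinary programs cannot hit the flaws described here.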

pablogsal · 2022-07-04T22:16:50Z

Moving this back to release blocker because, apparently, this could end up requiring many changes.

I am missing some context on what this affects, so I changed it from deferred blocker to release blocker. If we think we can delay this to 3.12, please say so :)

iritkatriel · 2022-07-05T18:54:06Z

> It is out of date. For example it thinks that PUSH_EXC_INFO pushes three values. It only pushes one.

After failing to write a test that will crash on this, I analysed the code and realised that the "exception handling opcode" cases of the switch are no longer reachable: there is nothing that will initialise their stacks[] entries, so they get skipped by the continue; before the switch.

I created a PR to use my favourite macro in these cases: #94582

We could backport it, but we don't have to.

iritkatriel · 2022-07-05T22:13:45Z

> It does not skip over caches, so could produce incorrect stacks by misinterpreting cache entries as normal instructions.

I'm assuming this refers to the mark_stacks loop again, which is where stacks are produced.

The good news is that it looks like it just happens to work by accident. There is no case in the switch for the CACHE opcode, so it goes to the default case. Since the stack_effect of CACHE is 0, it just copies stacks[i] to stacks[i+1] unchanged. So the stack that is supposed to be on the next instruction will propagate as is until the first non-cache entry.
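
A toy Python version of that propagation, tracking only stack depth, shows why a zero-effect CACHE entry is harmless in the default case. The effect table and `mark_depths` are illustrative assumptions, not the real opcode metadata or the real mark_stacks.

```python
UNINITIALIZED = None

# Assumed stack effects for a handful of opcode names, for illustration only.
STACK_EFFECT = {"LOAD_FAST": 1, "BINARY_OP": -1, "CACHE": 0, "POP_TOP": -1}


def mark_depths(opnames):
    """Propagate stack depth from each instruction to the next one."""
    depths = [UNINITIALIZED] * (len(opnames) + 1)
    depths[0] = 0
    for i, op in enumerate(opnames):
        if depths[i] is UNINITIALIZED:
            continue  # mirrors the `continue;` before the switch
        # "Default case": apply the stack effect and store it for i + 1.
        depths[i + 1] = depths[i] + STACK_EFFECT[op]
    return depths


code = ["LOAD_FAST", "LOAD_FAST", "BINARY_OP", "CACHE", "POP_TOP"]
print(mark_depths(code))
```

The depth recorded after BINARY_OP is carried across the CACHE slot unchanged and reaches POP_TOP intact.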

We could make it more explicit like this:

diff --git a/Objects/frameobject.c b/Objects/frameobject.c
index 34a6c46c6b..ccdf7de990 100644
--- a/Objects/frameobject.c
+++ b/Objects/frameobject.c
@@ -220,6 +220,11 @@ mark_stacks(PyCodeObject *code_obj, int len)
             }
             opcode = _Py_OPCODE(code[i]);
             switch (opcode) {
+                case CACHE:
+                {
+                    stacks[i+1] = stacks[i];
+                    break;
+                }
                 case JUMP_IF_FALSE_OR_POP:
                 case JUMP_IF_TRUE_OR_POP:
                 case POP_JUMP_FORWARD_IF_FALSE:

thoughts?

sweeneyde · 2022-07-05T22:43:37Z

> There is no case in the switch for the CACHE opcode, so it goes to the default case.

Don't cache entries get populated at specialization time with data that overwrites the CACHE opcode? I don't think we're keeping around 0s in the bytecode that aren't subject to change. If I'm understanding right, the problem comes from misinterpreting that specialized cache data as if it's some bytecode that has a stack effect.

It looks like the dis module looks up _inline_cache_entries[deop] to determine how many cache entries to skip over. Would that work here?

iritkatriel · 2022-07-05T22:45:16Z

Ah I see! Would you like to try? (I'm signing off for the night soon.)

sweeneyde · 2022-07-05T23:00:07Z

I don't think I'm going to have time in the next couple of days to come up with a decent test case and PR and everything, but I am thinking that something vaguely like this could work:

@@ -213,12 +213,15 @@ mark_stacks(PyCodeObject *code_obj, int len)
     int todo = 1;
     while (todo) {
         todo = 0;
-        for (i = 0; i < len; i++) {
+        int ncaches;
+        for (i = 0; i < len; i += (1 + ncaches)) {
+            ncaches = 0;
             int64_t next_stack = stacks[i];
             if (next_stack == UNINITIALIZED) {
                 continue;
             }
-            opcode = _Py_OPCODE(code[i]);
+            opcode = _PyOpcode_Deopt[_Py_OPCODE(code[i])];
+            ncaches = _PyOpcode_Caches[opcode];
             switch (opcode) {
                 case JUMP_IF_FALSE_OR_POP:
                 case JUMP_IF_TRUE_OR_POP:

brandtbucher · 2022-07-06T20:24:28Z

> It does not skip over caches, so could produce incorrect stacks by misinterpreting cache entries as normal instructions

@iritkatriel is correct... nothing needs to be done about this. mark_stacks uses _PyCode_GetCode, which produces a "deoptimized" byte string equivalent to .co_code attribute access. Otherwise, we would need cases in this switch for all of the adaptive forms of these instructions.
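
The Python-visible side of this can be checked with a small experiment: the `co_code` bytes of a function should be the same before and after the function has been called many times. The helper name and the call count are arbitrary choices, and on interpreters that do not specialize bytecode the check is trivially true.

```python
def add_one(x):
    return x + 1


def bytecode_is_stable(func, calls=10_000):
    """Compare co_code before and after warming the function up."""
    before = func.__code__.co_code
    for _ in range(calls):
        func(1)
    # co_code is expected to expose the unspecialized instruction stream.
    return func.__code__.co_code == before


print(bytecode_is_stable(add_one))
```

This only exercises the public attribute, not mark_stacks itself, so it is a sanity check on the claim rather than a test of frame_setlineno.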

So I think this issue can be closed?

iritkatriel · 2022-07-06T20:28:45Z

Does it make sense to add some test that exercises this function on highly specialised/warmed up code? I think all the tests just run some function a small number of times.

sweeneyde · 2022-07-06T20:38:38Z

Ah, my mistake, I missed the _PyCode_GetCode.

brandtbucher · 2022-07-06T20:38:43Z

Well, the function is designed to totally ignore warmed-up code and just use the deoptimized string. So new tests would keep us from accidentally using it in the future, but I'm pretty sure other stuff here would start breaking much sooner (tons of instructions would be hitting the default case and would end up having a stack effect of PY_INVALID_STACK_EFFECT, which is INT_MAX).

Perhaps it would just be better to assert(delta != PY_INVALID_STACK_EFFECT)?

iritkatriel · 2022-07-06T20:46:28Z

Let’s close then.

brandtbucher · 2022-07-21T23:04:05Z

Reopening because extended arguments aren't handled correctly. :(
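
As background, an instruction's full argument is built by folding any preceding EXTENDED_ARG prefixes into it, eight bits at a time. The sketch below shows that folding on a simplified list of `(opname, arg)` pairs; it illustrates the encoding only and is not the change made in GH-95110. The function name `decode` and the sample instructions are assumptions.

```python
def decode(units):
    """Yield (index, opname, full_oparg), folding EXTENDED_ARG prefixes."""
    extended = 0
    for index, (opname, arg) in enumerate(units):
        arg |= extended
        if opname == "EXTENDED_ARG":
            # Shift the accumulated value to make room for the next byte.
            extended = arg << 8
            continue
        extended = 0
        yield index, opname, arg


units = [("EXTENDED_ARG", 1), ("JUMP_FORWARD", 2), ("NOP", 0)]
print(list(decode(units)))
```

An analysis that reads only the low byte of the jump would see an argument of 2 instead of 258 and mark the wrong target.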

brandtbucher · 2022-07-23T00:05:28Z

Removing release blocker status since all known crashes are fixed.

I'm still going to leave this open, though, since the four "is None" jumps aren't handled yet (which just means that we aren't able to jump to some valid locations).
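
A toy depth analysis in Python shows why a jump opcode without its own case costs reachable targets rather than safety. It assumes that an unhandled opcode only propagates its stack effect to the next instruction, so a block reachable only through the jump stays uninitialized. The instruction names other than POP_JUMP_IF_NONE, the absolute-index jump targets, and the single forward pass are simplifications.

```python
UNINITIALIZED = None


def mark_depths(code, jump_is_handled):
    """Single forward pass; jump args are absolute instruction indices."""
    depths = [UNINITIALIZED] * (len(code) + 1)
    depths[0] = 0
    for i, (op, arg) in enumerate(code):
        depth = depths[i]
        if depth is UNINITIALIZED:
            continue
        if op == "LOAD":
            depths[i + 1] = depth + 1
        elif op == "POP_JUMP_IF_NONE":
            if jump_is_handled:
                depths[arg] = depth - 1  # also mark the jump target
            depths[i + 1] = depth - 1    # fall-through path
        elif op == "RETURN":
            pass  # control never falls through to the next instruction


    return depths


code = [
    ("LOAD", None),
    ("POP_JUMP_IF_NONE", 4),
    ("LOAD", None),
    ("RETURN", None),
    ("LOAD", None),    # index 4: reachable only through the jump
    ("RETURN", None),
]
print(mark_depths(code, jump_is_handled=True)[4])
print(mark_depths(code, jump_is_handled=False)[4])
```

With the jump handled, index 4 gets a known depth of 0 and is a legal target; without it, index 4 stays uninitialized and a jump there would be refused even though it is valid.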

iritkatriel · 2022-08-26T16:20:23Z

I just noticed that the switch in mark_stacks doesn't have cases for POP_JUMP_IF_NONE/POP_JUMP_IF_NOT_NONE.

brandtbucher · 2022-08-26T19:02:15Z

Yep. See my above comment. :)

iritkatriel · 2022-08-26T19:04:10Z

ah yeah :)

gvanrossum · 2023-02-27T21:20:50Z

Shall we nevertheless close this, and open a more specific issue?

markshannon · 2023-02-27T23:19:43Z

Yes.
Most, if not all, of the flaws I listed have been fixed.

markshannon added the type-crash and deferred-blocker labels Jun 30, 2022

markshannon assigned iritkatriel and markshannon Jun 30, 2022

bedevere-bot mentioned this issue Jun 30, 2022

GH-94438: Account for NULLs on evaluation stack when jumping lines. #94444

Merged

markshannon mentioned this issue Jun 30, 2022

GC crash _PyObject_AssertFailed with pdb #94215

Closed

markshannon added the 3.11 and 3.12 labels Jun 30, 2022

bedevere-bot mentioned this issue Jul 1, 2022

[3.11] GH-94438: Account for NULLs on evaluation stack when jumping lines. (GH-94444) #94484

Closed

miss-islington pushed a commit to miss-islington/cpython that referenced this issue Jul 1, 2022

pythonGH-94438: Account for NULLs on evaluation stack when jumping li…

edfdd6e

…nes. (pythonGH-94444) (cherry picked from commit be80db1) Co-authored-by: Mark Shannon <mark@hotpy.org>

markshannon added a commit that referenced this issue Jul 1, 2022

GH-94438: Account for NULLs on evaluation stack when jumping lines. (G…

be80db1

…H-94444)

bedevere-bot mentioned this issue Jul 1, 2022

[3.11] GH-94438: Backport GH-94444 #94486

Merged

markshannon added a commit that referenced this issue Jul 1, 2022

[3.11] GH-94438: Backport GH-94444 (#94486)

02b30a8

* Account for NULLs on evaluation stack when jumping lines.

pablogsal added release-blocker and removed deferred-blocker labels Jul 4, 2022

iritkatriel added a commit to iritkatriel/cpython that referenced this issue Jul 5, 2022

pythongh-94438: in frameobject's mark_stacks switch, the PUSH_EXC_INF…

9efd9e0

…O and POP_EXCEPT cases are no longer reachable

bedevere-bot mentioned this issue Jul 5, 2022

gh-94438: in frameobject's mark_stacks switch, the PUSH_EXC_INFO and POP_EXCEPT cases are no longer reachable #94582

Merged

iritkatriel added a commit to iritkatriel/cpython that referenced this issue Jul 5, 2022

pythongh-94438: in frameobject's mark_stacks, add explicit case for C…

0043d48

…ACHE opcodes which leaves the stack unchanged

bedevere-bot mentioned this issue Jul 5, 2022

gh-94438: in frameobject's mark_stacks, add explicit case for CACHE opcodes which leaves the stack unchanged #94586

Closed

iritkatriel added a commit that referenced this issue Jul 6, 2022

gh-94438: in frameobject's mark_stacks switch, the PUSH_EXC_INFO and …

50b9a77

…POP_EXCEPT cases are no longer reachable (GH-94582)

bedevere-bot mentioned this issue Jul 6, 2022

[3.11] gh-94438: in frameobject's mark_stacks switch, the PUSH_EXC_INFO and POP_EXCEPT cases are no longer reachable (GH-94582) #94595

Merged

iritkatriel assigned brandtbucher Jul 6, 2022

iritkatriel closed this as completed Jul 6, 2022

This was referenced Jul 7, 2022

Publish C-level coverage for the test suite faster-cpython/ideas#426

Closed

Publish C-level coverage of the CPython test suite #94759

Open

brandtbucher reopened this Jul 21, 2022

bedevere-bot mentioned this issue Jul 21, 2022

GH-94438: Handle extended arguments and conditional pops in mark_stacks #95110

Merged

brandtbucher added a commit that referenced this issue Jul 22, 2022

GH-94438: Handle extended arguments and conditional pops in mark_stac…

e4d3a96

…ks (GH-95110)

bedevere-bot mentioned this issue Jul 22, 2022

[3.11] GH-94438: Handle extended arguments and conditional pops in mark_stacks (GH-95110) #95154

Merged

miss-islington added a commit that referenced this issue Jul 22, 2022

GH-94438: Handle extended arguments and conditional pops in mark_stac…

bbdacb4

…ks (GH-95110) (cherry picked from commit e4d3a96) Co-authored-by: Brandt Bucher <brandtbucher@microsoft.com>

brandtbucher added the type-bug label and removed the release-blocker and type-crash labels Jul 23, 2022

gvanrossum closed this as completed Feb 27, 2023
