gh-97008: Add a Python implementation of AttributeError and NameError suggestions #97022

ambv · 2022-09-22T16:39:33Z

Relevant tests moved from test_exceptions to test_traceback to be able to compare both implementations.

Co-authored-by: Carl Friedrich Bolz-Tereick cfbolz@gmx.de

Issue: Provide NameError/AttributeError suggestions in traceback.py #97008

…eError suggestions Relevant tests moved from test_exceptions to test_traceback to be able to compare both implementations. Co-authored-by: Carl Friedrich Bolz-Tereick <cfbolz@gmx.de>

cfbolz · 2022-09-22T16:52:06Z

Lib/traceback.py

+
+    # Both strings are the same (by identity)
+    if a == b:
+        return 0


this is not by identity in Python ;-). I think we can just fix the comment instead of switching to is

cfbolz · 2022-09-22T16:52:06Z

Lib/traceback.py

+        b = b[1:]
+    while a and b and a[-1] == b[-1]:
+        a = a[:-1]
+        b = b[:-1]


with the slicing this is quadratic in the common affixes. we should compute the size of the pre/suffix and then do a single slice

iritkatriel · 2022-09-22T17:39:44Z

Lib/test/test_traceback.py

+class CPythonTracebackErrorCaretTests(
+    CAPIExceptionFormattingMixin,
+    TracebackErrorLocationCaretTests,
+):


I think it can be a bit tidier if it would follow the pattern of PyExcReportingTests / CExcReportingTests where the c mixin doesn't subclass the python mixin.

sweeneyde · 2022-09-22T20:41:02Z

Lib/traceback.py

+    # Instead of producing the whole traditional len(a)-by-len(b)
+    # matrix, we can update just one row in place.
+    # Initialize the buffer row
+    row = list(range(1, (_MOVE_COST * len(a)) + 1, _MOVE_COST))


Shouldn't we be filling this up with even numbers, not odd numbers?

list(range(_MOVE_COST, _MOVE_COST * (len(a) + 1), _MOVE_COST))

Here's a failing test case:

>>> _MOVE_COST=2 >>> assert traceback._levenshtein_distance("ABA", "AAB", 1000) == 2*_MOVE_COST Traceback (most recent call last): File "<stdin>", line 1, in <module> AssertionError

sweeneyde · 2022-09-22T20:44:15Z

Lib/test/test_traceback.py

+        # to also exercise the Python implementation
+
+        def CHECK(a, b, expected):
+            actual = traceback._levenshtein_distance(a, b, 4044)


This doesn't currently exercise the max_distance short-circuit paths very much, whereas _testinternalcapi.test_edit_cost does.

Could we add that and then make this same test code test both implementations?

This is a rough draft of a possible randomized test case:

def test_levenshtein_distance_random(self): from functools import cache from traceback import _substitution_cost, _MOVE_COST @cache def levenshtein(a, b): if not a or not b: return (len(a) + len(b)) * _MOVE_COST option1 = levenshtein(a[:-1], b[:-1]) + _substitution_cost(a[-1], b[-1]) option2 = levenshtein(a[:-1], b) + _MOVE_COST option3 = levenshtein(a, b[:-1]) + _MOVE_COST return min(option1, option2, option3) from random import choices, randrange for _ in range(1000): a = ''.join(choices("abcABC", k=randrange(10))) b = ''.join(choices("abcABC", k=randrange(10))) expected = levenshtein(a, b) res1 = traceback._levenshtein_distance(a, b, 1000) self.assertEqual(res1, expected, msg=(a, b)) for threshold in [expected, expected + 1, expected + 2]: # big enough thresholds shouldn't change the result res2 = traceback._levenshtein_distance(a, b, threshold) self.assertEqual(res2, expected, msg=(a, b, threshold)) for threshold in range(expected): # for small thresholds, the only piece of information # we receive is "strings not close enough". res3 = traceback._levenshtein_distance(a, b, threshold) self.assertGreater(res3, threshold, msg=(a, b, threshold))

pythongh-97008: Add a Python implementation of AttributeError and Nam…

ef572ff

…eError suggestions Relevant tests moved from test_exceptions to test_traceback to be able to compare both implementations. Co-authored-by: Carl Friedrich Bolz-Tereick <cfbolz@gmx.de>

ambv requested a review from pablogsal Sep 22, 2022

ambv requested a review from iritkatriel as a code owner Sep 22, 2022

bedevere-bot added the awaiting core review label Sep 22, 2022

cfbolz reviewed Sep 22, 2022

View changes

iritkatriel reviewed Sep 22, 2022

View changes

sweeneyde reviewed Sep 22, 2022

View changes

gh-97008: Add a Python implementation of AttributeError and NameError suggestions #97022

gh-97008: Add a Python implementation of AttributeError and NameError suggestions #97022

ambv commented Sep 22, 2022 •

edited by bedevere-bot

cfbolz Sep 22, 2022

cfbolz Sep 22, 2022

iritkatriel Sep 22, 2022

sweeneyde Sep 22, 2022 •

edited

sweeneyde Sep 22, 2022

sweeneyde Sep 22, 2022

sweeneyde Sep 22, 2022

gh-97008: Add a Python implementation of AttributeError and NameError suggestions #97022

Are you sure you want to change the base?

gh-97008: Add a Python implementation of AttributeError and NameError suggestions #97022

Conversation

ambv commented Sep 22, 2022 • edited by bedevere-bot

cfbolz Sep 22, 2022

Choose a reason for hiding this comment

cfbolz Sep 22, 2022

Choose a reason for hiding this comment

iritkatriel Sep 22, 2022

Choose a reason for hiding this comment

sweeneyde Sep 22, 2022 • edited

Choose a reason for hiding this comment

sweeneyde Sep 22, 2022

Choose a reason for hiding this comment

sweeneyde Sep 22, 2022

Choose a reason for hiding this comment

sweeneyde Sep 22, 2022

Choose a reason for hiding this comment

ambv commented Sep 22, 2022 •

edited by bedevere-bot

sweeneyde Sep 22, 2022 •

edited