gh-95027: Fix regrtest stdout encoding on Windows #98492

vstinner · 2022-10-20T16:03:08Z

On Windows, when the Python test suite is run with the -jN option, the ANSI code page is now used as the encoding for the stdout temporary file, rather than using UTF-8 which can lead to decoding errors.

Issue: Running tests in parallel on Windows quits too soon #95027

vstinner · 2022-10-20T16:04:56Z

I failed to write a test reproducing the bug in test_regrtest, since test_regrtest runs the test suite with stdout being a pipe, which works around the bug :-( I tried to add this test:

    def test_nonascii(self):
        character = chr(0xe9)
        line = f"nonascii: {character}"
        encoding = locale.getencoding()
        try:
            line.encode(encoding)
        except EncodingError:
            self.skipTest(f"fail to encode character U+00E9 "
                          f"to locale encoding {encoding}")

        code = textwrap.dedent(f"""
            import unittest

            class Tests(unittest.TestCase):
                def test_nonascii(self):
                    line = {line!a}
                    print(line)
        """)
        testname = self.create_test(code=code)

        # sequential execution: may use the terminal
        output = self.run_tests(testname)
        self.check_executed_tests(output, [testname])
        self.assertIn(line, output)

        # parallel execution: stdout is a temporary file
        output = self.run_tests("-j1", testname)
        self.check_executed_tests(output, [testname])
        self.assertIn(line, output)

On Windows, when the Python test suite is run with the -jN option, the ANSI code page is now used as the encoding for the stdout temporary file, rather than using UTF-8 which can lead to decoding errors.

vstinner · 2022-10-20T16:08:32Z

@zooba wrote PR #96669 which has more side effects. This PR basically just restores the behavior before commit 199ba23.

miss-islington · 2022-10-21T14:21:40Z

Thanks @vstinner for the PR 🌮🎉.. I'm working now to backport this PR to: 3.11.
🐍🍒⛏🤖

On Windows, when the Python test suite is run with the -jN option, the ANSI code page is now used as the encoding for the stdout temporary file, rather than using UTF-8 which can lead to decoding errors. (cherry picked from commit ec1f6f5) Co-authored-by: Victor Stinner <vstinner@python.org>

vstinner · 2022-10-21T14:22:57Z

I would prefer a formal review, but I merged my PR just to unblock the 3.11.0 final release (scheduled next Monday). Maybe if something can be enhanced, it can be done later. IMO this fix is better than the current situation. In short, it just restores the old behavior: encodings used before 199ba23

miss-islington · 2022-10-21T14:25:09Z

Thanks @vstinner for the PR 🌮🎉.. I'm working now to backport this PR to: 3.11.
🐍🍒⛏🤖

On Windows, when the Python test suite is run with the -jN option, the ANSI code page is now used as the encoding for the stdout temporary file, rather than using UTF-8 which can lead to decoding errors. (cherry picked from commit ec1f6f5) Co-authored-by: Victor Stinner <vstinner@python.org>

vstinner · 2022-10-21T14:27:33Z

3.11 backport: #98521 (review)

On Windows, when the Python test suite is run with the -jN option, the ANSI code page is now used as the encoding for the stdout temporary file, rather than using UTF-8 which can lead to decoding errors. (cherry picked from commit ec1f6f5) Co-authored-by: Victor Stinner <vstinner@python.org>

bedevere-bot added the awaiting core review label Oct 20, 2022

vstinner added the needs backport to 3.11 label Oct 20, 2022

gh-95027: Fix regrtest stdout encoding on Windows

7addb33

On Windows, when the Python test suite is run with the -jN option, the ANSI code page is now used as the encoding for the stdout temporary file, rather than using UTF-8 which can lead to decoding errors.

vstinner force-pushed the regrtest_encoding branch from 43cf8f9 to 7addb33 Compare Oct 20, 2022

vstinner mentioned this pull request Oct 20, 2022

Running tests in parallel on Windows quits too soon #95027

Closed

vstinner merged commit ec1f6f5 into python:main Oct 21, 2022
15 checks passed

bedevere-bot removed the awaiting core review label Oct 21, 2022

vstinner deleted the regrtest_encoding branch Oct 21, 2022

vstinner added needs backport to 3.11 and removed needs backport to 3.11 labels Oct 21, 2022

vstinner mentioned this pull request Oct 21, 2022

gh-95027: Ensure test runner uses utf-8:surrogateescape for communicating with subprocesses #96669

Closed

gh-95027: Fix regrtest stdout encoding on Windows #98492

gh-95027: Fix regrtest stdout encoding on Windows #98492

vstinner commented Oct 20, 2022 •

edited by bedevere-bot

vstinner commented Oct 20, 2022

vstinner commented Oct 20, 2022

miss-islington commented Oct 21, 2022

vstinner commented Oct 21, 2022

miss-islington commented Oct 21, 2022

vstinner commented Oct 21, 2022

gh-95027: Fix regrtest stdout encoding on Windows #98492

gh-95027: Fix regrtest stdout encoding on Windows #98492

Conversation

vstinner commented Oct 20, 2022 • edited by bedevere-bot

vstinner commented Oct 20, 2022

vstinner commented Oct 20, 2022

miss-islington commented Oct 21, 2022

vstinner commented Oct 21, 2022

miss-islington commented Oct 21, 2022

vstinner commented Oct 21, 2022

vstinner commented Oct 20, 2022 •

edited by bedevere-bot