Skip to content

obmalloc debug info can produce invalidly encoded output #95574

Open
@Yhg1s

Description

@Yhg1s

obmalloc's debug mode (specifically _PyMem_DebugCheckAddress() and _PyObject_DebugDumpAddress()) can produce output that is invalid in the text encoding being used by the process. This is because it prints the api_id byte to stderr with fprintf() using the %c format, and the byte can be corrupted. Because of the nature of these checks (they try to prevent memory corruption) it is actually fairly likely that the byte is corrupted, which can mean the output can't then be easily handled by tooling that expects a specific encoding (like UTF-8).

It would probably make sense to check whether the value of the api_id is a printable ASCII character before trying to print it, and resort to printing the value with %x instead.

Metadata

Metadata

Assignees

Labels

interpreter-core(Objects, Python, Grammar, and Parser dirs)type-bugAn unexpected behavior, bug, or error

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions