bpo-35214: Fix OOB memory access in unicode escape parser #10506

gpshead · 2018-11-13T08:42:53Z

Discovered using clang's MemorySanitizer when it ran
test_fstring's test_misformed_unicode_character_name.

An f-string ending in \N would access one byte beyond the end of
the string while looking for a potential }.

https://bugs.python.org/issue35214

gpshead · 2018-11-13T08:44:50Z

Actually, my news entry and description may not be correct. This may not be specific to f-strings; it may affect any unicode string escape parsing.

gpshead · 2018-11-13T17:59:19Z

Confirmed: this is in the unicode escape parser and is not f-string related. That just happened to be the unit test that triggered it.

$ ./python -c 'u"\N"'
==17380==WARNING: MemorySanitizer: use-of-uninitialized-value
    #0 0x6a8d8f in _PyUnicode_DecodeUnicodeEscape /home/gps/test/b37/../cpython/Objects/unicodeobject.c:6045:17
    #1 0xbcd1c8 in decode_unicode_with_escapes /home/gps/test/b37/../cpython/Python/ast.c:4208:9
    #2 0xbcaf2c in parsestr /home/gps/test/b37/../cpython/Python/ast.c:5175:23
    #3 0xbc8be3 in parsestrplus /home/gps/test/b37/../cpython/Python/ast.c:5206:13
    #4 0xbc6516 in ast_for_atom /home/gps/test/b37/../cpython/Python/ast.c:2107:23
    #5 0xbc57af in ast_for_atom_expr /home/gps/test/b37/../cpython/Python/ast.c:2464:9
    #6 0xbc1796 in ast_for_power /home/gps/test/b37/../cpython/Python/ast.c:2501:9
    #7 0xbbec1d in ast_for_expr /home/gps/test/b37/../cpython/Python/ast.c:2689:20
    #8 0xbb99a4 in ast_for_testlist /home/gps/test/b37/../cpython/Python/ast.c:2878:16
    #9 0xbd606e in ast_for_expr_stmt /home/gps/test/b37/../cpython/Python/ast.c:2901:21
    #10 0xbb92cf in ast_for_stmt /home/gps/test/b37/../cpython/Python/ast.c:3983:24
    #11 0xbb820a in PyAST_FromNodeObject /home/gps/test/b37/../cpython/Python/ast.c:799:25
    #12 0x88d4cb in PyParser_ASTFromStringObject /home/gps/test/b37/../cpython/Python/pythonrun.c:1184:15
    #13 0x889f61 in PyRun_StringFlags /home/gps/test/b37/../cpython/Python/pythonrun.c:957:11
    #14 0x889c33 in PyRun_SimpleStringFlags /home/gps/test/b37/../cpython/Python/pythonrun.c:455:9
...
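On a build that includes the fix, the same input is expected to be rejected with an ordinary error rather than read past the buffer. The following is a minimal sketch, assuming a standard CPython 3 interpreter (not an msan build). The helper name `check_truncated_named_escape` is invented for illustration and is not part of this PR; it only collects the exceptions raised on the two paths that reach the escape parser.

```python
import codecs


def check_truncated_named_escape():
    """Return the errors raised for a string that ends right after \\N."""
    errors = []

    # Decoding path: the unicode_escape codec sees the two bytes '\' and 'N'.
    try:
        codecs.decode(b"\\N", "unicode_escape")
    except UnicodeDecodeError as exc:
        errors.append(exc)

    # Compile path: the same escape inside a string literal in source code,
    # equivalent to the ./python -c 'u"\N"' command shown in this thread.
    try:
        compile('u"\\N"', "<example>", "eval")
    except SyntaxError as exc:
        errors.append(exc)

    return errors


if __name__ == "__main__":
    for err in check_truncated_named_escape():
        print(type(err).__name__, err)
```

Running this under a sanitizer-instrumented build is what would reveal the out-of-bounds read; on a regular build it only demonstrates the error handling.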

gpshead · 2018-11-13T18:08:07Z

In terms of severity, this was only ever a single byte read. The code path it entered if that byte happened to be a } did check the bounds and would result in the desired error message (which is why this went unnoticed for so long). With most memory allocations it is unlikely that this byte would be on an unmapped page and crash the program, though I suspect it would be possible to trigger that by using a large enough string to generate such a specific allocation.

At worst, that is an interpreter crash from an errant memory read, which amounts to a denial of service if someone is handing user data to the unicode escape parser.
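The control flow described in this comment can be modelled in a few lines. This is a hypothetical Python model, not the C code from `Objects/unicodeobject.c`: the function `parse_named_escape` and its `check_bounds` flag are invented for illustration, and the model assumes the first unchecked read tests for the opening brace of a `\N{NAME}` escape. The actual diff is not shown in this thread. Python raises `IndexError` where C would silently read the adjacent byte, which makes the missing check visible.

```python
def parse_named_escape(buf: bytes, pos: int, check_bounds: bool = True) -> str:
    """Model the parser state just after it has consumed a backslash and 'N'.

    `pos` indexes the byte following 'N'. Returns the character name between
    the braces, or raises ValueError for a malformed escape.
    """
    end = len(buf)
    # Assumed shape of the guard: test the position before the first read.
    # With check_bounds=False the read happens unconditionally, as in the bug.
    if (not check_bounds or pos < end) and buf[pos] == ord("{"):
        start = pos + 1
        close = start
        # The inner scan is bounded, matching the PR discussion's note that
        # this code path already checked the bounds.
        while close < end and buf[close] != ord("}"):
            close += 1
        if start < close < end:
            return buf[start:close].decode("ascii")
    raise ValueError("malformed \\N character escape")


if __name__ == "__main__":
    print(parse_named_escape(b"\\N{DIGIT ONE}", 2))
    try:
        parse_named_escape(b"\\N", 2, check_bounds=False)
    except IndexError:
        print("unchecked read went past the end of the buffer")
```

With the guard in place, a buffer that ends right after `\N` falls through to the malformed-escape error without touching any byte outside the buffer.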

miss-islington · 2018-11-13T21:16:56Z

Thanks @gpshead for the PR 🌮🎉.. I'm working now to backport this PR to: 2.7, 3.6, 3.7.
🐍🍒⛏🤖

…0506) Discovered using clang's MemorySanitizer when it ran python3's test_fstring test_misformed_unicode_character_name. An msan build will fail by simply executing: ./python -c 'u"\N"' (cherry picked from commit 746b2d3) Co-authored-by: Gregory P. Smith <[email protected]>

bedevere-bot · 2018-11-13T21:17:10Z

GH-10522 is a backport of this pull request to the 3.7 branch.

bedevere-bot · 2018-11-13T21:17:17Z

GH-10523 is a backport of this pull request to the 3.6 branch.

miss-islington · 2018-11-13T21:17:18Z

Sorry, @gpshead, I could not cleanly backport this to 2.7 due to a conflict.
Please backport using cherry_picker on command line.
cherry_picker 746b2d35ea47005054ed774fecaed64fab803d7d 2.7

…0506) (GH-10522) Discovered using clang's MemorySanitizer when it ran python3's test_fstring test_misformed_unicode_character_name. An msan build will fail by simply executing: ./python -c 'u"\N"' (cherry picked from commit 746b2d3) Co-authored-by: Gregory P. Smith <[email protected]> https://bugs.python.org/issue35214

bedevere-bot · 2018-11-14T01:27:35Z

GH-10538 is a backport of this pull request to the 2.7 branch.

…0506) (GH-10538) Discovered using clang's MemorySanitizer. A msan build will fail by simply executing: ./python -c 'u"\N"' (cherry picked from commit 746b2d3) Co-authored-by: Gregory P. Smith <[email protected]> [Google LLC]

gpshead added 2 commits November 13, 2018 00:30

bpo-35214: Fix OOB memory access in f-string parser.

782568a

Add a news entry.

1af8580

gpshead added the type-bug and needs backport to 3.6 labels Nov 13, 2018

gpshead self-assigned this Nov 13, 2018

the-knights-who-say-ni added the CLA signed label Nov 13, 2018

bedevere-bot added the awaiting merge label Nov 13, 2018

gpshead added the awaiting changes label Nov 13, 2018

gpshead changed the title ~~bpo-35214: Fix OOB memory access in f-string parser~~ bpo-35214: Fix OOB memory access in unicode escape parser Nov 13, 2018

Correct the news entry.

db54123

gpshead removed the awaiting changes label Nov 13, 2018

gpshead added the needs backport to 2.7 label Nov 13, 2018

gpshead merged commit 746b2d3 into python:master Nov 13, 2018

bedevere-bot removed the awaiting merge label Nov 13, 2018

bedevere-bot removed the needs backport to 3.7 label Nov 13, 2018

bedevere-bot removed the needs backport to 3.6 label Nov 13, 2018

gpshead deleted the msan_fstring_oob branch November 13, 2018 21:18

bedevere-bot removed the needs backport to 2.7 label Nov 14, 2018

gpshead mentioned this pull request Aug 22, 2023

Get the test suite passing with clang Memory Sanitizer enabled #79395

Closed
