Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

bpo-36819: Fix crashes in built-in encoders with weird error handlers #28593

Merged

Conversation

serhiy-storchaka
Copy link
Member

@serhiy-storchaka serhiy-storchaka commented Sep 28, 2021

If the error handler returns position less or equal than the starting
position of non-encodable characters, most of built-in encoders didn't
properly re-size the output buffer. This led to out-of-bounds writes,
and segfaults.

https://bugs.python.org/issue36819

#81000

If the error handler returns position less or equal than the starting
position of non-encodable characters, most of built-in encoders didn't
properly re-size the output buffer. This led to out-of-bounds writes,
and segfaults.
@serhiy-storchaka
Copy link
Member Author

serhiy-storchaka commented Sep 28, 2021

I need to add more tests for exceptions raised when an error handler returns invalid result, and analyze decoders.

Copy link
Member

@vstinner vstinner left a comment

Honestly, I'm not sure that it's worth it to handle this weird case: https://bugs.python.org/issue36819#msg402826

@serhiy-storchaka
Copy link
Member Author

serhiy-storchaka commented Sep 29, 2021

There are existing tests for position out of range (even negative values). They just are not harsh enough to cause a crash.

The code needs to be changed even if we restrict returned values for error handlers.

@serhiy-storchaka serhiy-storchaka marked this pull request as ready for review Oct 15, 2021
@serhiy-storchaka serhiy-storchaka requested a review from malemburg Oct 15, 2021
@serhiy-storchaka
Copy link
Member Author

serhiy-storchaka commented Oct 15, 2021

I have added more tests and it is now ready for review.

@serhiy-storchaka serhiy-storchaka merged commit 18b07d7 into python:main May 2, 2022
13 checks passed
@miss-islington
Copy link
Contributor

miss-islington commented May 2, 2022

Thanks @serhiy-storchaka for the PR 🌮🎉.. I'm working now to backport this PR to: 3.9, 3.10.
🐍🍒🤖 I'm not a witch! I'm not a witch!

@serhiy-storchaka serhiy-storchaka deleted the codecs-error-handler-pos branch May 2, 2022
@bedevere-bot
Copy link

bedevere-bot commented May 2, 2022

GH-92136 is a backport of this pull request to the 3.10 branch.

@bedevere-bot
Copy link

bedevere-bot commented May 2, 2022

GH-92137 is a backport of this pull request to the 3.9 branch.

miss-islington pushed a commit to miss-islington/cpython that referenced this pull request May 2, 2022
…pythonGH-28593)

If the error handler returns position less or equal than the starting
position of non-encodable characters, most of built-in encoders didn't
properly re-size the output buffer. This led to out-of-bounds writes,
and segfaults.
(cherry picked from commit 18b07d7)

Co-authored-by: Serhiy Storchaka <storchaka@gmail.com>
miss-islington added a commit that referenced this pull request May 2, 2022
…GH-28593)

If the error handler returns position less or equal than the starting
position of non-encodable characters, most of built-in encoders didn't
properly re-size the output buffer. This led to out-of-bounds writes,
and segfaults.
(cherry picked from commit 18b07d7)

Co-authored-by: Serhiy Storchaka <storchaka@gmail.com>
miss-islington added a commit that referenced this pull request May 2, 2022
…GH-28593)

If the error handler returns position less or equal than the starting
position of non-encodable characters, most of built-in encoders didn't
properly re-size the output buffer. This led to out-of-bounds writes,
and segfaults.
(cherry picked from commit 18b07d7)

Co-authored-by: Serhiy Storchaka <storchaka@gmail.com>
hello-adam pushed a commit to hello-adam/cpython that referenced this pull request Jun 2, 2022
…pythonGH-28593)

If the error handler returns position less or equal than the starting
position of non-encodable characters, most of built-in encoders didn't
properly re-size the output buffer. This led to out-of-bounds writes,
and segfaults.
(cherry picked from commit 18b07d7)

Co-authored-by: Serhiy Storchaka <storchaka@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
type-bug An unexpected behavior, bug, or error
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

5 participants