-
-
Notifications
You must be signed in to change notification settings - Fork 31.9k
Port PyPy's new unicode name db format to optimize binary CPython size #96954
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
They were undertested, and since python#96954 might involve a rewrite of this part of the code we want to ensure that there won't be any behavioral change. Co-authored-by: Carl Friedrich Bolz-Tereick <[email protected]>
They were undertested, and since python#96954 might involve a rewrite of this part of the code we want to ensure that there won't be any behavioral change. Co-authored-by: Carl Friedrich Bolz-Tereick <[email protected]>
They were undertested, and since #96954 might involve a rewrite of this part of the code we want to ensure that there won't be any behavioral change. Co-authored-by: Carl Friedrich Bolz-Tereick <[email protected]>
…H-96955) They were undertested, and since pythonGH-96954 might involve a rewrite of this part of the code we want to ensure that there won't be any behavioral change. Co-authored-by: Carl Friedrich Bolz-Tereick <[email protected]> (cherry picked from commit 5a32eec) Co-authored-by: Batuhan Taskaya <[email protected]>
…H-96955) They were undertested, and since pythonGH-96954 might involve a rewrite of this part of the code we want to ensure that there won't be any behavioral change. Co-authored-by: Carl Friedrich Bolz-Tereick <[email protected]> (cherry picked from commit 5a32eec) Co-authored-by: Batuhan Taskaya <[email protected]>
They were undertested, and since GH-96954 might involve a rewrite of this part of the code we want to ensure that there won't be any behavioral change. Co-authored-by: Carl Friedrich Bolz-Tereick <[email protected]> (cherry picked from commit 5a32eec) Co-authored-by: Batuhan Taskaya <[email protected]>
…-96992) They were undertested, and since GH-96954 might involve a rewrite of this part of the code we want to ensure that there won't be any behavioral change. Co-authored-by: Carl Friedrich Bolz-Tereick <[email protected]> (cherry picked from commit 5a32eec) Co-authored-by: Batuhan Taskaya <[email protected]>
…-96991) They were undertested, and since GH-96954 might involve a rewrite of this part of the code we want to ensure that there won't be any behavioral change. Co-authored-by: Carl Friedrich Bolz-Tereick <[email protected]> (cherry picked from commit 5a32eec) Co-authored-by: Batuhan Taskaya <[email protected]>
…ta codepoint names (#97906) Co-authored-by: Łukasz Langa <[email protected]> Co-authored-by: Pieter Eendebak <[email protected]> Co-authored-by: Dennis Sweeney <[email protected]>
…-111764) skip_if_missing in test_makeunicodedata
I've noticed
I think I know how to fix this and will submit a PR once I test my assumption. |
I've also noticed it is now impossible to call Is it necessary to include |
This avoids: python3.13 Tools/unicode/makeunicodedata.py python3.13: can't open file '.../build/debug/Tools/unicode/makeunicodedata.py': [Errno 2] No such file or directory make: *** [Makefile:1498: regen-unicodedata] Error 2 Re-run `make regen-unicodedata` to update the script path in generated files.
regen-configure pulls a docker container image, it's not the only "regen" command using Internet. But Maybe we can exclude |
The "make regen-unicodedata" should now be run manually. By the default, it requires an Internet connection, which is not always the case. Some Linux distributions build Linux packages in isolated environment (without network).
The "make regen-unicodedata" should now be run manually. By the default, it requires an Internet connection, which is not always the case. Some Linux distributions build Linux packages in isolated environment (without network).
The initial feature request is now implemented and known issues have been fixed. I close the issue. Thanks @cfbolz for this cool optimization! |
…codedata codepoint names (python#97906) Co-authored-by: Łukasz Langa <[email protected]> Co-authored-by: Pieter Eendebak <[email protected]> Co-authored-by: Dennis Sweeney <[email protected]>
…-111764) skip_if_missing in test_makeunicodedata
…ython#112118) This avoids: python3.13 Tools/unicode/makeunicodedata.py python3.13: can't open file '.../build/debug/Tools/unicode/makeunicodedata.py': [Errno 2] No such file or directory make: *** [Makefile:1498: regen-unicodedata] Error 2 Re-run `make regen-unicodedata` to update the script path in generated files.
The "make regen-unicodedata" should now be run manually. By the default, it requires an Internet connection, which is not always the case. Some Linux distributions build Linux packages in isolated environment (without network).
…codedata codepoint names (python#97906) Co-authored-by: Łukasz Langa <[email protected]> Co-authored-by: Pieter Eendebak <[email protected]> Co-authored-by: Dennis Sweeney <[email protected]>
…-111764) skip_if_missing in test_makeunicodedata
…ython#112118) This avoids: python3.13 Tools/unicode/makeunicodedata.py python3.13: can't open file '.../build/debug/Tools/unicode/makeunicodedata.py': [Errno 2] No such file or directory make: *** [Makefile:1498: regen-unicodedata] Error 2 Re-run `make regen-unicodedata` to update the script path in generated files.
The "make regen-unicodedata" should now be run manually. By the default, it requires an Internet connection, which is not always the case. Some Linux distributions build Linux packages in isolated environment (without network).
Feature or enhancement
PyPy has a new algorithm that might help us reduce the unicode name db size of our binaries (which would be helpful on our way forward to better WASM compatibility [through reduced download sizes]). For more details, you can see the new implementation in PyPy side by @cfbolz.
CC: @ambv @cfbolz
We'll probably start with missing tests, and then build out a prototype to see how feasible it is and will share numbers on how much it would help before going forward with the implementation.
Linked PRs
make regen-unicodedata
in out-of-tree builds #112118The text was updated successfully, but these errors were encountered: