-
Notifications
You must be signed in to change notification settings - Fork 364
feat: Support weight-stripped engine and REFIT_IDENTICAL flag #3167
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
59 commits
Select commit
Hold shift + click to select a range
40349a8
support weight-stripped engine and REFIT_IDENTICAL flag
zewenli98 5d7c677
refactor with new design
zewenli98 82b7ddc
lint
zewenli98 9f6a771
samll fix
zewenli98 7ea3c0f
remove make_refittable
zewenli98 bf7553b
fast refit -> slow refit
zewenli98 46e9bc8
fix np.bool_, group_norm
zewenli98 d783fdd
add immutable_weights
zewenli98 160588e
skip engine caching for non-refittable engines, slow refit -> fast refit
zewenli98 493f981
refactored, there are 3 types of engines
zewenli98 f204104
fix and add tests
zewenli98 4663c83
fix issues #3206 #3217
zewenli98 c57ab06
small fix
zewenli98 402c9b0
resolve comments
zewenli98 d8e59da
WIP: cache weight-stripped engine
zewenli98 e8811fd
Merge branch 'main' into weight_stripped_engine
zewenli98 f2e3f00
redesigned hash func and add constant mapping to fast refit
zewenli98 31af308
refactor and add tests
zewenli98 1ae33f4
Merge branch 'main' into weight_stripped_engine
zewenli98 90bf679
update
zewenli98 a8a34f6
increase ENGINE_CACHE_SIZE
zewenli98 285bc90
skip some tests
zewenli98 2d152cf
fix tests
zewenli98 d461608
try fixing cumsum
zewenli98 d57b885
Merge branch 'main' into weight_stripped_engine
zewenli98 23d68d5
fix windows cross compile, TODO: whether windows support stripping en…
zewenli98 a928f67
CI debug test 1
zewenli98 02625ca
CI debug test 2
zewenli98 c462e40
CI debug test 3
zewenli98 9ba33b5
Merge branch 'main' into weight_stripped_engine
3d68039
reduce -n to 4 for converter tests on CI
zewenli98 2e7ef3b
reduce -n to 4 for converter tests on CI
zewenli98 9ff165c
simplify test_different_args_dont_share_cached_engine
zewenli98 8ca8e2d
reduce -n to 2
zewenli98 f9f2a70
reduce -n to 1
zewenli98 c69c61a
revert -n back to 4 and chunk converter
zewenli98 05b560d
change to opt-in feature
zewenli98 7feea97
fix conflict
zewenli98 d1521c3
fix typo
zewenli98 5a193a2
Merge branch 'main' into weight_stripped_engine
0b345be
small fix
zewenli98 6754481
Merge branch 'main' into weight_stripped_engine
zewenli98 4a7e957
update to manylinux2_28-builder
zewenli98 6e840ba
remove cuda12.6 tests
zewenli98 9a8473a
remove one_user_validator for native_layer_norm
zewenli98 6a07767
clear tests
zewenli98 ed3424a
remove the whole chunk
zewenli98 ef54239
add cuda12.6 back and export D_GLIBCXX_USE_CXX11_ABI=1
zewenli98 f166562
fix env
zewenli98 80aae71
fix container
zewenli98 676c9ce
fix env
zewenli98 bf2edc6
fix env
zewenli98 627d510
fix env
zewenli98 b393b6f
fix env
zewenli98 78d72b6
fix env
zewenli98 a5d3c18
export USE_CXX11_ABI=1 for cuda12.6
zewenli98 4f02da8
remove chunk
zewenli98 7d7423a
resolve comments
zewenli98 9f76304
Merge branch 'main' into weight_stripped_engine
zewenli98 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.