feat: add process delete entry in snapshot produce #33
crates/iceberg/src/transaction.rs
Outdated
self.new_manifest_writer(&ManifestContentType::Data, spec_id)?,
);
}
data_file_writer.as_mut().unwrap().add_entry(entry)?;
add_entry -> add_delete_entry
crates/iceberg/src/transaction.rs
Outdated
self.new_manifest_writer(&ManifestContentType::Deletes, spec_id)?,
);
}
delete_file_writer.as_mut().unwrap().add_entry(entry)?;
add_entry -> add_delete_entry
LGTM
Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.
Pull Request Overview
This PR adds support for delete manifest processing in snapshot production by introducing a new partition_spec_id parameter. It also updates various builders and tests across the codebase to propagate this new field.
- Added partition_spec_id to the manifest writer functions and related builders.
- Implemented new delete manifest writing logic in transaction and snapshot production.
- Updated test files and file writers to include the additional partition_spec_id parameter.
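The delete manifest logic summarized above, together with the review suggestion to call add_delete_entry instead of add_entry, can be sketched roughly as follows. This is a minimal illustration and not the crate's actual code: `ContentType`, `Entry`, `ManifestWriter`, and `write_delete_entries` are hypothetical stand-ins, and only the names `add_delete_entry` and `partition_spec_id` are taken from this PR. It assumes one manifest writer per content type, created lazily on the first matching entry.

```rust
// Hypothetical stand-ins for illustration; not the iceberg crate's real types.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum ContentType {
    Data,
    Deletes,
}

#[derive(Debug, Clone, PartialEq, Eq)]
struct Entry {
    path: String,
    content: ContentType,
    partition_spec_id: i32,
}

#[derive(Debug)]
struct ManifestWriter {
    content: ContentType,
    partition_spec_id: i32,
    deleted: Vec<Entry>,
}

impl ManifestWriter {
    fn new(content: ContentType, partition_spec_id: i32) -> Self {
        Self { content, partition_spec_id, deleted: Vec::new() }
    }

    // Records an entry as deleted. Rejects entries whose content type or
    // partition spec id differs from the manifest being written.
    fn add_delete_entry(&mut self, entry: Entry) -> Result<(), String> {
        if entry.content != self.content || entry.partition_spec_id != self.partition_spec_id {
            return Err(format!("entry {} does not match this manifest", entry.path));
        }
        self.deleted.push(entry);
        Ok(())
    }
}

// Routes each removed entry to a data or delete manifest writer.
// A writer is only created once an entry of its content type shows up.
fn write_delete_entries(
    entries: Vec<Entry>,
    spec_id: i32,
) -> Result<(Option<ManifestWriter>, Option<ManifestWriter>), String> {
    let mut data_writer: Option<ManifestWriter> = None;
    let mut delete_writer: Option<ManifestWriter> = None;
    for entry in entries {
        let content = entry.content;
        let slot = match content {
            ContentType::Data => &mut data_writer,
            ContentType::Deletes => &mut delete_writer,
        };
        slot.get_or_insert_with(|| ManifestWriter::new(content, spec_id))
            .add_delete_entry(entry)?;
    }
    Ok((data_writer, delete_writer))
}
```

Using `Option::get_or_insert_with` in place of the `as_mut().unwrap()` pattern from the diff is a design choice of this sketch: it avoids a panic path when the writer has not been created yet.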
Reviewed Changes
Copilot reviewed 21 out of 21 changed files in this pull request and generated 1 comment.
File | Description |
---|---|
crates/iceberg/src/transaction.rs | Updated new_manifest_writer signature and delete manifest logic. |
crates/iceberg/src/scan.rs, crates/iceberg/src/io/object_cache.rs | Added partition_spec_id to DataFileBuilder invocations in tests. |
crates/iceberg/src/writer/base_writer/data_file_writer.rs | Modified builder construction to include partition_spec_id parameter. |
crates/integration_tests/tests/shared_tests/append_partition_data_file_test.rs | Updated builder calls to supply partition_spec_id. |
crates/iceberg/src/writer/base_writer/sort_position_delete_writer.rs | Incorporated partition_spec_id into sort position delete logic. |
crates/iceberg/src/writer/base_writer/equality_delete_writer.rs | Revised equality delete writer config and tests for partition_spec_id. |
crates/iceberg/src/arrow/record_batch_partition_spliter.rs | Adjusted import and added accessor for partition_spec. |
crates/iceberg/src/spec/manifest.rs | Updated manifest entry conversion to pass partition_spec_id. |
crates/iceberg/src/writer/function_writer/fanout_partition_writer.rs | Added partition_spec_id rewriting to data files. |
crates/iceberg/src/writer/file_writer/parquet_writer.rs | Set partition_spec_id in file writer and updated tests accordingly. |
(Other files have similar updates to builders and tests to pass partition_spec_id.)
Co-authored-by: Copilot <[email protected]>
* support spec id in data file
* support process delete entry
* fulfill partition spec id
* fix
* fix spelling mistake

Co-authored-by: Copilot <[email protected]>

---------

Co-authored-by: ZENOTME <[email protected]>
Co-authored-by: Copilot <[email protected]>
* support spec id in data file
* support process delete entry
* fulfill partition spec id
* fix
* fix spelling mistake

Co-authored-by: Copilot <[email protected]>

---------

Co-authored-by: ZENOTME <[email protected]>
Co-authored-by: Copilot <[email protected]>
Signed-off-by: xxchan <[email protected]>
Which issue does this PR close?
What changes are included in this PR?
Are these changes tested?