Description
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
We are now being careful about breaking changes (see https://github.com/apache/arrow-rs/blob/master/CONTRIBUTING.md#breaking-changes)
This means we can't merge PRs with breaking API changes to main until early August
However there are now three potentially large parquet changes that could conflict with each other and have API changes:
- Add size statistics to `ParquetMetaData` introduced in PARQUET-2261 (arrow-rs#5486) from @etseidl
- Reintroduce: Write Bloom filters between row groups instead of the end (arrow-rs#5933) from @progval
- Add `ParquetMetadataWriter` allow ad-hoc encoding of `ParquetMetadata` (arrow-rs#6000) from @adriangb
Describe the solution you'd like
Some way to avoid a massive set of merge conflicts when we start merging changes to master for parquet 53
I would also love to be able to review and merge smaller PRs rather than keep several large ones outstanding
Describe alternatives you've considered
I would like to propose we create a feature branch (e.g. `parquet-53.0.0`) in the arrow-rs repo that we can merge parquet API changes to and develop new features
Once main opens for 53 (in early August) we can merge the branch to main
This approach does require maintenance of the parquet 53 branch and runs the risk of accumulating merge conflicts as it diverges from master. I am willing to help with the process
Additional context