Closed
Description
Is your feature request related to a problem or challenge?
Tracking ticket for next release, also a place to track desired inclusions
Last release was https://crates.io/crates/datafusion/440.0 December 31, 2024 so next major release would be around Feb 1, 2025
Steps:
- Update version and changelog: Prepare for
45.0.0
release: Version and Changelog #14397 - Voting Thread: https://lists.apache.org/thread/g20ywc9yto8xp07lcllmvgyn8g5z4420
- Release to crates.io: https://dist.apache.org/repos/dist/release/datafusion/datafusion-45.0.0
- Create ticket for next release: Release DataFusion
46.0.0
#14123 - Port changelog to main: Minor: Update changelog for
45.0.0
and tweakCHANGELOG
docs #14545
Pre-relese testing
- Test with Comet: Test DataFusion 45.0.0 with Comet #14274
- Test with delta.rs: chore: update to DataFusion 45.0.0 / arrow/parquet
54.0.0
delta-io/delta-rs#3175 - Test with SailHQ: Test DataFusion 45.0.0 with Sail #14408
- Test with InfluxData
- Test with DataFusion Python: Test DataFusion 45 with datafusion-python #14410
Prior release tickets:
Please let me know if you would like to add any items on this list or move the categorization
Items to fix before release
- Upgrade to arrow-rs/parquet
54.0.0
#14114 - Regression:
DataFrame::schema
returns incorrect schema for NATURAL JOIN #14058 - Regression in CASE expression since DF 44 #14277
- string_numeric_coercion in binary.rs does not support Utf8View #13359
- DataFusion Regression (Starting in v43): Type Coercion for UDF Arguments (X --> String) for Specified UDFs #14230
- Regression:
Invalid comparison operation: Utf8 == Utf8View
error during LEFT ANTI JOIN #13510 - Type Coercion fails for List with inner type struct which has large/view types #14154
- [Epic] A Collection of Additional UTF8View support tickets #13504
-
encode(..., "hex")
errors on non-UTF-8 binaries since Datafusion v43 #14055 - Improve deprecation message for MemoryExec #14322
- LimitPushdown rule uncorrect remove some GlobalLimitExec #14204
Items maybe to complete (not sure if they are blockers)
- Can no longer easily join duplicate schemas as of version 43 #14112
- Query produces different results after physical plan round trip to bytes #14253
- Potential performance regression with comparisions to scalar values #14291
- Error with type coercion with
CREATE TABLE AS SELECT
... insertingVALUES
#13124 - Bug: applying multiple times
EnforceDistribution
generates invalid plan #14150
Nice to Have (but non blockers -- e.g. bugs but not regressions)
- feat: metadata columns #14057
- Exponential planning time (100s of seconds) with
UNION
andORDER BY
queries #13748 - External Error prefix is repeated multiple times #14080
- FFI support for versions and alternate tokio runtimes #13937
- Improve speed of
median
by implementing specialGroupsAccumulator
#13681 -
FULL OUTER JOIN
andLIMIT
produces wrong results #14335