DataFusion 49.0.0 release post #91
Co-authored-by: Yongting You <[email protected]>
Co-authored-by: Andrew Lamb <[email protected]>
Co-authored-by: Phillip LeBlanc <[email protected]>
Co-authored-by: Kevin Liu <[email protected]>
I also added a performance chart from https://alamb.github.io/datafusion-benchmarking/ (I am rechecking these numbers, etc.) but it is looking good so far. A preview:
😅 -- ok I think I filled out the major content parts of this post. It needs:
Thanks for your help with this @alamb, sorry I haven't done more. I've been deep in trying to figure out a performance issue I've been having and haven't had the spare time to put into this.
I totally understand -- we are all just trying to scrape together enough time to make progress. It is a struggle for sure! No worries. All your contributions are appreciated.
I took another pass -- I think this post is looking quite good. The major thing that is missing now is actual performance numbers. I will work on that next, likely tomorrow or Friday.
Oh my, the AI loves commas, just make sure to leave out any em-dashes :)
We are getting ready for the DataFusion 49 release and we have several shoutouts in the release notes.
I tagged the relevant people on this PR -- please help / suggest additional comments / content / contributors as you are able

Here are some noteworthy optimizations added since DataFusion 48:

**Equivalence system upgrade:** The lower levels of the equivalence system, which is used to implement the
FYI @ozankabak
Information Passing for joins] (Issue
[#15037](https://github.com/apache/datafusion/issues/15037) PR
[#15770](https://github.com/apache/datafusion/pull/15770) by
[adriangb](https://github.com/adriangb)).
FYI @adriangb

The last few months, between `46.0.0` and `49.0.0`, have seen our community grow:

1. New PMC members and committers: [berkay], [xudong963] and [timsaucer] joined the PMC.

(Issue [#6518](https://github.com/apache/datafusion/issues/6518),
[PR #14837](https://github.com/apache/datafusion/pull/14837) from
[goldmedal](https://github.com/goldmedal) 🏆)
fyi @goldmedal

[resulting PR]: https://github.com/apache/datafusion/pull/16398
[blog post]: https://datafusion.apache.org/blog/2025/06/30/cancellation
[contribution upstream to tokio]: https://github.com/tokio-rs/tokio/pull/7405
FYI @pepijnve
2. Pushdown of filters and expressions (see "Dynamic Filters and TopK pushdown" section above)

[a long-requested feature]: https://github.com/apache/datafusion/issues/12644
[timsaucer]: https://github.com/timsaucer
fyi @timsaucer
);
```

([Issue #15216](https://github.com/apache/datafusion/issues/15216),
FYI @corwinjoy and @adamreeve

(Issue [#11732](https://github.com/apache/datafusion/issues/11732),
PR [#13511](https://github.com/apache/datafusion/pull/13511),
by [Garamda](https://github.com/Garamda))
FYI @Garamda
[`REGEXP_INSTR` function]: https://datafusion.apache.org/user-guide/sql/scalar_functions.html#regexp-instr
([Issue #13009](https://github.com/apache/datafusion/issues/13009),
[PR #15928](https://github.com/apache/datafusion/pull/15928)
by [nirnayroy](https://github.com/nirnayroy))
FYI @nirnayroy

(Issue [#16130](https://github.com/apache/datafusion/issues/16130),
PR [#16268](https://github.com/apache/datafusion/pull/16268)
by [ding-young](https://github.com/ding-young))
FYI @ding-young
I plan to publish this tomorrow (Monday, July 28, 2025)
Thank you
🚀
Posting as a draft as per @alamb's request @ apache/datafusion#16235 (comment)
49.0.0
(July 2025) datafusion#16235