Omega359
Contributor

@Omega359 Omega359 commented Jul 18, 2025

Omega359 and others added 30 commits February 20, 2025 12:35
@alamb
Contributor

alamb commented Jul 19, 2025

I also added a performance chart from https://alamb.github.io/datafusion-benchmarking/. I am still rechecking the numbers, but it is looking good so far. A preview:
[Image: Screenshot 2025-07-19 at 7 01 57 AM, preview of the performance chart]

@alamb
Contributor

alamb commented Jul 21, 2025

😅 -- ok I think I filled out the major content parts of this post. It needs:

  1. More honing / review
  2. Review / rerun the performance chart numbers

@Omega359
Contributor Author

Thanks for your help with this @alamb, sorry I haven't done more. I've been deep trying to figure out a performance issue I've been having and haven't had the spare time to put into this

@Omega359 Omega359 marked this pull request as ready for review July 21, 2025 21:13
@alamb
Contributor

alamb commented Jul 21, 2025

> Thanks for your help with this @alamb, sorry I haven't done more. I've been deep trying to figure out a performance issue I've been having and haven't had the spare time to put into this

I totally understand -- we are all just trying to scrape together enough time to make progress. It is a struggle for sure! No worries.

All your contributions are appreciated

@alamb
Contributor

alamb commented Jul 23, 2025

I took another pass -- I think this post is looking quite good :bowtie:

The major thing that is missing now is actual performance numbers. I will work on that next, likely tomorrow or Friday

@Omega359
Contributor Author

Oh my, the AI loves commas. Just make sure to leave out any em-dashes :)

Contributor

@alamb alamb left a comment

We are getting ready for the DataFusion 49 release and we have several shoutouts in the release notes.

I tagged the relevant people on this PR -- please help / suggest additional comments / content / contributors as you are able


Here are some noteworthy optimizations added since DataFusion 48:

**Equivalence system upgrade:** The lower levels of the equivalence system, which is used to implement the
Contributor

Information Passing for joins] (Issue
[#15037](https://github.com/apache/datafusion/issues/15037) PR
[#15770](https://github.com/apache/datafusion/pull/15770) by
[adriangb](https://github.com/adriangb)).
Contributor

FYI @adriangb


The last few months, between `46.0.0` and `49.0.0`, have seen our community grow:

1. New PMC members and committers: [berkay], [xudong963] and [timsaucer] joined the PMC.
Contributor


(Issue [#6518](https://github.com/apache/datafusion/issues/6518),
[PR #14837](https://github.com/apache/datafusion/pull/14837) from
[goldmedal](https://github.com/goldmedal) 🏆)
Contributor


[resulting PR]: https://github.com/apache/datafusion/pull/16398
[blog post]: https://datafusion.apache.org/blog/2025/06/30/cancellation
[contribution upstream to tokio]: https://github.com/tokio-rs/tokio/pull/7405
Contributor

FYI @pepijnve

2. Pushdown of filters and expressions (see "Dynamic Filters and TopK pushdown" section above)

[a long-requested feature]: https://github.com/apache/datafusion/issues/12644
[timsaucer]: https://github.com/timsaucer
Contributor

);
```

([Issue #15216](https://github.com/apache/datafusion/issues/15216),
Contributor


(Issue [#11732](https://github.com/apache/datafusion/issues/11732),
PR [#13511](https://github.com/apache/datafusion/pull/13511),
by [Garamda](https://github.com/Garamda))
Contributor

FYI @Garamda

[`REGEXP_INSTR` function]: https://datafusion.apache.org/user-guide/sql/scalar_functions.html#regexp-instr
([Issue #13009](https://github.com/apache/datafusion/issues/13009),
[PR #15928](https://github.com/apache/datafusion/pull/15928)
by [nirnayroy](https://github.com/nirnayroy))
Contributor


(Issue [#16130](https://github.com/apache/datafusion/issues/16130),
PR [#16268](https://github.com/apache/datafusion/pull/16268)
by [ding-young](https://github.com/ding-young))
Contributor

@alamb
Contributor

alamb commented Jul 27, 2025

I plan to publish this tomorrow (Monday July 28 2025)

Member

@xudong963 xudong963 left a comment

Thank you

@alamb
Contributor

alamb commented Jul 28, 2025

🚀

@alamb alamb merged commit 9dba51d into apache:main Jul 28, 2025
1 check passed
Development

Successfully merging this pull request may close these issues.

Blog post for the DataFusion 49 release
Release DataFusion 49.0.0 (July 2025)

3 participants