Slow SQL in JdbcStepExecutionDao on Postgres #3634

mcheban · 2019-12-24T13:10:06Z

In SQL described as constant GET_LAST_STEP_EXECUTION

SELECT SE.STEP_EXECUTION_ID,
       SE.STEP_NAME,
       SE.START_TIME,
       SE.END_TIME,
       SE.STATUS,
       SE.COMMIT_COUNT,
       SE.READ_COUNT,
       SE.FILTER_COUNT,
       SE.WRITE_COUNT,
       SE.EXIT_CODE,
       SE.EXIT_MESSAGE,
       SE.READ_SKIP_COUNT,
       SE.WRITE_SKIP_COUNT,
       SE.PROCESS_SKIP_COUNT,
       SE.ROLLBACK_COUNT,
       SE.LAST_UPDATED,
       SE.VERSION,
       JE.JOB_EXECUTION_ID,
       JE.START_TIME,
       JE.END_TIME,
       JE.STATUS,
       JE.EXIT_CODE,
       JE.EXIT_MESSAGE,
       JE.CREATE_TIME,
       JE.LAST_UPDATED,
       JE.VERSION
from BATCH_JOB_EXECUTION JE,
     BATCH_STEP_EXECUTION SE
where SE.JOB_EXECUTION_ID in (SELECT JOB_EXECUTION_ID
                              from BATCH_JOB_EXECUTION
                              where JE.JOB_INSTANCE_ID = ?)
  and SE.JOB_EXECUTION_ID = JE.JOB_EXECUTION_ID
  and SE.STEP_NAME = ?
order by SE.START_TIME desc, SE.STEP_EXECUTION_ID desc;

subquery (SELECT JOB_EXECUTION_ID from BATCH_JOB_EXECUTION where JE.JOB_INSTANCE_ID = ?) filters by JE.JOB_INSTANCE_ID which is outside of this subquery and as a result this subquery will scan the whole table and DB performs filtering by JOB_INSTANCE_ID at the very end.
The issue is only reproducible when you have millions of records in BATCH_JOB_EXECUTION

The fix is simply rewrite subquery and remove JE. – like this where JOB_INSTANCE_ID = ?

The text was updated successfully, but these errors were encountered:

cmsource · 2020-01-29T10:47:48Z

I just hit the same problem after upgrading from Spring Batch 4.1.3.RELEASE to 4.2.0.RELEASE (the defective query appears to be in the latest 4.2.1.RELEASE too).

Using an embedded HSQLDB Job Repository this query is taking up to 50 seconds to run with 25k rows in BATCH_STEP_EXECUTION.

simi · 2020-04-09T13:51:14Z

I can confirm this problem as well. The difference is 15s vs 1ms (for right query with index present) on PostgreSQL.

fmbenhassine · 2020-04-09T14:39:33Z

Thank you all for your feedback! This will be included in the upcoming 4.3.0.M1 which will be aligned with Spring Framework 5.3.0.M1 and Spring Boot 2.4.0.M1. The release dates of those milestones are not fixed yet so I can't give a date for Spring Batch 4.3.0.M1 for now.

For the record, the query GET_LAST_STEP_EXECUTION was introduced in an effort of improving the performance of step partitioning (see #891) which was taking more than 4 minutes (!) on H2 with 5000 partitions.. We managed to improve that by a factor of 10 thanks to this new query.

Now if this query can be optimized even further, then of course that's welcome!

The issue is only reproducible when you have millions of records in BATCH_JOB_EXECUTION

@mcheban Thank you for reporting this issue and for opening a PR! Just curious, can you share some numbers about how many jobs do you have and at which frequency they are launched to end up with millions of records in BATCH_JOB_EXECUTION? Do you have a retention policy / archiving strategy as recommended in the docs?

fmbenhassine · 2020-04-25T04:57:22Z

Resolved with #3635 .

kersale-g · 2020-05-25T12:33:31Z

Hi,
Thanks for you confirmation of the issue.
Could you confirm that until fix is available, the only "easy" solution is to move "back" to 4.1.3.RELEASE.
We identified the same issue two weeks ago with hundreds of thousands of records left in tables (different subject) and I was referencing this thread to our dev team.
Issue is with postgresql :
(SELECT JOB_EXECUTION_ID from BATCH_JOB_EXECUTION where JE.JOB_INSTANCE_ID = ?)
is executed as
(SELECT JE2.JOB_EXECUTION_ID from BATCH_JOB_EXECUTION JE2 where JE.JOB_INSTANCE_ID = ?)
as if running
(SELECT JE2.JOB_EXECUTION_ID from BATCH_JOB_EXECUTION JE2 ) without any filtering

The following sub-SQL (IN condition) looks fine:
(SELECT JE2.JOB_EXECUTION_ID from BATCH_JOB_EXECUTION JE2 where JE2.JOB_INSTANCE_ID = ?)

If the SQL is the one intended (out of my scope), prefixing any table does avoid any Database parsing/optimizer consideration, as it explicitly reference the intended table to use.

Regards,
Geoffrey.

fmbenhassine · 2020-05-25T22:05:17Z

@kersale-g

Could you confirm that until fix is available, the only "easy" solution is to move "back" to 4.1.3.RELEASE.

You can also override the JdbcStepExecutionDao as explained here: #3635 (comment).

artsgard · 2021-10-05T13:44:38Z

I have upgraded my spring-boot-starter-parent from 2.2.2 to 2.4.3 and the query problem is still there: running a simple select query of less than a millisecond that turns into 100/ 200 milliseconds, delaying my batch dramatically! The select query runs at the processor section of the batch, passing through all the chunks of size 500. The DB I am using is a Postgress one.

mcheban mentioned this issue Dec 24, 2019

Improve performance in JdbcStepExecutionDao #3635

Merged

fmbenhassine added the status: waiting-for-triage Issues that we did not analyse yet label Dec 24, 2019

fmbenhassine added related-to: performance type: enhancement and removed status: waiting-for-triage Issues that we did not analyse yet labels Jan 29, 2020

fmbenhassine added this to the 4.3.0 milestone Jan 29, 2020

fmbenhassine added the type: holder Issues that hold references to back-ported issues label Jan 29, 2020

fmbenhassine added has: backports Legacy label from JIRA. Superseded by "for: backport-to-x.x.x" and removed type: holder Issues that hold references to back-ported issues labels Feb 7, 2020

fmbenhassine closed this as completed Apr 25, 2020

This was referenced Apr 25, 2020

4.3.0-M1 issues #3694

Closed

4.2.4 Backported issues #3695

Closed

dependabot bot mentioned this issue Mar 7, 2021

Bump spring.batch.version from 3.0.2.RELEASE to 4.3.1 in /spring-data-batch cyrus13/anastasakis-net-sample-code#2

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Slow SQL in JdbcStepExecutionDao on Postgres #3634

Slow SQL in JdbcStepExecutionDao on Postgres #3634

mcheban commented Dec 24, 2019

cmsource commented Jan 29, 2020

Uh oh!

simi commented Apr 9, 2020 •

edited

Loading

Uh oh!

fmbenhassine commented Apr 9, 2020

Uh oh!

fmbenhassine commented Apr 25, 2020

Uh oh!

kersale-g commented May 25, 2020 •

edited

Loading

Uh oh!

fmbenhassine commented May 25, 2020 •

edited

Loading

Uh oh!

artsgard commented Oct 5, 2021

Uh oh!

Slow SQL in JdbcStepExecutionDao on Postgres #3634

Slow SQL in JdbcStepExecutionDao on Postgres #3634

Comments

mcheban commented Dec 24, 2019

cmsource commented Jan 29, 2020

Uh oh!

simi commented Apr 9, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fmbenhassine commented Apr 9, 2020

Uh oh!

fmbenhassine commented Apr 25, 2020

Uh oh!

kersale-g commented May 25, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fmbenhassine commented May 25, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

artsgard commented Oct 5, 2021

Uh oh!

simi commented Apr 9, 2020 •

edited

Loading

kersale-g commented May 25, 2020 •

edited

Loading

fmbenhassine commented May 25, 2020 •

edited

Loading