Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

improve experience when configuring an alternative JobRepository #4718

Open
joshlong opened this issue Nov 24, 2024 · 6 comments
Open

improve experience when configuring an alternative JobRepository #4718

joshlong opened this issue Nov 24, 2024 · 6 comments

Comments

@joshlong
Copy link
Member

improve experience when configuring an alternative JobRepository, as the DefaultBatchConfiguration, upon which Spring Boot in turn relies, assumes only JDBC-based infrastructure. This means that, when using Spring Boot, you can't simply replace one bean of type JobRepository and have things work. Everything must be redone from scratch, as we used to do before Spring Boot existed.

This will be a problem for users of Spring Batch who want to avail themselves of the new "resourceless" and MongoDB job repository alternatives.

@joshlong joshlong added status: waiting-for-triage Issues that we did not analyse yet type: bug labels Nov 24, 2024
@fmbenhassine
Copy link
Contributor

fmbenhassine commented Dec 3, 2024

@EnableBatchProcessing (and in turn DefaultBatchConfiguration) was designed as the Java counterpart of the XML element job-repository from the batch namespace where a datasource is required:

<batch:job-repository id="jobRepository" data-source="dataSource" transaction-manager="transactionManager"/>

So yes, @EnableBatchProcessing assumes a JDBC infrastructure, and this should change now that we introduced the mongo job repository and the resourceless job repository.

I have a couple ideas on how we can address this issue and of course I am open to any thoughts about this as well. I will share these ideas with the community when we start working on this design issue for the next major release. This comment is only to validate and acknowledge the issue. Thank you for reporting it, @joshlong !

@krewetka
Copy link

krewetka commented Mar 10, 2025

Hi @fmbenhassine as proper fix is planned for next major version is there any earlier workaround solution? 🤔

I mean especially to overcome shutdown problem. It is quite a big deal to be honest for production usage and as fixing it not in 5.3 but in 6 seems like it will be not fixed for at least few years 🤔

Unless we simply are doing something wrong but we didn't manage to configure proper shutdown despite many attempts .

We moved from starting the job in regular container to starting it with AWS batch so at least regular CI/CD deployment is not breaking long running jobs but let's say it is only workaround we found.

@fmbenhassine
Copy link
Contributor

@krewetka which shutdown problem are you referring to? How is that related to this issue? Please open a separate discussion to explain your problem and I will be happy to help there.

FTR, v6 is planned for November this year and there is no plan for v5.3 at the moment. In the meantime and for those who want to use a non-jdbc job repository with v5.2, please check this SO thread in which I attached a complete example.

@krewetka
Copy link

krewetka commented Mar 12, 2025

Hi, sorry my bad. I am referring to #4728 (comment) which was marked as duplicate of this issue. Should I still open another one then?🤔

We have mongo setup running properly (but with H2 still there).

We did try to exclude DataSourceAutoConfiguration.class but then we were still facing some issues but let me try again one more time and see the exact results.

@fmbenhassine
Copy link
Contributor

@krewetka

Should I still open another one then?

Yes please. The shutdown lifecycle issue is different than requiring h2 for a mongo setup.

@krewetka
Copy link

krewetka commented Mar 12, 2025

Ok, will try to provide full example in new issue when it comes to shutdown. Maybe it will be solved when I manage to get rid of h2 in my code finally 🤞

Btw, I checked SO thread and noticed it is not using EnableBatchProcessing

We have it in our code still and when we remove H2 we are facing:
Error creating bean with name 'jobExplorer': Cannot resolve reference to bean 'dataSource' while setting bean property 'dataSource'
with added:@SpringBootApplication(exclude = DataSourceAutoConfiguration.class)

I guess we need to not use EnableBatchProcessing then but still somehow setup somewhere almost all other piecies from it like tablePrefix , isolationLevelForCreate , taskExecutorRef etc.

Do you have any recommendation for it how do to proper workaround for the time-being?

I did copy of EnableBatchProcessing with String dataSourceRef() default ""; and used this one but there are imports on not public packages like AutomaticJobRegistrarBeanPostProcessor and BatchRegistrar ( which then define registerJobExplorer ) etc. which I could not keep in custom version.

I skiped them and it seems to work and the jobs are run but not sure if this is good idea as some important parts of sprint batch internal logic are not missing ( for example jobExplorer is not wired-up) 🤔

After doing it I realized that it can be even still with String dataSourceRef() default "dataSource"; as it is not used anymore when there is no import of BatchRegistrar which is creating jobExplorer 😆

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants