This repository was archived by the owner on Feb 27, 2023. It is now read-only.
The MarkLogic Spring Batch project is an extension of the CORE and INFRASTRUCTURE components of Spring Batch to make it easier to write batch processing programs using MarkLogic.
+Start with the [project home page](https://github.com/marklogic-community/marklogic-spring-batch/wiki) to get started.

-#What is Spring Batch?
+## Prerequisites

-[Spring Batch](http://docs.spring.io/spring-batch/trunk/reference/html/) is an open source framework for batch processing based on the [Spring Framework](http://projects.spring.io/spring-framework/).
+* MarkLogic 8+
+* JDK 1.8+

-Figure 1 depicts a high-level diagram of a Spring Batch program. The green APPLICATION box represents the batch processing program that gets executed by a user. This is usually a command line based application but it could also be triggered by an external source like a user interface. Spring Batch provides two components, CORE and INFRASTRUCTURE, that enables the boilerplate code for creating an APPLICATION.
-
-* CORE - contains the core runtime classes necessary to launch and control a batch job
-* INFRASTRUCTURE - contains common readers and writers, and services which are used by APPLICATION and CORE
Open $PROJECT_ROOT/gradle.properties. Review the mlHost, mlRestPort, mlJobRepoPort properties to confirm there are no conflicts
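
A minimal shell sketch of the review step above, assuming `gradle.properties` stores each property as `key=value` with no spaces around `=`. The function names `show_ml_props` and `ports_conflict` are hypothetical helpers for illustration and are not part of the project.

```shell
# Hypothetical helper: print the three MarkLogic properties named above.
# $1 is the path to the properties file (defaults to ./gradle.properties).
show_ml_props() {
  grep -E '^(mlHost|mlRestPort|mlJobRepoPort)=' "${1:-gradle.properties}"
}

# Hypothetical helper: succeed (exit status 0) when mlRestPort and
# mlJobRepoPort are both set to the same value, which would be a conflict.
ports_conflict() {
  file="${1:-gradle.properties}"
  rest=$(sed -n 's/^mlRestPort=//p' "$file")
  repo=$(sed -n 's/^mlJobRepoPort=//p' "$file")
  [ -n "$rest" ] && [ "$rest" = "$repo" ]
}

# Example usage:
#   show_ml_props "$PROJECT_ROOT/gradle.properties"
#   ports_conflict "$PROJECT_ROOT/gradle.properties" && echo "port conflict"
```

This only compares the two configured ports with each other; it does not check whether either port is already in use on the MarkLogic host.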

-Figure 2 shows the key concepts that make up the domain language of Spring Batch. The boxes in blue represent concepts from the CORE component and yellow from the INFRASTRUCTURE component. A Job has one to many steps, which has exactly one ItemReader, ItemProcessor, and ItemWriter. A job needs to be launched (JobLauncher), and meta data about the currently running process needs to be stored (JobRepository).
Run the following command to execute all project tests. All tests should pass.

-# What is MarkLogic Spring Batch?
+gradlew test

-MarkLogic Spring Batch (MSB) extends the CORE and INFRASTRUCTURE components of Spring Batch to make it easier to work with data and MarkLogic. Sample code templates are included for developers getting started writing a batch processing APPLICATION.
+## Coding Style Tests

-## What are the main features of MarkLogic Spring Batch?
+This project uses both checkstyle and PMD.

-* Extends the [INFRASTRUCTURE]() classes to facilitate reading, writing, and processing documents for MarkLogic.
-* A sample [APPLICATION]() that provides a template program to create your own Spring Batch program
-* (Beta) MarkLogic implementation of a JobRepository
+gradlew check

-#How can I get started using MarkLogic Spring Batch?
+## Deployment

-Check out the [Getting Started Wiki](), review the sample application, and check out other batch processing applications using Spring Batch
+The product of this project are jar files that are published to [bintray](https://dl.bintray.com/sastafford/maven/). The following libraries are created.

-*[ml-migration-starter]() - Migrate data from a relational database into MarkLogic
-*[Hector]() - Ingest CSV files into MarkLogic
-*[Penny]() - Use Apache Natural Language Processing library to perform named entity recognition over documents