Commit 8fa4d35

1.0.0 release
1 parent 5673fe2 commit 8fa4d35

2 files changed

+60
-7
lines changed

README.md

+58-5
Original file line number | Diff line number | Diff line change
@@ -1,8 +1,54 @@
1-
# Simple OCR action #
1+
Alfresco Simple OCR Action
2+
==========================
23

4+
This addon provides an action to extract OCR text from images or plain PDFs in Alfresco.
5+
6+
**License**
7+
The plugin is licensed under the [LGPL v3.0](http://www.gnu.org/licenses/lgpl-3.0.html).
8+
9+
**State**
10+
Current addon release is 1.0.0
11+
12+
**Compatibility**
13+
The current version has been developed using Alfresco 5.1 and Alfresco SDK 2.1.1, although it should also run on Alfresco 5.0.d and Alfresco 5.0.c.
14+
15+
Browser compatibility: 100% supported
16+
17+
Supported OCR software:
18+
* [pdfsandwich](http://www.tobias-elze.de/pdfsandwich/)
19+
* [OCRmyPDF](https://github.com/jbarlow83/OCRmyPDF)
20+
* [Windows.Media.OCR](https://www.nuget.org/packages/Microsoft.Windows.Ocr/) as local service
21+
22+
**Languages**
23+
Currently the Share interface is provided in English, Spanish and Brazilian Portuguese.
24+
The catalog of supported OCR languages depends directly on the selected OCR software ([Tesseract OCR](https://github.com/tesseract-ocr) or [Windows.Media.OCR](https://www.nuget.org/packages/Microsoft.Windows.Ocr/)).
25+
26+
***No original Alfresco resources have been overwritten***
27+
28+
BeeCon 2016
29+
-----------
30+
This addon was presented at BeeCon 2016. You can find additional details at [Integrating a simple OCR in Alfresco](http://beecon.buzz/talks/?id=20160125005).
31+
32+
Downloading the ready-to-deploy-plugin
33+
--------------------------------------
34+
The binary distribution consists of a single AMP file to be deployed in the Alfresco repository:
35+
36+
* [repo AMP](https://github.com/keensoft/alfresco-simple-ocr/releases/download/1.0.0/alfresco-simple-ocr.amp)
37+
38+
You can install it using the standard [Alfresco deployment tools](http://docs.alfresco.com/community/tasks/dev-extensions-tutorials-simple-module-install-amp.html).
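
One of those standard tools is the Module Management Tool (MMT) shipped with Alfresco. The following is a minimal sketch of an install wrapper around it, not part of this addon: the function name `install_amp` is hypothetical, and the paths to the AMP, the `alfresco.war` and the MMT jar are assumptions that depend on your installation.

```shell
# Hypothetical helper: apply an AMP to a WAR with the Alfresco Module Management Tool.
# Arguments: $1 = AMP file, $2 = target WAR, $3 = alfresco-mmt.jar (all paths are assumptions).
install_amp() {
  amp="$1"; war="$2"; mmt="$3"
  # Refuse to run if any of the three files is missing.
  for f in "$amp" "$war" "$mmt"; do
    [ -f "$f" ] || { echo "missing file: $f" >&2; return 1; }
  done
  # "install <amp> <war>" is the MMT install command; -verbose prints what is applied.
  java -jar "$mmt" install "$amp" "$war" -verbose
}
```

A call might look like `install_amp alfresco-simple-ocr.amp /opt/alfresco/tomcat/webapps/alfresco.war /opt/alfresco/bin/alfresco-mmt.jar`, with the server stopped beforehand and restarted afterwards.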
39+
40+
Building the artifacts
41+
----------------------
42+
If you are new to Alfresco and the Alfresco Maven SDK, you should start by reading [Jeff Potts' tutorial on the subject](http://ecmarchitect.com/alfresco-developer-series-tutorials/maven-sdk/tutorial/tutorial.html).
43+
44+
You can build the artifacts from source using Maven:
45+
```$ mvn clean package```
46+
47+
Configuration
48+
----------------------
349
After installation, the following properties must be included in **alfresco-global.properties**:
450

5-
If you are using **pdfsandwich**
51+
* If you are using **pdfsandwich**
652

753
```
854
ocr.command=/usr/local/bin/pdfsandwich
@@ -14,7 +60,7 @@ ocr.server.os=linux
1460
1561
```
1662

17-
If you are using **OCRmyPDF**
63+
* If you are using **OCRmyPDF**
1864

1965
```
2066
ocr.command=/usr/local/bin/ocrmypdf
@@ -26,7 +72,7 @@ ocr.server.os=linux
2672
2773
```
2874

29-
If you are using **Windows.OCR**
75+
* If you are using **Windows.OCR**
3076

3177
```
3278
ocr.url=http://localhost:60064/api/OCR/
@@ -35,4 +81,11 @@ ocr.output.verbose=true
3581
ocr.extra.commands=Spanish
3682
ocr.server.os=windows
3783
38-
```
84+
```
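
Since a wrong `ocr.command` path only shows up when the rule first runs, it can help to check the setting before restarting Alfresco. The sketch below is an assumption-laden illustration, not part of the addon: the helper names `get_prop` and `check_ocr_command` are hypothetical, and the parser only handles plain `key=value` lines.

```shell
# Hypothetical helper: print the value of a key from a .properties file.
# $1 = key as a regular expression (escape dots), $2 = properties file.
# Only plain key=value lines are handled; the last match wins.
get_prop() {
  grep -E "^$1=" "$2" | tail -n 1 | cut -d= -f2-
}

# Hypothetical check: succeed only if ocr.command is set and points to an executable.
# $1 = path to alfresco-global.properties (location depends on your installation).
check_ocr_command() {
  cmd=$(get_prop 'ocr\.command' "$1")
  [ -n "$cmd" ] && [ -x "$cmd" ]
}
```

For example, `check_ocr_command /path/to/alfresco-global.properties || echo "ocr.command is missing or not executable"`. This applies to the pdfsandwich and OCRmyPDF setups; the Windows configuration uses `ocr.url` instead.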
85+
86+
Usage
87+
----------------------
88+
* Add a rule to a folder and select the **ocr-extract** action.
89+
* Every image dropped into this folder will be sent to the OCR software to produce a searchable PDF file.
90+
* To perform this operation asynchronously, use the checkbox provided by Alfresco when configuring the rule.
91+
* To let Alfresco keep operating if OCR fails, enable the rule option `Continue on error`.

simple-ocr-repo/pom.xml

+2-2
Original file line number | Diff line number | Diff line change
@@ -4,15 +4,15 @@
44
<modelVersion>4.0.0</modelVersion>
55
<groupId>es.keensoft.alfresco</groupId>
66
<artifactId>simple-ocr-repo</artifactId>
7-
<version>1.0-SNAPSHOT</version>
7+
<version>1.0.0</version>
88
<name>simple-ocr-repo Repository AMP project</name>
99
<packaging>amp</packaging>
1010
<description>Manages the lifecycle of the simple-ocr-repo Repository AMP (Alfresco Module Package)</description>
1111

1212
<parent>
1313
<groupId>org.alfresco.maven</groupId>
1414
<artifactId>alfresco-sdk-parent</artifactId>
15-
<version>2.2.0</version>
15+
<version>2.1.1</version>
1616
</parent>
1717

1818
<!--
