this is an implementation for cloudera's pdf scanning post http://blog.cloudera.com/blog/2015/10/how-to-index-scanned-pdfs-at-scale-using-fewer-than-50-lines-of-code/ with elastic and cassandra instead of slor and hbase
-
Notifications
You must be signed in to change notification settings - Fork 0
this is an implementation for cloudera's pdf scanning post http://blog.cloudera.com/blog/2015/10/how-to-index-scanned-pdfs-at-scale-using-fewer-than-50-lines-of-code with elastic and cassandra instead of slor and hbase
fadyZohdy/cloudera-OCR
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
this is an implementation for cloudera's pdf scanning post http://blog.cloudera.com/blog/2015/10/how-to-index-scanned-pdfs-at-scale-using-fewer-than-50-lines-of-code with elastic and cassandra instead of slor and hbase
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published