Releases: scribeocr/scribe.js
Releases · scribeocr/scribe.js
v0.2.4
- Improved support with build tools such as Webpack
- Fixed bug where PDF resources were being loaded when not necessary (dd99124)
- Fixed Tesseract bug causing incorrect metrics for single-word recognition (
Recognize Word) in Scribe OCR UI (f6be561)
Full Changelog: v0.2.3...v0.2.4
v0.2.3
- Added
extractPDFTextImageoption toimportFiles- When
extractPDFTextNative,extractPDFTextOCR, andextractPDFTextImageare all set totrue, text will always be extracted from the input PDF and set as the "active" version, even if there is no text.
- When
Full Changelog: v0.2.2...v0.2.3
v0.2.2
- Added support for importing HOCR generated by Tesseract.js
Full Changelog: v0.2.1...v0.2.2
v0.2.1
- Fixed bug where comparing OCR data required providing input images
- Switched to using scoped repos for dependencies (
@scribe.js/tesseract.jsand@scribe.js/tesseract.js-core) to fix name conflicts - Other minor changes
Full Changelog: v0.2.0...v0.2.1
v0.2.0
- Added
extractInternalPDFTextfunction for extracting existing text from PDFs. - Replaced
recognizeFileswithextractTextfunction.- This function now skips recognition by default for text-native PDF inputs, which should not require OCR.
- The new name is intended to communicate that recognition is not run for all inputs.
Full Changelog: v0.1.1...v0.2.0