Skip to content

Error on running pypdfocr #61

@ediwill

Description

@ediwill

I get the following error output while running in windows 32 bit

`
C:\OCRdir>pypdfocr test1.pdf
Starting conversion of test1.pdf
'pdfimages' is not recognized as an internal or external command,
operable program or batch file.
WARNING: Could not execute pdfimages to calculate DPI (try installing xpdf or po
ppler?), so defaulting to 300dpi
Traceback (most recent call last):
File "C:\Python27\Scripts\pypdfocr-script.py", line 11, in
load_entry_point('pypdfocr==0.9.1', 'console_scripts', 'pypdfocr')()
File "C:\Python27\lib\site-packages\pypdfocr\pypdfocr.py", line 492, in main
script.go(sys.argv[1:])
File "C:\Python27\lib\site-packages\pypdfocr\pypdfocr.py", line 474, in go
self._convert_and_file_email(self.pdf_filename)
File "C:\Python27\lib\site-packages\pypdfocr\pypdfocr.py", line 480, in _conve
rt_and_file_email
ocr_pdffilename = self.run_conversion(pdf_filename)
File "C:\Python27\lib\site-packages\pypdfocr\pypdfocr.py", line 359, in run_co
nversion
hocr_filenames = self.ts.make_hocr_from_pnms(preprocess_imagefilenames)
File "C:\Python27\lib\site-packages\pypdfocr\pypdfocr_tesseract.py", line 132,
in make_hocr_from_pnms
uptodate,ver = self._is_version_uptodate()
File "C:\Python27\lib\site-packages\pypdfocr\pypdfocr_tesseract.py", line 98,
in _is_version_uptodate
ver = [int(x) for x in ver_str.split('.')]
ValueError: invalid literal for int() with base 10: '00alpha'

C:\OCRdir>

`

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions