Add more documentation/examples for extracting content from PDFs #95

michaelrsweet · 2025-01-26T13:46:12Z

As mentioned in issue #92, the current documentation for pdfioStreamGetToken doesn't explain that whitespace and comments are skipped, that name values start with '/', or that PDF operators and values are returned as strings starting with a letter.
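To illustrate the behavior described above, here is a minimal sketch of how a caller might sort the returned tokens by their first character. The `token_type_t` enum and the `classify_token` helper are hypothetical names made up for this example and are not part of the PDFio API. The number rule (leading digit, sign, or decimal point) is an assumption based on general PDF syntax, not on the current documentation.

```c
#include <ctype.h>
#include <stddef.h>

// Hypothetical token categories for illustration; not part of PDFio.
typedef enum
{
  TOKEN_NAME,    // Begins with '/', for example "/F1"
  TOKEN_NUMBER,  // Begins with a digit, sign, or '.' (assumption)
  TOKEN_WORD,    // Begins with a letter: an operator such as "Tj"
                 // or a value such as "true"
  TOKEN_OTHER    // Anything else, including empty input
} token_type_t;

// Classify a token string by looking only at its first character.
// Whitespace and comments never reach this point because the token
// reader skips them.
token_type_t
classify_token(const char *token)
{
  if (!token || !*token)
    return (TOKEN_OTHER);

  if (token[0] == '/')
    return (TOKEN_NAME);

  if (isdigit((unsigned char)token[0]) || token[0] == '-' ||
      token[0] == '+' || token[0] == '.')
    return (TOKEN_NUMBER);

  if (isalpha((unsigned char)token[0]))
    return (TOKEN_WORD);

  // The ' and " text operators do not begin with a letter, so they
  // land here in this sketch.
  return (TOKEN_OTHER);
}
```

In a real program each token would come from a loop such as `while (pdfioStreamGetToken(st, buffer, sizeof(buffer)))`, with `buffer` passed to `classify_token` on every iteration.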

It would also be useful to expand the discussion of the pdf2text.c example code, and maybe add a pdf2images.c example as another way to show how to extract information from a PDF file.

michaelrsweet self-assigned this Jan 26, 2025

michaelrsweet added the documentation label Jan 26, 2025

michaelrsweet added this to the 1.5 milestone Jan 26, 2025

michaelrsweet changed the title ~~Add more documentation/examples for pdfioStreamGetToken~~ Add more documentation/examples for extracting content from PDFs Jan 26, 2025

michaelrsweet mentioned this issue Jan 26, 2025

Fail to open large/complex PDF with NULL from pdfioFileOpen #92

Closed
