Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add more documentation/examples for extracting content from PDFs #95

Open
michaelrsweet opened this issue Jan 26, 2025 · 0 comments
Open
Assignees
Labels
documentation Improvements or additions to documentation
Milestone

Comments

@michaelrsweet
Copy link
Owner

As mentioned in issue #92, the current documentation for pdfioStreamGetToken doesn't explain that whitespace and comments are skipped, name values start with '/', nor that PDF operators and values are returned as strings starting with a letter.

It would be useful to expand the pdf2text.c example code discussion as well, and maybe add a pdf2images.c example as another way to show how to extract information from a PDF file.

@michaelrsweet michaelrsweet self-assigned this Jan 26, 2025
@michaelrsweet michaelrsweet added the documentation Improvements or additions to documentation label Jan 26, 2025
@michaelrsweet michaelrsweet added this to the 1.5 milestone Jan 26, 2025
@michaelrsweet michaelrsweet changed the title Add more documentation/examples for pdfioStreamGetToken Add more documentation/examples for extracting content from PDFs Jan 26, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation
Projects
None yet
Development

No branches or pull requests

1 participant