Feat/pdfinfo parser #626

Luigi31415 · 2024-12-29T18:40:04Z

This PR adds a new parser for the output of the pdfinfo command.
Closes #624

kellyjonbrazil · 2025-01-13T17:08:01Z

Thanks for the parser contribution! Could you fork this from the dev branch? Also, I notice other parser files in this PR. Could you ensure the PR only includes the parser.py file and tests/fixtures?

Thanks!

Luigi31415 · 2025-01-26T06:05:35Z

Hey Kelly,
I just rebased that branch, sorry for the late reply. There are no other files in this PR except universal, which I included only because of the universal output of pdfinfo. I thought it would be logical to extend an existing parser.

Title:          Brochure
Producer:       Skia/PDF m111 Google Docs Renderer
Tagged:         no
Form:           none
Pages:          2
Encrypted:      no
Page size:      612 x 792 pts (letter) (rotated 0 degrees)
File size:      69988 bytes
Optimized:      no
JavaScript:     no
PDF version:    1.4

Thanks for maintaining a great library man, appreciate your work.

kellyjonbrazil · 2025-01-26T19:54:31Z

Hi @Luigi31415 - thanks for the updates. I wonder if there is a simpler way to do this since it looks like pdfinfo output is really just key/value pairs and we already have a key/value parser (--kv). Is there a need for this parser?

If the keys need to be renamed, then maybe we just alias to the existing --kv parser (which itself is an alias of --ini) and run the lib.normalize_key function within _process.

Here is the jc output using the existing key/value parser:

% echo 'Title:          Brochure
Producer:       Skia/PDF m111 Google Docs Renderer
Tagged:         no
Form:           none
Pages:          2
Encrypted:      no
Page size:      612 x 792 pts (letter) (rotated 0 degrees)
File size:      69988 bytes
Optimized:      no
JavaScript:     no
PDF version:    1.4' | jc --kv -p
{
  "Title": "Brochure",
  "Producer": "Skia/PDF m111 Google Docs Renderer",
  "Tagged": "no",
  "Form": "none",
  "Pages": "2",
  "Encrypted": "no",
  "Page size": "612 x 792 pts (letter) (rotated 0 degrees)",
  "File size": "69988 bytes",
  "Optimized": "no",
  "JavaScript": "no",
  "PDF version": "1.4"
}
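
For illustration, the key/value idea with renamed keys could look roughly like the sketch below. This is a standalone example, not jc's code: parse_pdfinfo and the normalize_key shown here are hypothetical stand-ins, and the renaming rule (lowercase, runs of non-alphanumeric characters replaced by underscores) is an assumption rather than the documented behavior of the library function.

```python
import re

# Trimmed copy of the pdfinfo output quoted earlier in this thread.
SAMPLE = """\
Title:          Brochure
Pages:          2
Page size:      612 x 792 pts (letter) (rotated 0 degrees)
File size:      69988 bytes
PDF version:    1.4
"""


def normalize_key(key: str) -> str:
    """Hypothetical key normalizer (assumption, not jc's implementation).

    Lowercases the key and replaces each run of non-alphanumeric
    characters with a single underscore.
    """
    return re.sub(r"[^a-z0-9]+", "_", key.strip().lower()).strip("_")


def parse_pdfinfo(text: str) -> dict:
    """Parse 'Key:   value' lines into a dict with normalized keys."""
    result = {}
    for line in text.splitlines():
        if ":" not in line:
            continue  # skip blank or malformed lines
        # Split on the first colon only, so values may contain colons.
        key, value = line.split(":", 1)
        result[normalize_key(key)] = value.strip()
    return result


if __name__ == "__main__":
    for k, v in parse_pdfinfo(SAMPLE).items():
        print(f"{k}: {v}")
```

All values stay strings here, as in the --kv output above; converting fields such as Pages to integers would be a separate step.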

feat: Add pdfinfo parser

c9949d9

Luigi31415 changed the base branch from master to dev January 26, 2025 06:01
