Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reversed Arabic character ordering (version 0.9.0) #506

Open
ahmedelq opened this issue Jul 20, 2024 · 1 comment
Open

Reversed Arabic character ordering (version 0.9.0) #506

ahmedelq opened this issue Jul 20, 2024 · 1 comment

Comments

@ahmedelq
Copy link

Bug Report

Current Behavior
The order of the characters within each Arabic word is reversed in the output CSV file.
For example if a cell contains the sentence تقييم الطالب in the pdf, the csv result will be بلاطلا مييقت.
(for comparison sake, if a cell contains Student Results in the pdf it will become stluseR tnedutS in the produced csv).

Steps to reproduce

camelot --format csv --output out.csv lattice arabic.pdf 

Environment

  • camelot version: 0.9.0
  • Python 3.12.4

Additional context/Screenshots
Original PDF
image

Camelot result
image

Expected result
image

@bosd
Copy link

bosd commented Aug 8, 2024

Heyl!

This repo seems to be no longer maintained.
We try to build a maintained fork at pypdf_table_extraction.

You are welcome to check it out and contribute there.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants