Improve BibTeX-from-PDF import #11999

koppor · 2024-10-16T20:14:10Z

!! This is more of an issue for experimenting with heuristics: how can a machine create useful information with "traditional" (non-AI) code? !!

When importing the PDF se2paper.pdf, one gets the following BibTeX entry:

@InProceedings{How,
  author   = {On How and We Can and Teach and Exploring New and Ways in and Professional Software and Development for Students},
  title    = {Microsoft Word - ieee_on_how_we_teach_jul_01.docx},
  abstract = {— Requirements and approaches for introductory
courses in software development at universities differ II. SETTING THE STAGE: SOFTWARE DEVELOPMENT AT 
considerably. There seems to be little consensus on which HDM 
languages are a good fit, which methodologies lead to the best 
...
  file     = {:C\:/Users/koppor/Downloads/se2paper-1.pdf:PDF},
}

However, the title should be better:

The properties of the file show

Tasks:

If the title from the text importer is "better", the title from the file properties should not be used (see org.jabref.logic.importer.fileformat.PdfMergeMetadataImporter#importDatabase(java.nio.file.Path)).
Improve abstract parsing. Maybe stripper.setSortByPosition(true); needs to be removed from org.jabref.logic.importer.fileformat.PdfContentImporter#getFirstPageContents. Maybe two separate methods are needed: one to parse the title (depending on position) and one to parse the abstract (depending more on content).
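
As a starting point for the first task, here is a minimal sketch of a "which title is better" heuristic. It is not JabRef code: the class TitleHeuristic, its methods, and the list of suspicious patterns are all hypothetical, and the content title used in main is a placeholder rather than the real title of se2paper.pdf. The assumption is that a properties title that starts with an authoring-tool prefix or ends in a file extension (as in the entry above) was generated by a tool and should lose against a plausible title from the text importer.

```java
import java.util.regex.Pattern;

/** Hypothetical helper for illustration only; not part of JabRef. */
final class TitleHeuristic {

    // Assumed prefixes that authoring tools write into the PDF "Title" property.
    private static final Pattern TOOL_PREFIX = Pattern.compile(
            "^(microsoft word|microsoft powerpoint)\\s*-\\s*.*", Pattern.CASE_INSENSITIVE);

    // A "title" that ends in a document file extension is most likely a file name.
    private static final Pattern FILE_EXTENSION = Pattern.compile(
            ".*\\.(docx?|pptx?|tex|dvi|pdf|rtf|odt)$", Pattern.CASE_INSENSITIVE);

    private TitleHeuristic() {
    }

    /** Returns true if the title is empty or looks like tool output or a file name. */
    static boolean looksAutoGenerated(String title) {
        if (title == null || title.isBlank()) {
            return true;
        }
        String trimmed = title.trim();
        return TOOL_PREFIX.matcher(trimmed).matches()
                || FILE_EXTENSION.matcher(trimmed).matches()
                || trimmed.equalsIgnoreCase("untitled");
    }

    /**
     * Keeps the properties (metadata) title by default and switches to the
     * content title only if the metadata title is suspicious and the content
     * title is not.
     */
    static String pickBetterTitle(String metadataTitle, String contentTitle) {
        boolean metadataSuspicious = looksAutoGenerated(metadataTitle);
        boolean contentSuspicious = looksAutoGenerated(contentTitle);
        if (metadataSuspicious && !contentSuspicious) {
            return contentTitle.trim();
        }
        return metadataTitle == null ? "" : metadataTitle.trim();
    }

    public static void main(String[] args) {
        String fromProperties = "Microsoft Word - ieee_on_how_we_teach_jul_01.docx";
        String fromContent = "A Title Extracted From the First Page"; // placeholder value
        // prints: A Title Extracted From the First Page
        System.out.println(pickBetterTitle(fromProperties, fromContent));
    }
}
```

A real implementation would probably need more signals (for example title length or the share of underscores), and the merge logic in PdfMergeMetadataImporter may require a different shape than a static two-argument method.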

Hint:

Rely on, and add test cases to, org.jabref.logic.importer.fileformat.PdfContentImporterTest and org.jabref.logic.importer.fileformat.PdfMergeMetadataImporterTest.


leaf-soba · 2024-10-18T02:43:12Z

I want to take this issue.

koppor · 2024-10-18T10:39:41Z

/assign @leaf-soba

leaf-soba · 2024-11-05T01:29:08Z

I'd like to check: is the next step #12139, or is it getting a correct author/abstract?

liamsebestyen · 2025-03-06T23:49:43Z

Hey, I am working in a group of four Software Engineering students at the University of Victoria. We are taking a course where we have to make an open source contribution, and we are interested in completing this as our first open source contribution.
Thank you!

liamsebestyen · 2025-03-06T23:49:47Z

/assign-me

github-actions · 2025-03-06T23:49:59Z

👋 Hey @liamsebestyen, thank you for your interest in this issue! 🎉

We're excited to have you on board. Start by exploring our Contributing guidelines, and don't forget to check out our workspace setup guidelines to get started smoothly.

In case you encounter failing tests during development, please check our developer FAQs!

Having any questions or issues? Feel free to ask here on GitHub. Need help setting up your local workspace? Join the conversation on JabRef's Gitter chat. And don't hesitate to open a (draft) pull request early on to show the direction it is heading towards. This way, you will receive valuable feedback.

Happy coding! 🚀

⏳ Please note, you will be automatically unassigned if the issue isn't closed within 45 days (by 20 April 2025). A maintainer can also add the "📌 Pinned" label to prevent automatic unassignment.

github-actions · 2025-03-21T12:12:57Z

📋 Assignment Update

Hi @liamsebestyen, due to inactivity, you have been unassigned from this issue.

Next steps

If you still want to work on this:

Ask a maintainer to assign you again
If you're making progress, a maintainer can add the pin label to prevent future automatic unassignment

liamsebestyen · 2025-03-21T16:02:45Z

We are starting to work on this today.

liamsebestyen · 2025-03-21T16:02:54Z

/assign-me

github-actions · 2025-03-21T16:03:08Z

👋 Hey @liamsebestyen, thank you for your interest in this issue! 🎉

⏳ Please note, you will be automatically unassigned if there is not a (draft) pull request within 14 days (by 04 April 2025).

koppor · 2025-04-01T06:31:44Z

@WillMohr858 Thank you for working on this. Please comment on the PR, not on the issue, because your reply relates to your proposed solution and is not a clarification of the issue itself.

koppor added the good first issue An issue intended for project-newcomers. Varies in difficulty. label Oct 16, 2024

koppor added this to Good First Issues Oct 16, 2024

github-project-automation bot moved this to Free to take in Good First Issues Oct 16, 2024

leaf-soba mentioned this issue Oct 18, 2024

Add a title guess method to get "better" title #12018

Merged

7 tasks

github-actions bot assigned leaf-soba Oct 18, 2024

github-actions bot added the 📍 Assigned Assigned by assign-issue-action (or manually assigned) label Oct 18, 2024

koppor mentioned this issue Oct 18, 2024

Add SKIP_IF_NOT_IN_PROJECT flag m7kvqbe1/github-action-move-issues#28

Merged

github-actions bot unassigned leaf-soba Feb 24, 2025

github-actions bot removed the 📍 Assigned Assigned by assign-issue-action (or manually assigned) label Feb 24, 2025

koppor moved this from Assigned to Free to take in Good First Issues Feb 25, 2025

JabRef deleted a comment from github-actions bot Feb 25, 2025

koppor added good second issue Issues that involve a tour of two or three interweaved components in JabRef and removed good first issue An issue intended for project-newcomers. Varies in difficulty. labels Feb 25, 2025

github-actions bot assigned liamsebestyen Mar 6, 2025

github-actions bot added the 📍 Assigned Assigned by assign-issue-action (or manually assigned) label Mar 6, 2025

subhramit moved this from Free to take to Assigned in Good First Issues Mar 11, 2025

github-actions bot unassigned liamsebestyen Mar 21, 2025

github-actions bot removed the 📍 Assigned Assigned by assign-issue-action (or manually assigned) label Mar 21, 2025

koppor moved this from Assigned to Free to take in Good First Issues Mar 21, 2025

github-actions bot assigned liamsebestyen Mar 21, 2025

github-actions bot added the 📍 Assigned Assigned by assign-issue-action (or manually assigned) label Mar 21, 2025

koppor moved this from Free to take to Assigned in Good First Issues Mar 21, 2025

liamsebestyen mentioned this issue Apr 1, 2025

Add best-title picking logic in mergeCandidates method #12872

Closed

3 tasks

koppor moved this from Assigned to In Progress in Good First Issues Apr 1, 2025

github-actions bot added the 📌 Pinned label Apr 1, 2025

koppor added the component: import-load label Apr 1, 2025

koppor removed this from Good First Issues Apr 1, 2025

InAnYan mentioned this issue Apr 1, 2025

Extend PdfContentImporter to extract information from bibliographical pages in books #12874

Open

github-actions bot unassigned liamsebestyen May 12, 2025

github-actions bot removed 📍 Assigned Assigned by assign-issue-action (or manually assigned) 📌 Pinned labels May 12, 2025
