-
-
Notifications
You must be signed in to change notification settings - Fork 2.8k
Improve BibTeX-from-PDF import #11999
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
I want to take this issue. |
/assign @leaf-soba |
I want to check the next step is #12139, or get a correct author/abstract? |
Hey I am working as a group of four Software Engineering Students at the University of Victoria. We are taking a course where we have to make an open source contribution, and we are interested on completing this as our first open source contribution. |
/assign-me |
👋 Hey @liamsebestyen, thank you for your interest in this issue! 🎉 We're excited to have you on board. Start by exploring our Contributing guidelines, and don't forget to check out our workspace setup guidelines to get started smoothly. In case you encounter failing tests during development, please check our developer FAQs! Having any questions or issues? Feel free to ask here on GitHub. Need help setting up your local workspace? Join the conversation on JabRef's Gitter chat. And don't hesitate to open a (draft) pull request early on to show the direction it is heading towards. This way, you will receive valuable feedback. Happy coding! 🚀 ⏳ Please note, you will be automatically unassigned if the issue isn't closed within 45 days (by 20 April 2025). A maintainer can also add the "📌 Pinned"" label to prevent automatic unassignment. |
📋 Assignment UpdateHi @liamsebestyen, due to inactivity, you have been unassigned from this issue. Next stepsIf you still want to work on this:
|
We are starting to work on this today. |
/assign-me |
👋 Hey @liamsebestyen, thank you for your interest in this issue! 🎉 We're excited to have you on board. Start by exploring our Contributing guidelines, and don't forget to check out our workspace setup guidelines to get started smoothly. In case you encounter failing tests during development, please check our developer FAQs! Having any questions or issues? Feel free to ask here on GitHub. Need help setting up your local workspace? Join the conversation on JabRef's Gitter chat. And don't hesitate to open a (draft) pull request early on to show the direction it is heading towards. This way, you will receive valuable feedback. Happy coding! 🚀 ⏳ Please note, you will be automatically unassigned if there is not a (draft) pull request within 14 days (by 04 April 2025). |
@WillMohr858 Thank you for working on this; Please comment on the PR - and not on the issue. Because your reply is related to your proposed solution to the issue; not some clarification of the issue. |
!! This is more an issue to experiment with heuristics. How can a machine with "traditional" (non-AI) code create useful information. !!
When importing the PDF se2paper.pdf
one gets following BibTeX entry
However, the title should be better:
The properties of the file show
Tasks:
org.jabref.logic.importer.fileformat.PdfMergeMetadataImporter#importDatabase(java.nio.file.Path)
)stripper.setSortByPosition(true);
needs to be removed fromorg.jabref.logic.importer.fileformat.PdfContentImporter#getFirstPageContents
. Maybe, two methods need to be done to be able to parse the title (depending on positon) and parsing the abstract (more on content)Hint:
org.jabref.logic.importer.fileformat.PdfContentImporterTest
and ´org.jabref.logic.importer.fileformat.PdfMergeMetadataImporterTest`The text was updated successfully, but these errors were encountered: