-
Notifications
You must be signed in to change notification settings - Fork 367
Fix effects of reordering bug introduced in bulk metadata corrections from 2025-11-14 #6672
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Fix effects of reordering bug introduced in bulk metadata corrections from 2025-11-14 #6672
Conversation
Introduced by metadata corrections 2025-11-14 Note that the affiliations in XML don't always match with what is found on the PDF.
Introduced by metadata corrections 2025-11-14
Introduced by metadata corrections 2025-11-14 Pretty meaningless affiliation 'Institute'', but nonetheless. Noticed last author name inconsistent between PDF and metadata: fixed too
Introduced by metadata corrections 2025-11-14 Affiliation and ORCID put back to correct author
|
Build successful. Some useful links:
This preview will be removed when the branch is merged. |
Introduced by metadata corrections 2025-11-14 Affiliations put back to correct author
Introduced by metadata corrections 2025-11-14 Affiliations and orcid put back to correct author
| <author><first>Salsabil Maulana</first><last>Akbar</last><affiliation>Universitas Telkom</affiliation></author> | ||
| <author><first>Nuur</first><last>Shadieq</last><affiliation>Universitas Telkom</affiliation></author> | ||
| <author><first>Wawan</first><last>Cenggoro</last><affiliation>Binus University</affiliation></author> | ||
| <author><first>Salsabil Maulana</first><last>Akbar</last><affiliation>Institut Teknologi Bandung</affiliation></author> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Issue of metadata correction: #6328
2024.acl-long.796
Paper page: https://aclanthology.org/2024.acl-long.796.pdf
Commit showing the bug effect: f88a83e
just affiliations needed to be changed
note: affiliations don't always match with what is shown on PDF
| <author><first>Norah</first><last>Alshahrani</last><affiliation>University of Bisha</affiliation></author> | ||
| <author><first>Saied</first><last>Alshahrani</last><affiliation>ASAS AI</affiliation></author> | ||
| <author><first>Norah</first><last>Alshahrani</last><affiliation>ASAS AI</affiliation></author> | ||
| <author><first>Saied</first><last>Alshahrani</last><affiliation>University of Bisha</affiliation></author> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Issue of metadata correction: #6401
2025.arabicnlp-main.26
Paper page: https://aclanthology.org/2025.arabicnlp-main.26/
Commit showing the bug effect: db094a2
just affiliations needed to be changed
saied and norah (same last name) swapped
| <author><first>Mohamed</first><last>Samy</last><affiliation>Institute</affiliation></author> | ||
| <author><first>Mayar</first><last>Boghdady</last></author> | ||
| <author><first>Mohamed</first><last>Samy</last></author> | ||
| <author><first>Mayar</first><last>Boghdady</last><affiliation>Institute</affiliation></author> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Issue of metadata correction: #6335
2025.arabicnlp-sharedtasks.133
Paper page: https://aclanthology.org/2025.arabicnlp-sharedtasks.133/
Commit showing the bug effect: c8dfe7c
just affiliation (meaningless "NA")
| <author><first>Marwan</first><last>El Adawi</last></author> | ||
| <author><first>Mohamed</first><last>Nassar</last></author> | ||
| <author><first>Ensaf Hussein</first><last>Mohamed</last></author> | ||
| <author><first>Ensaf</first><last>Hussein</last></author> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Noted another name inconsistency: last author has name Ensaf Hussein 8according to PDF](https://aclanthology.org/2025.arabicnlp-sharedtasks.133.pdf), but metadata says last name Mohamed. In metadata correction issue only updated authors_new but not authors list itself. This author should probably also have name variants recorded, currently 3 author pages (Ensaf Hussein Mohamed, Ensaf Mohamed, Ensaf H. Mohamed) that could probably merge and several metadata PDF inconsistencies. I haven't seen an author page request for this author yet.
| <author orcid="0000-0003-0701-0204"><first>Charitha</first><last>Rathnayake</last><affiliation>Massey University</affiliation></author> | ||
| <author><first>Surangika</first><last>Ranathunga</last><affiliation>University of Moratuwa</affiliation></author> | ||
| <author><first>Charitha</first><last>Rathnayake</last><affiliation>University of Moratuwa</affiliation></author> | ||
| <author orcid="0000-0003-0701-0204"><first>Surangika</first><last>Ranathunga</last><affiliation>Massey University</affiliation></author> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Issue of metadata correction: #6422
2025.emnlp-main.1435
Paper page: https://aclanthology.org/2025.emnlp-main.1435/
Commit showing the bug effect: 7ba600b
Bug affected affiliation and orcid
check orcid belongs to correct person: https://orcid.org/0000-0003-0701-0204 Ranathunga
| <author><first>Chaitali</first><last>Agarwal</last></author> | ||
| <author><first>Sudharshan</first><last>Govindan</last></author> | ||
| <author><first>Haw-Shiuan</first><last>Chang</last></author> | ||
| <author orcid="0000-0003-4607-936X"><first>Haw-Shiuan</first><last>Chang</last><affiliation>Department of Computer Science, University of Massachusetts at Amherst</affiliation></author> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Issue of metadata correction: #6394
2025.starsem-1.18
Paper page: https://aclanthology.org/2025.starsem-1.18/
Commit showing the bug effect: ec5e183
Bug affected orcid and affiliation
confirmed ORCID now correct Haw-Shiuan Chang : https://orcid.org/0000-0003-4607-936X
| <author><first>Sina</first><last>Ahmadi</last><affiliation>University of Zurich</affiliation></author> | ||
| <author><first>Anthony</first><last>Munthali</last></author> | ||
| <author><first>Jonathan Mingfei</first><last>Liu</last><affiliation>Google</affiliation></author> | ||
| <author><first>Jonathan</first><last>Eng</last><affiliation>Google</affiliation></author> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Issue of metadata correction: #6448
2025.wmt-1.85
Paper page: https://aclanthology.org/2025.wmt-1.85/
Commit showing the bug effect: 53420bc
Bug only affected affiliation
- affiliations of two consecutive authors were swapped
- affiliations were off by one
- last two authors need to get back their affiliations from newly introduced authors before them
Step towards #6589 : fix effect of reordering bug in bulk metadata corrections from 2025-11-14
Each change annotated with comment referencing error-introducing commit to compare, paper page, initial metadata correction issue etc.
Process
Went through the "Files changed" tab https://github.com/acl-org/acl-anthology/pull/6469/files
and noted all instances, where a reordering affected associations of affiliation and orcid to authors.
Changes:
2024.acl-long.796
in
data/xml/2024.acl.xmlIssue of metadata correction: #6328
2024.acl-long.796Paper page: https://aclanthology.org/2024.acl-long.796.pdf
Commit showing the bug effect: f88a83e
just affiliations needed to be changed
note: affiliations don't always match with what is shown on PDF
2025.arabicnlp-main.26
in
data/xml/2025.arabicnlp.xmlIssue of metadata correction: #6401
2025.arabicnlp-main.26Paper page: https://aclanthology.org/2025.arabicnlp-main.26/
Commit showing the bug effect: db094a2
just affiliations needed to be changed
saied and norah (same last name) swapped
2025.arabicnlp-sharedtasks.133
in
data/xml/2025.arabicnlp.xmlIssue of metadata correction: #6335
2025.arabicnlp-sharedtasks.133Paper page: https://aclanthology.org/2025.arabicnlp-sharedtasks.133/
Commit showing the bug effect: c8dfe7c
just affiliation (meaningless "NA")
noted another name inconsistency: last author has name Ensaf Hussein, but metadata says Lastname Mohamed. In issue only updated
authors_newbut notauthorslist itself. This author should probably also have name variants recorded, currently 3 author pages (Ensaf Hussein Mohamed, Ensaf Mohamed, Ensaf H. Mohamed) that could probably merge and several metadata PDF inconsistencies.2025.starsem-1.18
in
data/xml/2025.starsem.xmlIssue of metadata correction: #6394
2025.starsem-1.18Paper page: https://aclanthology.org/2025.starsem-1.18/
Commit showing the bug effect: ec5e183
Bug affected orcid and affiliation
confirmed ORCID now correct Haw-Shiuan Chang : https://orcid.org/0000-0003-4607-936X
2025.wmt-1.85
in
data/xml/2025.wmt.xmlIssue of metadata correction: #6448
2025.wmt-1.85Paper page: https://aclanthology.org/2025.wmt-1.85/
Commit showing the bug effect: 53420bc
Bug only affected affiliation
2025.emnlp-main.1435
in
data/xml/2025.emnlp.xmlIssue of metadata correction: #6422
2025.emnlp-main.1435Paper page: https://aclanthology.org/2025.emnlp-main.1435/
Commit showing the bug effect: 7ba600b
Bug affected affiliation and orcid
check orcid belongs to correct person: https://orcid.org/0000-0003-0701-0204 Ranathunga