Skip to content

rdf to-jelly: preserve blank node IDs#156

Merged
Ostrzyciel merged 1 commit intomainfrom
piotr/to-jelly-preserve-bnodes
Jul 12, 2025
Merged

rdf to-jelly: preserve blank node IDs#156
Ostrzyciel merged 1 commit intomainfrom
piotr/to-jelly-preserve-bnodes

Conversation

@Ostrzyciel
Copy link
Copy Markdown
Member

Previously blank node IDs were reassigned to new ones, drawing from some hashed in-memory pool. This unnecessarily wasted resources (we can keep the original IDs, they are fine for our use case), made a mess in the output, and could potentially lead to OOMs for very large files.

This somewhat speeds up the conversion for files that contain blank nodes. For OSM data, I saw ~10% better throughput in converting Turtle to Jelly.

It's still not amazingly fast, mostly due to the Turtle parser, but I'm hesitant to mess with it further – we may break something important.

Previously blank node IDs were reassigned to new ones, drawing from some hashed in-memory pool. This unnecessarily wasted resources (we can keep the original IDs, they are fine for our use case), made a mess in the output, and could potentially lead to OOMs for very large files.

This somewhat speeds up the conversion for files that contain blank nodes. For OSM data, I saw ~10% better throughput in converting Turtle to Jelly.

It's still not amazingly fast, mostly due to the Turtle parser, but I'm hesitant to mess with it further – we may break something important.
@Ostrzyciel Ostrzyciel merged commit 3d2228c into main Jul 12, 2025
7 checks passed
@Ostrzyciel Ostrzyciel deleted the piotr/to-jelly-preserve-bnodes branch July 12, 2025 16:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants