CPS-0024? | Canonical CBOR Serialization #1109
Conversation
Thank you for drafting a CPS about this. You're right, a lack of canonical encoding has definitely been frustrating for many of us Cardano developers, especially those of us working on alternative node implementations (I can speak primarily to my experience working on Amaru). I have only done a quick skim of the content so I won't comment on the particulars of the CPS right now, but hopefully I will have more time later to sit down and reread it carefully. But, to give some context: Sundae Labs has started work on a conformance testing suite, and Tweag has received treasury funding to work on exactly that actually, and they have a draft PR open already: #1083 (cc: @nc6 and @qnikst)
rphair left a comment
@HinsonSIDAN now that this is marked "Ready for Review" we'll add it as Triage which puts it on the agenda for our next CIP meeting (https://hackmd.io/@cip-editors/123).
In the meantime I think it'll be useful to get some co-review with this current CIP candidate (cc @qnikst @lehins @nc6 @Ryun1 @Crypto2099).
> - Multi-signature workflows where each signer's wallet may re-serialize the transaction
> - Cross-tool transaction building where fee calculations depend on exact byte size
Suggested change:
> - Hardware wallets, which require the keys in every map to be sorted from lowest value to highest.

this is one I've encountered, which was frustrating to figure out
@Ryun1 I did not put hardware wallets in since there is an active standard on it - https://cips.cardano.org/cip/CIP-0021. Perhaps you are referring to this one
The biggest issue with ordering is that Haskell (or any other programming language) ordering is not guaranteed to always match the CBOR ordering. In other words, I suspect enforcing CBOR ordering would make the Ledger implementation more complicated and error prone.
For example, what is the ordering for a key that is a tuple? Which is bigger, -1 or 10? According to CBOR ordering it would be -1, while in Haskell it would be 10.
Not saying it is impossible, but it is something that needs to be taken into consideration.
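To make the mismatch concrete, here is a small sketch in Python using the cbor2 library (an illustrative choice, not anything this thread prescribes): deterministic CBOR orders map keys by the bytes of their encodings, not by numeric value.

```python
# Minimal sketch using cbor2 (pip install cbor2); illustrative only.
import cbor2

print(cbor2.dumps(10).hex())  # '0a' -- a single byte 0x0a
print(cbor2.dumps(-1).hex())  # '20' -- a single byte 0x20

# Byte-wise, 0x0a < 0x20, so 10 sorts *before* -1 under CBOR's
# encoding-based key order, even though -1 < 10 numerically (and
# under Haskell's Ord instance for Integer).
print(sorted([10, -1], key=cbor2.dumps))  # [10, -1]
```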
@HinsonSIDAN we're leaving this Unconfirmed for now due to lack of current agreement & statements about why such serialisation would be considered "dangerous" (as per #1109 (comment)) even though the node itself is doing this all the time (cc @fallen-icarus @Crypto2099).
@colll78 we collectively recalled that you've posted some things about this issue on X... can you give us the gist of how you stand on the issue & any other points that the community has made about it?
@rphair I am drafting this CPS mostly due to the common perspective I gathered at the 2025 Vietnam Builder Fest and the 2025 UPLC conference in Edinburgh. I believe it is a pain for the vast majority of builders. I am happy to keep this CPS unconfirmed if that's not too painful an issue for people interested in adding more comments. One more comment / observation: it seems like this issue has bugged so many Cardano developers from junior to mid level that it scares away many potential talents from our community. However, by the time we get more senior in Cardano development, we are usually experienced enough to figure out workarounds for issues caused by this problem. We at SIDAN Lab & DeltaDeFi built some stupidly complicated dev tools for it, which we shared at the 2025 Edinburgh UPLC conference.
Eventually, no one got interested enough to solve it for others. Sorry that I cannot squeeze in time to attend this week's CIP meeting; I will try to share more at the next editors' meeting. Appreciate the moderation!
@HinsonSIDAN since you've already volunteered to attend the next meeting (https://hackmd.io/@cip-editors/124) as per #1103 (comment), I've also added this one to the agenda: keep in mind that this agenda will be packed due to requests to recap the Leios status, but at least we should have time for you to review the relationships above & reassure the technically oriented editors & observers about the "safety" of CBOR conversion.
Hello, sorry for jumping late to the party. From the perspective of the team that works on canonical ledger state serialisation, supporting a canonical CBOR standard is a very good initiative that we will strongly support and are ready to participate in. This CPS and CIP-0165 do overlap, or to be more concrete, CIP-0165 can use the results of this initiative whatever they are. In CIP-0165 we want to use a canonical representation; however, as CIP-0165 does not put any restrictions on the ledger and on-wire data, we just used what the current CBOR RFC defines, with small restrictions. In case the community aligns on a concrete implementation of CBOR serialisation, we will join the efforts, use that in the reference implementation, and fix the rules in the CIP.

From my personal perspective, in addition to defining canonical and deterministic serialisation rules, it would be nice to agree in the document on usage patterns and tags, like tag 258 for sets. This way it would be possible to rely more on CDDL files and validation tools, which at the moment can't enforce properties like "values in the set should be ordered". The discussion about safety and concrete examples should be held, especially because there is state of the art in the CBOR RFCs about how to define deterministic and canonical CBOR.
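For concreteness, here is what the set tag mentioned above looks like on the wire, sketched in Python with the cbor2 library (an illustrative choice): CBOR tag 258 marks an array as a set, and a deterministic profile can additionally require its elements to be ordered.

```python
# Minimal sketch: an ordered array wrapped in CBOR tag 258 (set).
import cbor2

tagged_set = cbor2.CBORTag(258, sorted([3, 1, 2]))
print(cbor2.dumps(tagged_set).hex())  # 'd9010283010203'
# d9 0102 -> tag 258; 83 01 02 03 -> array [1, 2, 3]
```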
My opinion is:
Any argument that enforcing canonical CBOR is insecure is moot, because every dapp, wallet, and application on Cardano must enforce it anyway for hardware wallet compatibility, and simply deciding to ignore hardware wallet compatibility is not an option for a product. As for why CBOR is bad: the size of data serialized with CBOR is double the size of data serialized with protobuf, while the deserialization time differs by only 1 ms. That is not even close to a reasonable tradeoff. The size of serialized data is perhaps the single largest factor in blockchain scalability. This data means that if everything was encoded with protobuf, the maximum throughput would be 2x whatever it is now. Even in a future when Leios exists this doesn't cease to matter; in fact, as throughput goes up this matters even more, because higher throughput means waste per message becomes much more important (because we have more messages).
Please don't make it custom; make it the existing canonical CBOR standard that we've all been forced to use already 😅
@HinsonSIDAN @fallen-icarus @colll78 @qnikst @Quantumplation editors would be ready to consider the candidacy of this CIP again, especially according to the lesser-of-two-evils explanation in #1109 (comment). If there's anything else we should be considering, please come to the CIP meeting tomorrow, if possible, and we'll get everything on record in any case... and @colll78 if there's any other suggestion of protobuf around CIPs going forward, I'll connect it back here.
I'm not sure that the two evils are as simple as stated. Yes, CBOR is not the most efficient protocol in terms of size and encoding/decoding speed, at least because it's schema-less and has to keep type tags. On the other hand, it does allow introspectability and allows defining global types and rules for them (read: tags). However, a schema alone is not enough to bring canonicity with either CBOR or protobuf, as we should be careful about how we store the data inside the fields and whether it is canonical. For example, reading the protobuf documentation I find the following property for the map type: the wire-format ordering of map entries is undefined, so logically identical maps may serialize differently.

In addition, there are other sources of protobuf criticism (e.g. https://reasonablypolymorphic.com/blog/protos-are-wrong/); whether one agrees with all the points or not, choosing protobuf may not be an easy choice that just works (tm) in our scenarios. What we need here is a serialisation protocol that is consistent, canonical, able to express the required data structures, and efficient enough. And CBOR is not a bad choice in this setting :)

As for defining our own CBOR, I'm not sure about the phrasing here, as canonical/deterministic CBOR definitions already exist in the RFCs. So likely we should carefully check whether they meet our requirements, agree on contradicting points (e.g. one RFC proposes length-ordered map keys and another value-based ordering), possibly set additional restrictions (e.g. whether we allow indefinite structures or not), and verify that all our choices are safe. As a result it may happen that the resulting rules can be fully based on an existing RFC (so it's not a new CBOR). In addition, we can define extra rules, like always using the set tag for sets, in a way that does not change the encoding and stays compatible with libraries implementing the existing RFCs.
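The "contradicting points" between the RFCs are easy to see concretely. A sketch in Python (cbor2 used only to produce key encodings; an illustrative choice): RFC 7049 §3.9 canonical form sorts map keys by encoding length first, then bytes, while RFC 8949 §4.2.1 core deterministic encoding sorts by bytes alone, and the two disagree.

```python
# Minimal sketch of the two map-key orderings the RFCs define.
import cbor2

keys = [100, -1]
enc = {k: cbor2.dumps(k) for k in keys}  # 100 -> 0x1864 (2 bytes), -1 -> 0x20 (1 byte)

rfc7049 = sorted(keys, key=lambda k: (len(enc[k]), enc[k]))  # length first, then bytes
rfc8949 = sorted(keys, key=lambda k: enc[k])                 # bytes only
print(rfc7049)  # [-1, 100]
print(rfc8949)  # [100, -1]
```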
An effort towards this has already been made via CIPs: CIP-0114 | CBOR Tags Registry
It will be harder for sure; my point wasn't that this is a good option now, but that it's something we should have done from the start (in which case we would already have made progress in creating a canonical representation by now). As I said, at this point CBOR is far too ingrained in our ecosystem and all downstream tooling for us to even really consider this approach. Why would this approach be the best option if we could start from scratch? The proof can be seen by observing other high performance blockchains. Solana uses a custom canonical binary wire format built from its Rust structs + shortvec varints; Aptos uses Binary Canonical Serialization (BCS), which, as implied by its name, is a custom canonical binary serialization format. Sui also uses BCS. Near uses Binary Object Representation Serializer for Hashing (Borsh), a canonical, non-self-describing binary format. Cosmos and all Cosmos SDK chains use protobuf. Algorand uses canonical MsgPack. There is a consistent trend amongst all of these high performance blockchains: they all standardised on deterministic, non-self-describing binary serialization formats.

The benefit of having the serialization format be self-describing is relatively useless given the abundance of modern techniques for circumventing the need for introspectability entirely (these techniques were mostly pioneered by other blockchains explicitly to solve this problem). For example, even though Aptos' BCS requires the data type of a serialized value to be enforced by the application, this requirement is easily fulfilled by using unique hash seeds for each data type (a technique they inherited from the Diem blockchain; see the sketch after this comment). Each of the high performance chains listed above employs its own techniques to circumvent any need for self-describing properties in its serialization format. So my conclusion is that either every other high performance chain has made the wrong choice, or we have.

Personally, I believe that when every other blockchain on the market since 2017 is using non-self-describing serialization formats, and we are the only ones using a self-describing format, then it's more likely than not that we are the ones who made the non-optimal choice.
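The "unique hash seeds" technique mentioned above can be sketched as plain domain separation; the names and seed prefix below are hypothetical, for illustration only, and are not Aptos' actual constants.

```python
# Hypothetical sketch of per-type hash seeds (domain separation).
import hashlib

def type_seed(type_name: str) -> bytes:
    # Seed derived from the type's name; the "EXAMPLE::" prefix is made up.
    return hashlib.sha3_256(b"EXAMPLE::" + type_name.encode()).digest()

def signing_bytes(type_name: str, payload: bytes) -> bytes:
    # Byte-identical payloads of different types never produce the same
    # signing input, so the wire format itself needs no type tags.
    return type_seed(type_name) + payload

assert signing_bytes("RawTransaction", b"\x01") != signing_bytes("Multisig", b"\x01")
```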
rphair left a comment
Thanks @colll78 @qnikst for helping us understand the CBOR debate, even if there are some unhappy consequences today that we can't do anything about. My understanding & the general impression from yesterday's CIP meeting is that this statement of what to do about contemporary treatment of CBOR is still well-presented and usable in this CPS draft itself.
After further review at the meeting we've decided to progress this as a CPS candidate — @HinsonSIDAN please rename this directory to CPS-0024. 🎉
Some requests for expert review were emphasised at the meeting to try to keep this proposal concrete and usable (i.e. something that we can feel good about merging, not just a "wish list"):
- @qnikst sounded happy to review this as a complete document & the editors would look forward to ensuring all statements & recommendations are in order... and especially that the CPS is written so it can lead to CIPs that satisfy it. @colll78 whatever you can provide as review would be equally appreciated for the same reasons.
- @lehins though this is not about any structure in the Ledger, we did think it would be nice to have your opinion about anything the CPS should contain that would make resulting CIPs more interoperable with the Ledger, and with each other, where CBOR is concerned... if there's someone else you'd prefer we tag about this particular issue, please feel free to pass it on.
lehins left a comment
I am very much against canonical CBOR being enforced at the Ledger level, and in general. Here is my reasoning:
- It drastically increases complexity and opens up possibilities for new bugs to be introduced during such a transition.
- The non-canonical CBOR deserialization that we have today will have to be supported indefinitely anyway due to the on-chain history.
- It is impossible to fully switch to canonical CBOR for on-chain data because of Plutus data. There are likely transaction outputs locked on chain today requiring data that is expected to have a non-canonical form, i.e. switching to canonical CBOR would lock those funds forever.
- It does not solve any issue, since the proper solution is correct tooling support of the existing standard.
- It promotes re-serialization of transactions that ought to be immutable for the purpose of signing. Re-serialization for the purpose of signing is just a bad idea in general.

This is my professional opinion, which means that if the Cardano community as a whole is willing to sacrifice safety to make Cardano more user friendly, then the Ledger team will have no choice but to implement it regardless of my opinion.
Disclaimer: I am not a big fan of the CBOR standard in general, and if this discussion had happened at the design stage of the initial version of Cardano, my opinion might have been different, or at least I could have been persuaded to change it much more easily. Today, however, I very much believe this ship has sailed, and it would be much safer to support full CBOR instead of enforcing a drastic change like this.
> **Transaction Hash Instability**: When a transaction is passed between tools or wallets for signing, each may re-serialize it differently. Since transaction hashes are computed over CBOR bytes, logically identical transactions produce different hashes. This breaks:
> - Multi-signature workflows where each signer's wallet may re-serialize the transaction
I see this as a problem in this CPS, since re-serialization should never happen for the purpose of signing, and if it does, it is because the tooling is doing it incorrectly. I would consider it a bug in the software regardless of whether canonical or non-canonical serialization is used.
Whenever a payload is given to a program for the purpose of signing, that payload should not be messed with. The same applies to transactions: if all you need is to sign the transaction, whether for multi-sig or any other purpose, the transaction hash and serialization should not be recomputed; the original bytes must be retained, and that is what needs to be signed.
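To make the quoted instability concrete, a sketch in Python with cbor2 (an illustrative library choice) and blake2b-256, the hash Cardano uses for transaction IDs: the same logical map encodes to different bytes depending on key order, and only a canonical profile collapses them.

```python
# Minimal sketch: one logical map, two valid encodings, two hashes.
import cbor2, hashlib

tx_a = cbor2.dumps({0: b"body", 1: b"wit"})  # keys in one insertion order
tx_b = cbor2.dumps({1: b"wit", 0: b"body"})  # same map, other order
print(tx_a.hex() == tx_b.hex())              # False -- different bytes

h = lambda bs: hashlib.blake2b(bs, digest_size=32).hexdigest()
print(h(tx_a) == h(tx_b))                    # False -- different "tx ids"

# A canonical profile removes the ambiguity:
print(cbor2.dumps({0: b"body", 1: b"wit"}, canonical=True) ==
      cbor2.dumps({1: b"wit", 0: b"body"}, canonical=True))  # True
```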
> - Cross-tool transaction building where fee calculations depend on exact byte size
I don't quite follow this one. If there is any change to a transaction body, then the bytes might change, which could affect the fee. That is quite normal and expected.
If you mean that some tooling might want to change some part of the transaction body in a way that should not itself change the size if canonical CBOR were used (e.g. change a required signer, i.e. swap one hash for another), then I can see it as an argument, but I don't understand why such tooling couldn't just recompute the minimum fee?
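For reference, recomputing the minimum fee is a one-liner under the linear fee rule; the parameter values below are assumed to be the mainnet minFeeA/minFeeB at the time of writing.

```python
# Sketch of the linear min-fee rule: fee = a * size + b (in lovelace).
def min_fee(tx_size_bytes: int, a: int = 44, b: int = 155_381) -> int:
    return a * tx_size_bytes + b

print(min_fee(300))  # 168581
print(min_fee(305))  # 168801 -- 5 extra bytes cost 220 lovelace
```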
> **Script Inconsistencies**: Smart contracts suffer from unpredictable script hashes and reference script mismatches across tools. The same compiled script may produce different hashes depending on the library used to apply parameters or CBOR-serialize the script.
I don't believe compiled scripts actually use CBOR serialization. @zliu41 will have a definitive answer, but from what I know, Plutus uses the flat library for serialization.
This is correct. They're wrapped in a script type defined in the CDDL within a transaction, but the compiled scripts themselves are just flat encoded bytes.
Not sure how to describe it more precisely; the issue we faced is that different uplc libraries in different languages behave differently, i.e. the Aiken uplc crate in Rust and the HLabs uplc npm package are implemented differently, such that in Mesh users get different script cbor when using different cores. It is a massive devexp issue.
> users get different script cbor if using different cores.

@HinsonSIDAN plutus core is not using CBOR for serialization. So, although this issue looks related, it has nothing to do with CBOR! It has to do with the fact that there is no standard for plutus core serialization at all.
In other words, if you'd like to use canonical or non-canonical CBOR for Plutus core serialization, you could create a separate CIP for it, but I suspect that there will be some pushback there as well, since the custom serialization that is currently in use is likely more efficient than CBOR.
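One common way this confusion shows up in practice, sketched below with placeholder bytes standing in for a real flat-encoded program: the flat bytes get wrapped in a CBOR byte string, and often wrapped again in envelope formats, so tools that disagree on the number of wrappings print different "script cbor" for the same program.

```python
# Sketch of single vs "double" CBOR wrapping of flat-encoded script bytes.
import cbor2

flat_program = bytes.fromhex("0100")  # placeholder, not a real UPLC program
single = cbor2.dumps(flat_program)    # flat bytes as a CBOR byte string
double = cbor2.dumps(single)          # that byte string wrapped once more
print(single.hex())  # '420100'
print(double.hex())  # '43420100' -- different hex for the "same" script
```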
> **Script Hash Consistency**: A developer publishes a reference script on-chain, then references it from their off-chain code. Currently, locally computed script hashes may not match the on-chain version due to encoding differences. Canonical serialization guarantees hash consistency across compilation and deployment pipelines.
> **Library Maintainers**: Serialization library authors currently must support multiple encoding strategies for compatibility. With a standard, they can focus on a single canonical implementation, reducing maintenance burden and improving deserialization reliability.
Again, serialization libraries will have to support both non-canonical CBOR for historical data and canonical CBOR for new data. So, IMHO, serialization library authors will be impacted negatively by this CIP.
> ### Optional Goals
> 4. **Ledger-level enforcement**: If community consensus supports it, implement validation rules in the ledger to guarantee compliance (requires hardfork and backward compatibility strategy).
Deserialization in Ledger is not part of the Ledger rules. It is a totally separate stage compared to transaction validation.
Suggested change:
> 4. **Ledger-level enforcement**: If community consensus supports it, canonical deserialization must be correctly implemented in the ledger to guarantee compliance (requires hardfork, backward compatibility, and forward migration strategies).
> This CPS is successfully resolved when:
> - A canonical CBOR serialization CIP reaches "Active" status with clear specifications
> - At least 80% of major libraries and wallets demonstrate compliance
Is there a way to quantify major libraries today?
Also, there is no mention of the Ledger actually having it implemented:

Suggested change:
> - At least 80% of major libraries and wallets demonstrate compliance
> - cardano-node has a hard fork ready for a new Ledger era that changes its deserializers to canonical CBOR.
Yes, indeed this CPS is not intended to affect the ledger.
Well, there is a section about it:

> Should the standard be enforced at the ledger level?

As I already pointed out, I would not want this CPS to affect the Ledger either. I am a bit skeptical about standards that aren't enforced by the chain itself, but there are standards like these that have proved themselves to work. So, at the very least, I would suggest adding all the drawbacks that I've mentioned to the section that suggests that this standard "should be enforced at the ledger level".
> When multiple valid CBOR encodings exist, how should we decide which becomes canonical?
> - **Efficiency**: Minimize transaction size (e.g., smallest integer encoding, definite over indefinite length)
Definite- and indefinite-length encodings are each more efficient depending on the number of elements. When there are fewer than 24 elements in an array, definite-length encoding is more efficient, while a large element count benefits from indefinite-length encoding.
So, canonical form will make some encodings less efficient.
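A sketch of the framing arithmetic behind this, assuming RFC 8949 header sizes: a definite-length array spends 1 byte of header for counts up to 23 and grows with the count, while an indefinite-length array always spends 2 bytes (start byte plus break byte).

```python
# Header overhead of definite- vs indefinite-length CBOR arrays.
def definite_header_size(count: int) -> int:
    if count <= 23:          return 1  # count fits in the initial byte
    if count <= 0xFF:        return 2  # initial byte + 1-byte count
    if count <= 0xFFFF:      return 3  # initial byte + 2-byte count
    if count <= 0xFFFFFFFF:  return 5  # initial byte + 4-byte count
    return 9                           # initial byte + 8-byte count

INDEFINITE_OVERHEAD = 2  # 0x9f start marker + 0xff break marker

for n in (10, 100, 100_000):
    print(n, definite_header_size(n), INDEFINITE_OVERHEAD)
# 10 -> definite wins (1 vs 2); 100 -> tie (2 vs 2); 100_000 -> indefinite wins (5 vs 2)
```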
> ## Open Questions
> ### What are the guiding principles for choosing the canonical form?
If a custom canonical CBOR standard is to be designed, then there is a much higher chance that it will not be implemented correctly by all of the tools. Grabbing an existing standard and accepting any potential drawbacks it may have (e.g. non-optimal size) would be a safer bet IMHO.
rphair left a comment
@lehins I appreciate your detailed review and your providing a realistically complete look at the problem. I trust @qnikst / Tweag will also respond about their own considerations of all these issues.
Even if Ledger considerations prevent further progress, I don't regret the editors' assignment of a number and considering this a CPS "candidate" — since the standardisation problem will be discussed in the dev community no matter what, and this review thread should avoid fragmenting that discussion & might produce some acceptable resolutions here.
In any case @HinsonSIDAN the document would need a lot more detail & depth to reflect the considerations above. I don't think it would be proper to rely on CIPs to address these concerns: otherwise we would get a dispersion of approaches, instead of the unified response we would need to produce particular CIPs from (perhaps by category: Ledger, Tools, Wallets).
We should also be prepared for the possibility that this CPS may remain unmerged if the Ledger and community developer points of view can't be reconciled.
@lehins I appreciate your review a lot! I agree with most of your concerns if we see the canonical form as built into the ledger. Originally I created this CPS with multiple categories since it could be built / supported in different parts of the community - #1109 (comment). If we think of the standard as applied only at the level of tools, we can still solve the devexp issue without the need to change any ledger implementation. And the list, quantified as, for example, the most active transaction building libraries and their dependencies, is not too difficult to obtain for any CIP candidate either, such as:
- Mesh
- whisky
- Aiken uplc
- csl / pallas
- cardano-sdk-js
- HLabs uplc
- Lucid evo
- pycardano
- apollo
- CSL
- CML
- etc ...

So eventually we can have a map of libraries that are actively following such a standard, at least by default, so it becomes safe for 99% of DApp builders to never care about CBOR again. If the standard is built at the tools level only, I think some of the concerns above become irrelevant. Let me know how it feels to see this CPS from this perspective, appreciate that!
@rphair With respect to the canonical ledger state, it is a feature that is totally new, which makes such a decision much easier. Because of that, and because @qnikst is working on a canonical ledger state, I believe using canonical CBOR there would be a great idea.
I 100% agree with you. I've seen this idea of canonical CBOR being thrown around in all sorts of inappropriate places like Twitter and such. This is a much better place to have a technical discussion about this.
This CPS emerged from community feedback at events like Cardano Builder Fest 2025, where developers identified CBOR serialization fragmentation as a critical pain point hindering ecosystem maturity.
This CPS addresses the growing interoperability challenges caused by non-deterministic CBOR serialization across Cardano's tooling ecosystem. The same logical transaction or script can be encoded in multiple valid ways, leading to different hashes and breaking multi-signature workflows, cross-tool transaction building, and script reference consistency.
Ideally we can establish a canonical CBOR serialization standard that would be adopted across major libraries and wallets, ensuring predictable behavior and reducing the development friction that currently exists when working across different tools.
(rendered latest document)