CIP-0051? | Preserve submitter's ordering of transaction inputs #231

szg251 · 2022-03-03T11:10:49Z

This CIP proposes a small change to the ledger specification allowing for optimisation in certain script validator executions.

…erations

SebastienGllmt · 2022-03-10T14:18:18Z

There was some talk about switching inputs from a set to a list for Babbage because set is messy to work with for Plutus. I'm not sure if this ended up being included in the list of changes that will make it into the update though

WhatisRT · 2022-03-10T14:30:46Z

The script has to properly validate the information it uses anyway, which should always require traversing the full input set. Relying on the ordering of the inputs seems very dangerous, since an attacker could use this to hide information from a script that isn't carefully validating its inputs. And checking that no such hidden information is present is pretty much exactly what is required to find the relevant inputs.

michaelpj · 2022-03-10T14:31:25Z

cip-gege251-reliably-ordered-tx-inputs.md

+
+## Backward Compatibility
+
+For ledger transactions before this CIP is implemented, the underlying CBOR data structure only changes semantically, and the difference between input order at submission and at script evaluation is irrelevant, because the scripts in those transactions do not rely on pattern matching by position. Whereas, transactions after this CIP could optionally do positional pattern matching.


because the scripts in those transactions do not rely on pattern matching by position

This is false. They perfectly well can do, it's weird, but people write lots of weird stuff.

This CIP changes the semantics of the transaction context given to scripts in an observable fashion, and as such it need a new Plutus language version, I'm afraid (see #215).

When is the next Plutus language version planned to be released? Will the June hardfork be Plutus V1 or Plutus V2?

June HF will be V2, but it is too late for additional changes to go into that.

So, this CIP would have to wait for V3?

JaredCorduan · 2022-03-10T14:42:43Z

There was some talk about switching inputs from a set to a list for Babbage because set is messy to work with for Plutus. I'm not sure if this ended up being included in the list of changes that will make it into the update though

No, this is not a part of the Babbage spec, for the reasons @WhatisRT mentioned above. At the very least, in needs more careful thought.

michaelpj · 2022-03-10T16:45:18Z

Oh, also it's wrong to describe this as "deterministic" transaction input ordering. The ordering is definitely deterministic, it's just not the same as the one in the transaction format. "Fixed" transaction ordering might be a better way to describe it.

GeorgeFlerovsky · 2022-03-15T03:10:06Z

The script has to properly validate the information it uses anyway, which should always require traversing the full input set. Relying on the ordering of the inputs seems very dangerous, since an attacker could use this to hide information from a script that isn't carefully validating its inputs. And checking that no such hidden information is present is pretty much exactly what is required to find the relevant inputs.

As @michaelpj mentioned (#231 (comment)), script inputs are already ordered and scripts can already rely on that ordering if they so choose. This CIP does not change that fact.

This CIP merely seeks to preserve the order of inputs that the transaction submitter sets when constructing the transaction, instead of the current approach of ordering transactions lexicographically by (1) their originating transaction's ID and (2) by their output index within that transaction. This would allow the transaction submitter to order the inputs in a way that better aligns to script logic, in the same way that this can currently be done with transaction outputs.

For example, consider a protocol with a transaction class that zips a variable-length (i.e. batchable) list of "pairs" [(A id,B id)] of inputs into a list of outputs [C id]. If input ordering is preserved, then the submitter can submit a transaction with the following inputs and outputs:

inputs = [A 1, B 1, A 3, B 3, A 5, B 5]
outputs = [C 1, C 3, C 5]

The script could iterate in parallel through the outputs (one element at a time) and inputs (two elements at a time), and verify that:

They are relevant to the protocol – i.e. they contain a token, datum, and/or script address that indicates that they belong to the protocol.
That they match each other – e.g. (A 1, B 1, C 1) should match, but both (A 1, B 2, C 1) and (A 1, A 1, C 1) shouldn't match.

This is quite straightforward and efficient to achieve if the script and transaction submitter can coordinate via the input/output ordering. It is much harder and inefficient (O(n log(n))?) if the inputs are ordered according to their original (TxId, OutputIndex), which is irrelevant in most contexts.

Your point about hidden information is well taken. However, I think this risk can be mitigated with a cheap, linear traversal through the rest of the inputs/outputs to check that they are not "relevant" to the transaction at hand. In any case, I don't think that this should prevent us from allowing the transaction submitter to set an input ordering that gets preserved.

GeorgeFlerovsky · 2022-03-15T03:25:49Z

Oh, also it's wrong to describe this as "deterministic" transaction input ordering. The ordering is definitely deterministic, it's just not the same as the one in the transaction format. "Fixed" transaction ordering might be a better way to describe it.

I suggest "Preserve submitter's ordering of transaction inputs"

michaelpj · 2022-03-15T10:03:35Z

Relying on the ordering of the inputs seems very dangerous, since an attacker could use this to hide information from a script that isn't carefully validating its inputs.

I also don't think this is too much of a concern: many applications don't care about what's going on with the other inputs, and can safely ignore them. And if you do care, well, you already have to look at them all, and that won't change.

michaelpj · 2022-03-22T16:19:22Z

cip-gege251-reliably-ordered-tx-inputs.md

+
+## Backward Compatibility
+
+For ledger transactions before this CIP is implemented, the underlying CBOR data structure only changes semantically, and the difference between input order at submission and at script evaluation is irrelevant for the ledger. However, because scripts could rely on pattern matching by position, a new Plutus version is advised.


Not advised, required. There is no way around it. It's a non-backwards-compatible change, it must go in a new version.

The ledger has to be able to translate the wire format from previous eras into new ones, so the CDDL will need to be:

transaction_body = { 0 : set<transaction_input> / [* transaction_input] ; inputs

And therefore the construction of the redeemers will need to be aware of which serialization option was used, so that they can point to the correct use sites (implicit lexicographic ordering for the first, explicit ordering in the second), assuming we don't want multiple types of redeemer pointers.

When transaction inputs which come across the wire as set<transaction_input> are deserialized, they will need to be sorted into the in-memory list representation so that the pointers can be ignorant of the wire format. This change should be mostly invisible to Plutus, since the translation from the ledger state to the Plutus transaction context already supplies a sorted list. You could, though, cook up weird scripts. The obvious contrived example is the script that returns true iff the inputs are sorted. This script would always return true for the old style transactions, but could be false for new ones.

@JaredCorduan I am not sure to follow your point about backward compatible when it comes to the CDDL. The format surely is backward compatible w.r.t the bytes on-the-wire, because in the end, a set<a> is merely a [ * a]. It is only the interpretation / semantic of that binary data which changes.

Or, are you thinking that there should be a way in the binary serialization to distinguish lexicographically-ordered inputs from submitter-ordered inputs?

Or, are you thinking that there should be a way in the binary serialization to distinguish lexicographically-ordered inputs from submitter-ordered inputs?

Correct. I feel an obligation to not change the semantics of previous era wire formats, since we will still be supporting them.

WhatisRT · 2022-03-31T12:01:37Z

I think there is a slippery slope with this CIP. Let's say this is implemented, what about for example preservation of 'minting order', 'withdrawal order' or even the order a Value has? All of those things are inherently ordered in the serialization and it's not unthinkable that one would want to preserve them for similar reasons. Maybe the argument 'it's much more useful for inputs' is already sufficient, but I'd like to have this addressed a bit more.

WhatisRT · 2022-06-21T08:15:08Z

I have another concern about this, which is interoperability. If this CIP would be implemented, I'd expect that most scripts (at some point in the future) would require the inputs that script cares about to be at the beginning of the list of inputs, simply because of convenience. This means that people would only be able to execute multiple scripts in the same transaction if those scripts were explicitly designed with that in mind. This is a significant change from how things work today: scripts that don't know and care about each other can be run in the same transaction, even if they were never intended to be executed simultaneously with other scripts.

michaelpj · 2022-06-21T08:28:57Z

That same issue is present today with, say, outputs. If scripts A and B both care about outputs O_A and O_B, it's easy for A to be lazily written to require O_A to be the first output, and likewise for B. So I'm not sure this makes things worse.

KtorZ · 2022-08-16T07:11:34Z

cip-gege251-reliably-ordered-tx-inputs.md

+
+## Abstract
+
+This CIP proposes a way to improve efficiency of handling transaction inputs in Plutus scripts by changing the way they are representend in the ledger.


Suggested change

This CIP proposes a way to improve efficiency of handling transaction inputs in Plutus scripts by changing the way they are representend in the ledger.

This CIP proposes a way to improve efficiency of handling transaction inputs in Plutus scripts by changing the way they are representend in the ledger.represented

KtorZ · 2022-08-16T07:16:46Z

cip-gege251-reliably-ordered-tx-inputs.md

+
+## Security considerations
+
+We considered the possibility of a man-in-the-middle attack, where the adversary could change the transaction input ordering, and altering the behaviour of a script. However the existing ledger rules for deriving txid are sufficient to guarantee the integrity of transactions, because the inputs (whether a set or a list) are included in the transaction body that gets hashed to a txid. Therefore, no changes would be required to txid derivation.


I am not sure to understand the point being made here and in particular, why transaction integrity is something that helps mitigating an adversary changing the order of inputs?

KtorZ · 2022-08-16T07:19:08Z

cip-gege251-reliably-ordered-tx-inputs.md

+
+## Rationale
+
+In theory we could pre-calculate the index of the searched transaction input off-chain, by relying on the ordering of transaction inputs (lexicographic ordering on `(TransactionID, TransactionIndex)`). But then, to pass this information to the script, we would need to use a redeemer, which could defeat the purpose of this optimisation with it's own fees. Also, we could only reliably do this after transaction balancing, but changing the transaction after balancing could affect fees, rendering the transaction unbalanced again. This method also introduces unnecessary complexity.


which could defeat the purpose of this optimization with it's own fees.

I am not sure that framing this as an optimization to save on fees is the best justification. Surely, it improves the developer experience and makes easier to work with transaction inputs. I think one of the most common problem with that is calculation of redeemer pointers and how it conflicts with transaction balancing (adding new inputs to cover for fee may invalid all existing pointers on inputs). If the argument is made about saving on fees, then it should come backed with numbers showing how much to help weigh whether the change is worth it or not.

WhatisRT · 2022-09-15T11:06:11Z

I happened to be thinking about this again, and I think what @michaelpj's comment suggests is that the order of the outputs should not have been preserved either. We could have just sorted them as well, which would have gives us better interoperability defaults. We can't change that anymore since people might rely on it somewhere, but we could strongly discourage using that information in our APIs. So while I'm not sure this CIP would make things worse, I'm also not sure that it doesn't make it worse.

michaelpj · 2022-09-15T14:14:45Z

I happened to be thinking about this again, and I think what @michaelpj's comment suggests is that the order of the outputs should not have been preserved either.

I think this is true if what we primarily cared about was encouraging scripts not to make bad assumptions. But in fact the aim was mostly just to accurately represent the structure of the transaction, and I still think that's a good default.

WhatisRT · 2022-09-16T17:09:56Z

I might have said what I meant in a way that's a bit confusing: I wouldn't want to throw away ledger information, I think that the ledger should reorder them and that should be accurately represented. In an alternate universe we could've had a version of Shelley that does sort outputs, and in hindsight I'd have preferred that.

rphair

This is up for Last Check at the CIP meeting today: although without response from the author about some issues posted above, I believe this has been vetted by IOG according to this announcement: input-output-hk/Developer-Experience-working-group#20

So (I hope @KtorZ will correct me if I'm wrong in this thinking) this is fait accompli because it's already being implemented... and therefore should be merged unless the explanations are misleading or not understandable, or the document is improperly formatted somehow (it looks OK to me).

rphair · 2022-11-30T09:46:52Z

(last label application was my operator error)

michaelpj · 2022-11-30T09:54:28Z

So this is fait accompli because it's already being implemented

This is not being implemented AFAIK.

WhatisRT · 2022-11-30T11:17:06Z

Yes, no plans on the ledger side. I also still believe that this is a step in the wrong direction, since it doesn't actually simplify much as scripts still have to check that the inputs are what they expect, see above. Instead it adds another pitfall (forgetting to properly check that the order is correct) and it may worsen interoperability.

rphair · 2022-12-06T17:51:25Z

@gege251 closing because of lack of support from Ledger & Plutus team(s) indicated above.

colll78 · 2023-07-01T02:59:13Z

Please reopen this. If you guys saw the codebases of any of the DEXs you would understand why this is such a big deal. The crazy non-sense they all do (that introduces a ton of complexity and room for vulnerabilities) to reconstruct the necessary of inputs within the transaction is actually revolting, and the DEXs that don't do this ugly reconstruction can only fit roughly half as many requests in a batch. In addition to the absolutely massive efficiency benefits, it actually solves the double satisfaction issue, because you can match the corresponding outputs based on their indices (ie input at index 1 must correspond to output at index 1).

rphair · 2023-07-01T12:33:26Z

@colll78 (@euonymos) I hear your statement (and support) but this is not an issue for implementation: it's a technical document which particularly the Plutus team decided not to support. It would make no difference if we reopened or even merged this pull request as a CIP if it's not something they're going to do.

@michaelpj @WhatisRT on the off-chance that the last 7 months have led to the IOG Plutus or Ledger team implementing such a thing, or anything else that would mitigate the problem reported especially by DEXes, then can you post here? If so we can add it to the CIP meeting agenda in 3 days' time when Plutus reps are expected to be present, and maybe reopen this.

@colll78 the general problem of DEXes sounds like a good subject for a CPS (Cardano Problem Statement) if it remains unaddressed by any other CIP or development plan. Can you check first to see if any of these are applicable? https://github.com/cardano-foundation/CIPs#cardano-problem-statements-cps-1

... especially this one, which I think covered the "double satisfaction" problem? https://github.com/cardano-foundation/CIPs/tree/master/CPS-0005

WhatisRT · 2023-07-03T12:21:38Z

On the ledger side there are no plans to support this (or anything like it), and even if we wanted to support it there's no way this would happen any time soon. From what I hear, the Plutus team is also incredibly busy with things that have higher priority.

I've pointed out how to emulate this feature using redeemers a bunch of times, and it's not all that big of a deal to implement (something like 5 lines of Haskell code, maybe more when you need something extremely optimized). I'm open to discussing this issue further, but it should be motivated by good real-world examples (note that the example given in this CIP would be a horrible idea in the real world).

rphair · 2024-07-30T00:18:19Z

Historical note: this effort is continuing through candidate CIP-0128: #758

szg251 added 2 commits March 1, 2022 12:51

First draft for Deterministic transaction input ordering

b2409a2

Add more clarification for backward compatibility and security consid…

3cde608

…erations

michaelpj reviewed Mar 10, 2022

View reviewed changes

szg251 added 2 commits March 15, 2022 14:32

Update title

227b98a

Amend backward compatibility

c929e3f

szg251 changed the title ~~Deterministic transaction input ordering~~ Preserve submitter's ordering of transaction inputs Mar 15, 2022

GeorgeFlerovsky approved these changes Mar 22, 2022

View reviewed changes

michaelpj reviewed Mar 22, 2022

View reviewed changes

Fix: new Plutus version is required

391264a

michaelpj mentioned this pull request Mar 31, 2022

What guarantees do we have about about the internal structure of the things in ScriptContext? IntersectMBO/plutus#4509

Open

KtorZ changed the title ~~Preserve submitter's ordering of transaction inputs~~ CIP-0051? | Preserve submitter's ordering of transaction inputs May 11, 2022

KtorZ added tentative CIP labels May 11, 2022

KtorZ reviewed Aug 16, 2022

View reviewed changes

Benjmhart mentioned this pull request Oct 2, 2022

Priority CIP's for late 2022/Early 2023 input-output-hk/Developer-Experience-working-group#20

Closed

rphair approved these changes Nov 30, 2022

View reviewed changes

rphair added the Waiting for Author Proposal showing lack of activity or participation from their authors. label Nov 30, 2022

rphair removed the Waiting for Author Proposal showing lack of activity or participation from their authors. label Nov 30, 2022

KtorZ added the Likely Deprecated Proposal likely abandonned, staged for closing without merge. label Nov 30, 2022

rphair closed this Dec 6, 2022

WhatisRT mentioned this pull request Feb 8, 2023

Why inputs for TX are treated as Set? IntersectMBO/plutus#4296

Closed

KtorZ mentioned this pull request Mar 24, 2023

hashes do not match (sometimes)? aiken-lang/aiken#470

Closed

1 task

WhatisRT mentioned this pull request Jan 11, 2024

One more change is needed for ScriptPurpose IntersectMBO/plutus#5709

Closed

This was referenced Jan 30, 2024

CIP-0112? | Observe script type #749

Merged

CIP-0128? | Preserving Order of Transaction Inputs #758

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CIP-0051? | Preserve submitter's ordering of transaction inputs #231

CIP-0051? | Preserve submitter's ordering of transaction inputs #231

szg251 commented Mar 3, 2022

SebastienGllmt commented Mar 10, 2022

WhatisRT commented Mar 10, 2022

michaelpj Mar 10, 2022

GeorgeFlerovsky Mar 15, 2022

michaelpj Mar 15, 2022

GeorgeFlerovsky Mar 22, 2022

michaelpj Mar 22, 2022

JaredCorduan commented Mar 10, 2022

michaelpj commented Mar 10, 2022

GeorgeFlerovsky commented Mar 15, 2022 •

edited

Loading

GeorgeFlerovsky commented Mar 15, 2022 •

edited

Loading

michaelpj commented Mar 15, 2022

michaelpj Mar 22, 2022

JaredCorduan Mar 24, 2022

KtorZ Aug 16, 2022

JaredCorduan Aug 16, 2022

WhatisRT commented Mar 31, 2022

WhatisRT commented Jun 21, 2022

michaelpj commented Jun 21, 2022

KtorZ Aug 16, 2022

KtorZ Aug 16, 2022

KtorZ Aug 16, 2022

WhatisRT commented Sep 15, 2022

michaelpj commented Sep 15, 2022

WhatisRT commented Sep 16, 2022

rphair left a comment

rphair commented Nov 30, 2022

michaelpj commented Nov 30, 2022

WhatisRT commented Nov 30, 2022

rphair commented Dec 6, 2022 •

edited by KtorZ

Loading

colll78 commented Jul 1, 2023

rphair commented Jul 1, 2023 •

edited

Loading

WhatisRT commented Jul 3, 2023 •

edited

Loading

rphair commented Jul 30, 2024


		## Backward Compatibility

		For ledger transactions before this CIP is implemented, the underlying CBOR data structure only changes semantically, and the difference between input order at submission and at script evaluation is irrelevant, because the scripts in those transactions do not rely on pattern matching by position. Whereas, transactions after this CIP could optionally do positional pattern matching.


		## Abstract

		This CIP proposes a way to improve efficiency of handling transaction inputs in Plutus scripts by changing the way they are representend in the ledger.


		## Security considerations

		We considered the possibility of a man-in-the-middle attack, where the adversary could change the transaction input ordering, and altering the behaviour of a script. However the existing ledger rules for deriving txid are sufficient to guarantee the integrity of transactions, because the inputs (whether a set or a list) are included in the transaction body that gets hashed to a txid. Therefore, no changes would be required to txid derivation.


		## Rationale

		In theory we could pre-calculate the index of the searched transaction input off-chain, by relying on the ordering of transaction inputs (lexicographic ordering on `(TransactionID, TransactionIndex)`). But then, to pass this information to the script, we would need to use a redeemer, which could defeat the purpose of this optimisation with it's own fees. Also, we could only reliably do this after transaction balancing, but changing the transaction after balancing could affect fees, rendering the transaction unbalanced again. This method also introduces unnecessary complexity.

CIP-0051? | Preserve submitter's ordering of transaction inputs #231

CIP-0051? | Preserve submitter's ordering of transaction inputs #231

Conversation

szg251 commented Mar 3, 2022

SebastienGllmt commented Mar 10, 2022

WhatisRT commented Mar 10, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

JaredCorduan commented Mar 10, 2022

michaelpj commented Mar 10, 2022

GeorgeFlerovsky commented Mar 15, 2022 • edited Loading

GeorgeFlerovsky commented Mar 15, 2022 • edited Loading

michaelpj commented Mar 15, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

WhatisRT commented Mar 31, 2022

WhatisRT commented Jun 21, 2022

michaelpj commented Jun 21, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

WhatisRT commented Sep 15, 2022

michaelpj commented Sep 15, 2022

WhatisRT commented Sep 16, 2022

rphair left a comment

Choose a reason for hiding this comment

rphair commented Nov 30, 2022

michaelpj commented Nov 30, 2022

WhatisRT commented Nov 30, 2022

rphair commented Dec 6, 2022 • edited by KtorZ Loading

colll78 commented Jul 1, 2023

rphair commented Jul 1, 2023 • edited Loading

WhatisRT commented Jul 3, 2023 • edited Loading

rphair commented Jul 30, 2024

GeorgeFlerovsky commented Mar 15, 2022 •

edited

Loading

GeorgeFlerovsky commented Mar 15, 2022 •

edited

Loading

rphair commented Dec 6, 2022 •

edited by KtorZ

Loading

rphair commented Jul 1, 2023 •

edited

Loading

WhatisRT commented Jul 3, 2023 •

edited

Loading