CHIP-2021-05-vm-limits: Targeted Virtual Machine Limits

    Title: Targeted Virtual Machine Limits
    Type: Standards
    Layer: Consensus
    Maintainer: Jason Dreyzehner
    Status: Draft
    Initial Publication Date: 2021-05-12
    Latest Revision Date: 2024-02-02

Summary

This proposal replaces several poorly-targeted virtual machine (VM) limits with alternatives that protect against the same malicious cases, while significantly increasing the power of the Bitcoin Cash contract system:

The 201 operation limit is removed.
The 520-byte stack element length limit is raised to 10,000 bytes, a constant equal to the consensus-maximum VM bytecode length (A.K.A. MAX_SCRIPT_SIZE) prior to this proposal.
A new cumulative hashing limit is introduced, limiting contracts to 1000 digest iterations per evaluation, the effective limit prior to this proposal.
A new stack memory usage limit is introduced, limiting contracts to 130,000 bytes of data on the stack, a constant rounded up from the effective limit prior to this proposal.

This proposal intentionally avoids modifying other existing properties of the VM:

The limits on signature operation count (A.K.A. SigChecks) density, maximum standard input bytecode length (A.K.A. MAX_TX_IN_SCRIPT_SIG_SIZE – 1,650 bytes), consensus-maximum VM bytecode length (A.K.A. MAX_SCRIPT_SIZE – 10,000 bytes), maximum standard transaction byte-length (A.K.A. MAX_STANDARD_TX_SIZE – 100,000 bytes) and consensus-maximum transaction byte-length (A.K.A. MAX_TX_SIZE – 1,000,000 bytes) are not modified.
The cost and incentives around blockchain “data storage” are not measurably affected.
The worst-case transaction validation processing and memory requirements of the VM are not measurably affected.

Deployment

Deployment of this specification is proposed for the May 2025 upgrade.

Activation is proposed for 1731672000 MTP, (2024-11-15T12:00:00.000Z) on chipnet.
Activation is proposed for 1747310400 MTP, (2025-05-15T12:00:00.000Z) on the BCH network (mainnet), testnet3, testnet4, and scalenet.

Motivation

Bitcoin Cash contracts are strictly limited to prevent maliciously-designed transactions from requiring excessive resources during transaction validation. Two of these limits are poorly-targeted, unintentionally preventing valuable use cases.

The 520-byte stack element length limit (A.K.A. MAX_SCRIPT_ELEMENT_SIZE) currently prevents items longer than 520 bytes from being pushed, OP_NUM2BINed, or OP_CATed on to the stack. This also limits the length of Pay-To-Script-Hash (P2SH) redeem bytecode to 520 bytes.
The 201 operation limit prevents contracts with more than 201 non-push operations from being relayed on the network or mined in blocks.

Together, these limits:

Cap maximum stack memory usage to ~130,000 bytes, and
Cap the maximum processing requirements of the VM to 1000 digest iterations of SHA-256 hashing.

While these limits have been sufficient to prevent Denial of Service (DOS) attacks by capping the maximum cost of transaction validation, their current design prevents valuable contract use cases and wastefully increases the size of certain transactions.

Benefits

By replacing these limits with better-tuned alternatives, the Bitcoin Cash contract system can be made more powerful and efficient, without sacrificing node validation performance.

More Advanced Contracts

The 520-byte stack element length limit prevents additional use cases by limiting the length of Pay-To-Script-Hash (P2SH) contracts, as P2SH redeem bytecode is pushed to the stack prior to evaluation. Additionally, the element length limit also prevents the use of larger hash preimages, e.g. in contracts which inspect parent transactions or utilize OP_CHECKDATASIG.

By raising this limit, more advanced contracts can be supported.

More Efficient Contracts

The 520-byte stack element length limit sometimes requires contract authors to design less byte-efficient contracts in order to fit contract code into 520 bytes. For example, rather than embedding data elements directly in P2SH redeem bytecode, authors may be required to pick and validate data from the unlocking bytecode, wasting transaction space.

Likewise, both the stack element length limit and the 201 operation limit sometimes require contract authors to offload validation to additional outputs, increasing transaction sizes with otherwise-unnecessary overhead.

With better-targeted VM limits, many contracts can be made more efficient.

Costs & Risk Mitigation

The following costs and risks have been assessed.

Modification to Transaction Validation

Modifications to VM limits have the potential to increase worst-case transaction validation costs and expose VM implementations to Denial of Service (DOS) attacks.

Mitigations: this proposal includes a series of benchmarks which measure worst-case validation performance and memory usage for contracts before and after the proposed upgrade. The newly proposed limits have been chosen to target performance and memory usage equivalent to the existing limits.

Node Upgrade Costs

This proposal affects consensus – all fully-validating node software must implement these VM changes to remain in consensus.

These VM changes are backwards-compatible: all past and currently-possible transactions remain valid under these new rules.

Ecosystem Upgrade Costs

Because this proposal only affects internals of the VM, standard wallets, block explorers, and other services will not require software modifications for these changes. Only software which offers emulation of VM evaluation (e.g. Bitauth IDE) will be required to upgrade.

Wallets and other services may also upgrade to add support for new contracts which will become possible after deployment of this proposal.

Technical Specification

The existing Stack Element Size Limit is raised, the Operation Limit is removed, and two new limits are introduced: a Hashing Limit and a Stack Memory Usage Limit.

Increased Stack Element Length Limit

The existing 520-byte stack element size limit (A.K.A. MAX_SCRIPT_ELEMENT_SIZE) is raised to 10,000 bytes.

Note on Increase of Stack Element Length Limit

This increases the maximum size of stack elements to be equal to the maximum allowed VM bytecode length (A.K.A. MAX_SCRIPT_SIZE – 10,000 bytes). The maximum standard input bytecode length (A.K.A. MAX_TX_IN_SCRIPT_SIG_SIZE – 1,650 bytes) is unchanged.

Removal of Operation Limit

The existing 201-operation limit per evaluation (A.K.A. MAX_OPS_PER_SCRIPT) is removed.

Note on Replacement of Operation Limit

After activation of this proposal, operation count no longer requires tracking in the VM. The below Hashing Limit and Stack Memory Usage Limit cover all pathological constructions which might be prevented by the operation limit.

Hashing Limit

A new limit is placed on OP_RIPEMD160 (0xa6), OP_SHA1 (0xa7), OP_SHA256 (0xa8), OP_HASH160 (0xa9), OP_HASH256 (0xaa), OP_CHECKDATASIG (0xba), and OP_CHECKDATASIGVERIFY (0xbb) to prevent excessive hashing function usage.

Before any hashing function is called, the expected cost of the call – in terms of digest iterations – is added to a cumulative total for the evaluation. If the total required digest iterations exceeds 1000, the operation produces an error.

Selection of 1000 Digest Iteration Maximum

At a Maximum Digest Iteration of 1000, this Hashing Limit replaces the combined effect of the existing Stack Element Length Limit and Operation Limit.

Prior to deployment of this proposal, the most hashing-intensive (standard P2SH) contract is the Pre-Deployment Hashing Limit Benchmark, which requires 2 operations to produce 10 unique (memoization-resistant) SHA-256 digest iterations:

Unlock:
<1>

Lock:
<0> <520> OP_NUM2BIN OP_HASH256       // bytecode:   0x0002080280aa
<520> OP_NUM2BIN OP_HASH256 [...99x]  // bytecode:   0x02080280aa
OP_DROP

This contract requires exactly 201 operations and a 506 byte unlocking bytecode (including a 502 byte push of the P2SH redeem bytecode). In each line, the first round of SHA-256 requires 9 digest iterations, and the second round of SHA-256 operates on the 32-byte result of the first, requiring one additional digest iteration. At a total of 100 lines, this contract requires exactly 1000 digest iterations.

Digest Iteration Count

The hashing limit caps the count of message blocks which may be processed by any hashing function within a single evaluation. This places an upper limit on the sum of bytes hashed (including padding).

Given a message length, digest iterations may be calculated with the following C function:

int digest_iterations (int message_length) {
  return 1 + ((message_length + 8) / 64);
}

Calculate Digest Iterations in JavaScript

const digestIterations = (messageLength) =>
  1 + (((messageLength + 8) / 64) | 0);

Digest Iteration Count Test Vectors

Message Length (Bytes)	Digest Iterations
0	1
1	1
55	1
56	2
64	2
119	2
120	3
183	3
184	4
247	5
248	5
503	8
504	9
1015	16
1016	17
63928	1000
63991	1000
63992	1001

Note on 64-Byte Message Block Size

Each VM-supported hashing algorithm – RIPEMD-160, SHA-1, and SHA-256 – uses a Merkle–Damgård construction with a block size of 512 bits (64 bytes), so the number of message blocks/digest iterations required for every message size is equal among all VM-supported hashing functions.

Hashing Operations

Prior to calling their respective hashing functions, the OP_RIPEMD160 (0xa6), OP_SHA1 (0xa7), OP_SHA256 (0xa8), OP_HASH160 (0xa9), and OP_HASH256 (0xaa) operations must compute the expected digest iterations for the length of the message to be hashed, adding to the result to the evaluation's count of total digest iterations. If the new total exceeds the limit, the operation produces an error.

Note on Two-Round Hashing Operations

The two-round hashing operations – OP_HASH160 (0xa9) and OP_HASH256 (0xaa) – pass the 32-byte result of their initial SHA-256 hashing round into their second round, requiring one additional digest iteration beyond the single-round OP_SHA256.

Stack Memory Usage Limit

A new limit is placed on all operations which push or modify elements on the stack: after each operation, the total cumulative byte-length of all stack elements may not exceed 130,000 bytes.

Selection of 130,000 Byte Stack Memory Usage Limit

Prior to the this proposal, stack memory usage is limited by a combination of the (520-byte) stack element length limit and the (201) operation limit. The following contract demonstrates the largest possible (standard P2SH) memory consumption:

OP_1
<520-byte push>
<520-byte push>
<201-byte redeem bytecode>
--- // 201-byte P2SH lock:
OP_2DUP OP_3DUP OP_3DUP     // 10x 520-byte elements on the stack
OP_3DUP [...x77]            // 241x 520-byte elements on the stack
OP_2DROP [...x120]
OP_DROP                     // `0x01` on stack

At peak, this contract requires 125,320 bytes of memory to hold the raw bytecode contents of 242 stack items. This value is rounded up to the 130,000 constant to avoid invalidating any possible unspent P2SH transaction outputs.

Computing Stack Memory Usage

For consensus purposes, stack memory usage is the total byte-length of all stack element contents, without regard for the depth of the stack. (The existing 1000-item stack depth limit remains in place.)

For example, a stack of ['', '0102', '030405'] contains 3 elements: element 0 is empty, element 1 contains two bytes (0x0102), and element 2 contains three bytes (0x030405). The memory usage of this stack is 5 bytes: zero bytes for element 0, 2 bytes for element 1, and 3 bytes for element 2.

Implementation Note

Computing stack memory usage can become expensive in poorly-optimized stack implementations. To avoid Denial of Service (DOS) vulnerabilities, implementations should ensure that their worst-case stack memory usage computation remains insignificant when compared with worst-case hashing performance.

Benchmarks

The following contracts are provided for testing worst-case implementation performance. For interactive examples, see this template in Bitauth IDE.

Pre-Deployment Hashing Limit Benchmark

This benchmark measures the worst-known standard transaction validation performance prior to deployment of this specification. This contract can be evaluated by spending a standard P2SH output, and does not require miner participation. (See Exclusion of Additional Relay Policy Limits.)

Unlock:
<1>

Lock:
<0> <520> OP_NUM2BIN OP_HASH256  // bytecode:   0x0002080280aa
<520> OP_NUM2BIN OP_HASH256      // bytecode:   0x02080280aa
...
<520> OP_NUM2BIN OP_HASH256      // bytecode:   0x02080280aa
OP_DROP

Post-Deployment Benchmarks

To remove the operation limit without affecting the worst-case transaction validation performance status quo, the following benchmarks must be validated faster than Pre-Deployment Hashing Limit Benchmark.

Post-Deployment Hashing Limit Benchmark

This is the post-deployment equivalent of Pre-Deployment Hashing Limit Benchmark:

<0> <10000> OP_NUM2BIN OP_HASH256
<10000> OP_NUM2BIN OP_HASH256
<10000> OP_NUM2BIN OP_HASH256
<10000> OP_NUM2BIN OP_HASH256
<10000> OP_NUM2BIN OP_HASH256
<10000> OP_NUM2BIN OP_HASH256
<3703> OP_NUM2BIN OP_HASH256
OP_DROP

Benchmark: Stack Memory Check at Maximum Depth

This benchmark ensures that validating stack memory usage (for the Stack Memory Usage Limit) does not consume excessive resources:

OP_1 OP_DUP OP_2DUP OP_3DUP OP_3DUP      // 10 `0x01`s on the stack
OP_3DUP [...x330]                        // 1000 `0x01`s on the stack
OP_1ADD [...x9167]
OP_2DROP [...x499]
OP_DROP                                  // one `0x01` on stack

Benchmark: Excessive OP_IF

This benchmark ensures stack implementations remain efficient with abusive use of OP_IF after removal of the operation limit:

OP_1 OP_IF [...x2000]
OP_1 [...x1000]
OP_1 OP_ENDIF [...x2000]

Note: an O(1) Execution Stack has been implemented in a derivative of the Satoshi client.

Benchmark: Excessive OP_ROLL

This benchmark ensures stack implementations remain efficient with abusive use of OP_ROLL after removal of the operation limit:

OP_1 OP_DUP OP_2DUP OP_3DUP OP_3DUP      // 10 `0x01`s on the stack
OP_3DUP [...x330]                        // 1000 `0x01`s on the stack
<999> OP_ROLL [...x9167]
OP_2DROP [...x499]
OP_DROP                                  // one `0x01` on stack

Rationale

This section documents design decisions made in this specification.

Hashing Limit of 1000 Digest Iterations

In this proposal, the precise value of 1000 digest iterations has been selected to simplify review – at this value, the new limit does not increase the worst-case computational cost of validating maliciously-designed transactions.

This limit is likely also adequate for highly-complex public covenants: verifying a leaf replacement within a merkle tree requires 2 to 3 digest iterations per tree level; contracts could easily inspect many parent transactions and large merkle proofs within the same evaluation without reaching the 1000 digest limit. (Note: in practice, such contracts will remain limited by other factors, particularly the 1,650-byte maximum unlocking bytecode length, A.K.A. MAX_TX_IN_SCRIPT_SIG_SIZE.)

Hashing Limit by Digest Iterations

One possible alternative to the proposed Hashing Limit design is to limit evaluations to a fixed count of "bytes hashed" (rather than digest iterations). While this seems simpler, it performs poorly in an adversarial environment: an attacker can choose message sizes to minimize bytes hashed while maximizing digest iterations required (e.g. 1-byte and 56-byte messages).

For non-memoized VM implementations, a "bytes hashed" limit must be as low as 1000 bytes to offer protection equivalent to the 1000 digest iteration limit. However, by directly measuring and limiting digest iterations, non-malicious contracts can be allowed a limit of 63,991 bytes hashed (efficiently) without increasing the worst-case validation requirements of the VM as it existed prior to this proposal.

Exclusion of Additional Relay Policy Limits

Note that worst-case block validation times have less impact on Bitcoin Cash than on other cryptocurrency networks: if most miners validate transactions as they are received (and most blocks clear the entire "mempool"), most standard transactions in a block must have been previously validated by other miners, and the new block can be validated quickly (without revalidating previously-seen transactions). Worst-case transaction validation times are only a concern for miners intentionally mining non-standard transactions – if most miners receive and begin building on a competing block before a miner's slow-to-validate block, the miner of the slow-to-validate block loses revenue – creating a natural disincentive for miners to produce such hard-to-validate blocks.

By allowing larger stack elements, this proposal increases the efficiency of a hashing-limited transaction (Post-Deployment Hashing Limit Benchmark), increasing the approximate validation time for 32MB worst-case blocks from ~0.19s to ~1.29s on 2018 consumer CPUs¹. However, hashing-limited transactions remain an ineffective DOS strategy – in practice they are equivalent to simple block-stuffing DOS attacks (for which the existing mitigation is transaction fees and large-enough blocks). No further relay policy limits are necessary in the context of this proposal.

Calculations

The maximum-efficiency transaction employing the Pre-Deployment Hashing Limit Benchmark has 176 inputs: version (4 bytes), input count (1 byte, 0xb2), 182 ✖️ { outpoint (36 bytes), unlocking bytecode length (3 bytes, 0xfdfa01), unlocking bytecode (506 bytes), sequence number (4 bytes)}, output count (1 byte, 0x01), value (8 bytes), locking bytecode length (1 byte 0x01), locking bytecode (1 byte, OP_RETURN), locktime (4 bytes). Total transaction size: 4 + 1 + 182 * (36 + 3 + 523 + 4) + 1 + 8 + 1 + 1 + 4 = 99,636 bytes. Total digest iterations: 182 * 1000 = 182,000 digest iterations. 182000/99636 ~= 1.827; attack costs 1 byte per ~1.827 SHA-256 digest iterations.

The maximum-efficiency transaction employing the Post-Deployment Hashing Limit Benchmark has 1249 inputs: version (4 bytes), input count (3 bytes), 884x{ outpoint (36 bytes), unlocking bytecode length (1 byte, 0x27), unlocking bytecode (39 bytes), sequence number (4 bytes) }, output count (1 byte, 0x01), value (8 bytes), locking bytecode length (1 byte 0x01), locking bytecode (1 byte, OP_RETURN), locktime (4 bytes). Total transaction size: 4 + 1 + 1249 * (36 + 1 + 72 + 4) + 1 + 8 + 1 + 1 + 4 = 99,940 bytes. Total digest iterations: 1249 * 1000 = 1,249,000 digest iterations. 1249000/99912 ~= 12.501; attack costs 1 byte per ~12.501 SHA-256 digest iterations.

According to AnandTech's CPU 2021 Benchmarks, most CPUs sold since 2015 perform SHA-256 (Linux, OpenSSL, multi-threaded) at over 2000 MB/s for 8K message blocks: 2000 * 1000000 / 8000 * 126 iterations = 31,500,000 digest iterations per second. The AMD Ryzen 5 2600 (a top-selling consumer CPU of 2018) performed at 19675 MB/s for the same benchmark: 19675 * 1000000 / 8000 * 126 iterations = 309,881,250 digest iterations per second.

At a Bitcoin Cash block size of 32MB, a full block of Pre-Deployment Hashing Limit Benchmark transactions will include 321 transactions requiring 58,422,000 digest iterations (equivalent to hashing a single ~3,739 MB message); approx. ~1.85s on 2015 CPUs, ~0.19s on 2018 CPUs (58,422,000 iterations / 31,500,000 iterations per second = ~1.85s, 58,422,000 iterations / 309,881,250 iterations per second = ~0.19s).

At a Bitcoin Cash block size of 32MB, a full block of Post-Deployment Hashing Limit Benchmark transactions will include 320 transactions requiring 399,680,000 digest iterations (equivalent to hashing a single ~25,580 MB message); approx. ~12.69s on 2015 CPUs, ~1.29s on 2018 CPUs (399,680,000 iterations / 31,500,000 iterations per second = ~12.69s, 399,680,000 iterations / 309,881,250 iterations per second = ~1.29s).

Implementations

Please see the following reference implementations for additional examples and test vectors:

[TODO: after initial public feedback]

Stakeholders & Statements

[TODO]

Feedback & Reviews

Changelog

This section summarizes the evolution of this document.

Draft v2.0.0
- Simplify stack memory limit calculation (#6)
- Correct hashing benchmarks, update hashing limit (#6)
- Propose for May 2025 Upgrade
v1.0.0 – 2021-05-12 (5b24b0ec)
- Initial publication

Copyright

This document is placed in the public domain.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
readme.md		readme.md

bitjson/bch-vm-limits

Folders and files

Latest commit

History

readme.md