Add chapter about payment batching

bitcoinops · Mar 1, 2019 · dfa8d91 · dfa8d91
1 parent 3a722bd
commit dfa8d91
Showing 6 changed files with 375 additions and 0 deletions.
diff --git a/x.payment_batching/img/batch-screenshot.png b/x.payment_batching/img/batch-screenshot.png
diff --git a/x.payment_batching/img/batching.plot b/x.payment_batching/img/batching.plot
@@ -0,0 +1,118 @@
+#!/usr/bin/gnuplot -p
+
+set style line 1 lc rgb '#8b1a0e' pt 6 ps 0.1 lt 1 lw 2
+set style line 2 lc rgb '#5e9c36' pt 6 ps 0.1 lt 1 lw 2
+set style line 3 lc rgb '#0025ad' pt 6 ps 0.1 lt 1 lw 2
+set style line 4 lc rgb '#9400d3' pt 6 ps 0.1 lt 1 lw 2
+set style line 5 lc rgb '#d95319' pt 6 ps 0.1 lt 1 lw 2
+set style line 6 lc rgb '#edb120' pt 6 ps 1 lt 1 lw 2
+set style line 7 lc rgb '#4dbeee' pt 6 ps 1 lt 1 lw 2
+set style line 8 lc rgb '#9400d3' pt 6 ps 1 lt 1 lw 2
+set style line 9 lc rgb '#d3d3d3' pt 6 ps 1 lt 1 lw 2
+set style line 10 lc rgb '#808080' pt 1 ps 1 lt 1 lw 2
+set style line 11 lc rgb '#808080' lt 1
+set style line 12 lc rgb '#808080' lt 0 lw 1
+set style line 12 lc rgb '#e0e0e0' lt 0 lw 1
+
+set terminal pngcairo size 800,200 font "Sans,12"
+
+set grid
+set tics nomirror
+unset border
+unset key
+set samples 1000
+
+savings(original, alternative) = (1 - (alternative / original))*100
+
+## Deconstructed transaction for confirming vbyte calculations
+# Weight x4
+# |  Weight x1
+# |  |  Serialized bytes & description
+# |  |  | ## Version and witness flag
+# | 4|  | 01000000 ... version
+# |  | 2| 0001 ... flag
+#
+# |  |  | ## Inputs
+# | 1|  | 01 ... num inputs
+# |36|  | 95109ede0d9c9841eb3a7206b0bfdcfeda563199110e1cb0b156a442333bf5eb00000000 ... outpoint
+# | 1|  | 00 ... scriptSig len
+# | 4|  | ffffffff ... nSequence
+#
+# |  |  | ## Outputs
+# | 1|  | 02 ... num outputs
+# |  |  | ## P2PKH output
+# | 8|  | e067350000000000 ... nAmount
+# | 1|  | 19 ... scriptPubKey len
+# |25|  | 76a91498471635b0ef4bc198746f43993b0dfaa3bcb7d688ac ... scriptPubKey
+# |  |  | ## P2WPKH output
+# | 8|  | ea44375100000000 ... nAmount
+# | 1|  | 16 ... scriptPubKey len
+# |22|  | 00147b98a381e0347c3dff2802fd27c48ce4b27f969d ... scriptPubKey
+#
+# |  |  | ## Witnesses
+# |  | 1| 02 ... num witness elements
+# |  | 1| 47 ... element #1 len
+# |  |71| 30440220300c83b4f1bd73b233646efd
+#       | 5169d9b0d000f3a58ff9f7184d336366
+#       | d6af9da7022069d919552ca44a61708f
+#       | 1e2caba6d1f042613ea42eebb2c8354e
+#       | 7986256962c501 ... element #1 (sig)
+# |  | 1| 21 ... element #2 len
+# |  |33| 0216e225529e9b107ce4c2009779a194
+#       | 88acff26b65db3bc14871229bb786ecc3f ... element #2 (pubkey)
+#
+# | 4|  | 00000000 ... nLockTime
+
+# Input: outpoint + scriptSig_size + nSequence + (witness_elements_count + size + signature + size + pubkey)/4
+p2wpkh_input = 36 + 1 + 4
+# Input witness (size + signature + size + pubkey)
+p2wpkh_input_witness = (1 + 71 + 1 + 33)/4.0
+# Output: nValue + size + scriptPubKey(OP_0 OP_PUSH20 <hash160>)
+p2wpkh_output = 8 + 1 + (1 + 1 + 20)
+# Tx: nVersion + witness(marker + flag) + input_count + inputs*n + output_count + outputs*n + (witness_element_count + witness)*inputs + nLockTime
+p2wpkh_vbytes(inputs, outputs) = 4 + 2/4.0 + 1 + p2wpkh_input*inputs + 1 + p2wpkh_output*outputs + (1/4.0 + p2wpkh_input_witness)*inputs + 4
+# Repeated single-payment txes for comparison
+unbatched_payments(inputs, repeats) = p2wpkh_vbytes(inputs, 2)*repeats
+
+######################
+## Best-case vbytes ##
+######################
+set output './p2wpkh-batching-best-case.png'
+set xtics (1,2,3,4,5,10,15,20,25)
+set ytics 50
+set ylabel "Vbytes per\npayment"
+set xlabel "Number of payments"
+# x is the number of payments; the number of outputs is x + 1 change output
+plot [1:25] p2wpkh_vbytes(1, x+1)/x ls 1
+
+########################
+## Savings comparison ##
+########################
+set output './p2wpkh-batching-cases-combined.png'
+unset yrange
+set ytics 20
+set ylabel "Savings"
+set format y "%.0f%%"
+set label 1 "Best case: 1 input, x payments" at 10,82 textcolor ls 1
+#> we can imagine [...] requiring at least 10 inputs for
+#> every output added.
+set label 2 "Unoptimized typical case: x inputs, x payments" at 10,35 textcolor ls 2
+
+plot [1:25] savings(unbatched_payments(1, x)/x, p2wpkh_vbytes(1, x+1)/x) ls 1 \
+    , savings(unbatched_payments(1, x)/x, p2wpkh_vbytes(x, x+1)/x) ls 2 \
+
+###########################
+## Consolidation savings ##
+###########################
+set output './p2wpkh-batching-after-consolidation.png'
+## Cost to consolidate 100 inputs at 20% the normal spend feerate
+consolidation100 = p2wpkh_vbytes(100, 1)*0.2
+
+set label 1 "Best case" at 20,84 textcolor ls 1
+set label 2 "Optimized typical case" at 10,50 textcolor ls 2
+
+## 1. best case: we already have large inputs.
+## 2. one consolidated input pays for 100 payments
+plot [1:25] savings(unbatched_payments(1, x)/x, p2wpkh_vbytes(1, x+1)/x) ls 1 \
+    , savings(unbatched_payments(1, x)/x, (x*(consolidation100/100) + p2wpkh_vbytes(1, x+1))/x) ls 2 \
+
diff --git a/x.payment_batching/img/p2wpkh-batching-after-consolidation.png b/x.payment_batching/img/p2wpkh-batching-after-consolidation.png
diff --git a/x.payment_batching/img/p2wpkh-batching-best-case.png b/x.payment_batching/img/p2wpkh-batching-best-case.png
diff --git a/x.payment_batching/img/p2wpkh-batching-cases-combined.png b/x.payment_batching/img/p2wpkh-batching-cases-combined.png
diff --git a/x.payment_batching/payment_batching.md b/x.payment_batching/payment_batching.md
@@ -0,0 +1,257 @@
+# Payment Batching
+
+This chapter describes how
+high-frequency spenders can use the scaling technique of *payment
+batching* to reduce transaction sizes and fees by about 75% in
+practical situations.
+As of February 2019, payment batching is used by multiple popular
+Bitcoin services (mainly exchanges), is available as a built-in feature
+of many wallets (including Bitcoin Core), and should be easy to
+implement in custom wallets and payment-sending solutions.  On the
+downside, use of the technique can lead to temporary unexpected behavior
+for the receivers of payments, a possible inability to fee bump, and a
+possible reduction in privacy.
+
+## Transaction size per receiver
+
+A typical Bitcoin transaction using P2WPKH inputs and outputs contains
+one input from the spender of about 67 vbytes and two outputs of about
+31 vbytes each, one to the receiver and one as change back to the
+spender.  An additional 11 vbytes are used for transaction overhead
+(version, locktime, and other fields).
+
+![Best-case P2WPKH vbytes per payment](img/p2wpkh-batching-best-case.png)
+
+If we add just 4 more receivers, including an additional 31 vbyte output
+for each one of them, but otherwise keep the transaction the same, the
+total size of the transaction becomes 264 vbytes.  Whereas the previous
+transaction used all 140 vbytes to pay a single receiver, the batched
+transaction uses only about 53 vbytes per receiver---a bit over 60%
+savings per payment.
+
+Extrapolating this simple best-case situation, we see that the number of
+vbytes used per receiver asymptotically approaches the size of a single
+output.  This makes the maximum savings possible a bit over 75%.
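
The arithmetic above can be reproduced with a short Python sketch. It mirrors the vbyte model used for this chapter's plots and assumes a 71-byte signature and input/output counts that fit in a single byte; the function names are illustrative only.

```python
# Simplified vbyte model for P2WPKH transactions (witness bytes count 1/4).
P2WPKH_INPUT = 36 + 1 + 4                   # outpoint + scriptSig length + nSequence
P2WPKH_WITNESS = (1 + 1 + 71 + 1 + 33) / 4  # item count, sig length, sig, key length, pubkey
P2WPKH_OUTPUT = 8 + 1 + 22                  # amount + script length + scriptPubKey
OVERHEAD = 4 + 2 / 4 + 1 + 1 + 4            # version, marker+flag, two counts, locktime


def p2wpkh_vbytes(inputs: int, outputs: int) -> float:
    """Approximate size in vbytes of a transaction with the given shape."""
    return OVERHEAD + (P2WPKH_INPUT + P2WPKH_WITNESS) * inputs + P2WPKH_OUTPUT * outputs


def vbytes_per_payment(payments: int, inputs: int = 1) -> float:
    """One output per payment plus a single change output."""
    return p2wpkh_vbytes(inputs, payments + 1) / payments


def savings_percent(payments: int, inputs: int = 1) -> float:
    """Savings per payment compared to sending each payment on its own."""
    unbatched = p2wpkh_vbytes(1, 2)
    return (1 - vbytes_per_payment(payments, inputs) / unbatched) * 100


if __name__ == "__main__":
    for n in (1, 5, 25, 1000):
        print(n, round(vbytes_per_payment(n), 1), round(savings_percent(n), 1))
```

With five payments, the model gives about 53 vbytes per payment. As the number of payments grows, the per-payment size approaches the 31 vbytes of a single output.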
+
+![Saving rates for best and typical cases of payment batching](img/p2wpkh-batching-cases-combined.png)
+
+Realistically, the more a transaction spends, the more likely it is to
+need additional inputs.  This doesn't prevent payment batching from
+being useful, although it does reduce its effectiveness.  For example,
+we expect a typical service to
+receive payments of about the same value as the payments they make, so
+for every output they add, they need to add one input on average.
+Savings in this typical case peak at about 30%.
+
+Services that find themselves frequently using more than one input per
+transaction may be able to increase their savings using a two-step
+procedure.  In the first step, multiple small inputs are
+[consolidated][chapter consolidation] into a single larger input using
+slow (but low-feerate) transactions that spend the service's money back
+to itself.  In the second step, the service spends from one of its
+consolidated inputs using payment batching and achieves the best-case
+efficiency described above.
+
+If we assume that consolidation transactions will pay only 20% of the
+feerate of a normal transaction and will consolidate 100 inputs at a
+time, we can calculate the savings of using the two-step procedure for
+our one input per output scenario above (while showing, for comparison,
+the simple best-case scenario of already having a large input available).
+
+![Saving rates for best and typical cases of payment batching after consolidation](img/p2wpkh-batching-after-consolidation.png)
+
+For the typical case,
+consolidation loses efficiency when only making a single payment,
+but when actually batching, it performs almost as well as the best case
+scenario.
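
As a rough check of the comparison above, the following sketch amortizes the cost of a 100-input consolidation over the payments it later funds. The consolidation cost is expressed in "normal feerate" vbytes by scaling it to 20%, as in the text. The helper names and the simplified size model (71-byte signatures, single-byte counts) are assumptions made for illustration.

```python
def p2wpkh_vbytes(inputs: int, outputs: int) -> float:
    """Simplified P2WPKH size model: 10.5 overhead, 67.75 per input, 31 per output."""
    return 10.5 + 67.75 * inputs + 31 * outputs


UNBATCHED = p2wpkh_vbytes(1, 2)  # one payment plus change


def savings(per_payment_vbytes: float) -> float:
    return (1 - per_payment_vbytes / UNBATCHED) * 100


def unoptimized(payments: int) -> float:
    """Typical case without consolidation: one input needed per payment."""
    return p2wpkh_vbytes(payments, payments + 1) / payments


def consolidated(payments: int, batch_inputs: int = 100, feerate_ratio: float = 0.2) -> float:
    """Two-step case: each payment first 'uses up' one cheaply consolidated input."""
    consolidation_cost = p2wpkh_vbytes(batch_inputs, 1) * feerate_ratio
    per_small_input = consolidation_cost / batch_inputs
    return (payments * per_small_input + p2wpkh_vbytes(1, payments + 1)) / payments


if __name__ == "__main__":
    for n in (1, 10, 25):
        print(n, round(savings(unoptimized(n)), 1), round(savings(consolidated(n)), 1))
```

Under these assumptions a single payment after consolidation costs slightly more than an unbatched payment. A batch of 25 payments saves roughly 65% with consolidation, compared with just under 30% without it.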
+
+In addition to directly providing fee savings, payment
+batching also uses limited block space more efficiently by reducing the
+number of vbytes per payment.  This increases the available supply of
+block space and so, given constant demand, can make it more affordable.
+In that way, increased use of payment batching may lower the feerate for
+all Bitcoin users.
+
+In summary, payment batching provides significant savings for services
+that typically have inputs available that are 5 to 20 times larger than
+their typical output.  For services not in that position, the savings
+from batching alone are smaller but perhaps still worth the effort;
+if the services are willing to also pre-consolidate their inputs, the
+savings can be quite dramatic.
+
+Note: the figures and plots above all assume use of P2WPKH inputs and
+outputs.  We expect that to become the dominant script type on the
+network in the future (until something better comes along).  However, if
+you use a different script type (P2PKH, or multisig using P2SH or
+P2WSH), the number of vbytes used to spend them is even larger, so the
+savings rate will be higher.
+
+## Concerns
+
+The fee-reduction benefits of payment batching do create tradeoffs and
+concerns that will need to be addressed by any service using the
+technique.
+
+### Delays
+
+This is the primary concern with payment batching.  Although some
+situations naturally lend themselves to payment batching (e.g. a mining
+pool paying hashrate providers in a block the pool mined), many
+services primarily send money to users when those users make a
+withdrawal request.  In order to batch payments, the service must get
+the user to accept that their payment will not be sent immediately---it
+will be held for some period of time and then combined with other
+withdrawal requests.
+
+Users will notice this delay because they won't receive a notification
+in their receiving wallet that an unconfirmed transaction is on the way
+until you send the batch containing their payment.  By delaying the
+sending of their payment, you also delay when it's confirmed (all other
+things being equal, such as feerates).
+
+To mitigate the problem of delays, you may allow the
+user to choose between an immediate payment and a delayed payment with
+a different fee provided for each option.  For example:
+
+    [X] Free withdrawal (payment sent within 6 hours)
+    [ ] Immediate withdrawal (withdrawal fee 0.123 mBTC)
+
+### Reduced privacy
+
+A second concern with payment batching is that it can make users feel
+like they have less privacy.  Every user you pay in the same transaction
+can reasonably assume that everyone else receiving an output from that
+transaction is being paid by you.  If you had sent separate
+transactions, any onchain relationship between the payments might be
+less apparent or even non-existent.
+
+![Screenshot of a possible transaction batch in a block explorer](img/batch-screenshot.png)
+
+Note that transactions belonging to particular Bitcoin services are
+often identifiable by experts even if they don't use payment
+batching, so batching doesn't necessarily cause a reduction in privacy
+for those cases.
+
+It may be possible to partially mitigate this problem by sending batched
+payments in a coinjoin transaction created with other users.  Depending
+on the technique used, this would not necessarily reduce the efficiency
+of batching and could provide significantly enhanced privacy.  However,
+naive implementations of coinjoin previously provided by Bitcoin
+services have had [flaws][coinjoin sudoku] that prevented them from
+providing significant privacy advantages.  As of February 2019, no
+currently-available coinjoin implementation is fully compatible with the
+needs of payment batching.
+
+### Possible inability to fee bump
+
+A final concern is that you may not be able to fee bump a batched
+payment.  Transaction relay nodes such as Bitcoin Core impose limits on
+the transactions they relay to prevent attackers from wasting bandwidth,
+CPU, and other node resources.  By yourself, you can easily avoid
+reaching these limits, but the receivers of the payments you send can
+respend their outputs in child transactions that become part of the
+transaction group containing your transaction.
+
+The closer a transaction group gets to one of these limits, the less likely
+you'll be able to fee bump your transaction using either
+Child-Pays-for-Parent (CPFP) fee bumping or Replace-by-Fee (RBF) fee
+bumping.  In addition, the more unconfirmed children a transaction has,
+the more RBF fee bumping will cost as you'll have to pay for both the
+increased feerate of your transaction as well as for all the potential
+fees lost to miners when they remove any child transactions in order
+to accept your replacement.
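
To illustrate why unconfirmed children make RBF more expensive, here is a minimal sketch of the fee floor implied by the BIP125 replacement rules. It assumes the replacement must pay at least the total fees of everything it evicts plus an incremental relay feerate of 1 sat/vbyte on its own size. The function name and example values are hypothetical.

```python
import math


def min_replacement_fee(original_fee_sat: int,
                        descendant_fees_sat: list,
                        replacement_vsize: int,
                        incremental_relay_feerate: float = 1.0) -> int:
    """Lowest absolute fee (in satoshis) a replacement would need to pay.

    The replacement pays for the fees of the original transaction and of all
    unconfirmed descendants it evicts, plus relay cost for its own size.
    """
    evicted_fees = original_fee_sat + sum(descendant_fees_sat)
    relay_cost = math.ceil(incremental_relay_feerate * replacement_vsize)
    return evicted_fees + relay_cost


if __name__ == "__main__":
    # A 264-vbyte batch that paid 2,640 sat; three receivers each respent
    # their output in a child transaction paying 1,500 sat.
    print(min_replacement_fee(2640, [1500, 1500, 1500], 264))
```

Each additional child raises the floor by that child's full fee, regardless of the feerate you actually wanted for your own transaction.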
+
+Note that these problems are not unique to batched payments---independent
+payments can have the same problem.  However, if an independent payment
+can't be fee bumped because the independent receiver spent their output,
+only that user is affected.  But if a single receiver of a batched
+payment spends their output to the point where fee bumping becomes
+impossible, all the other receivers of that transaction are also affected.
+
+As of Bitcoin Core 0.18 (April 2019), the limits are¹ that a
+group of related unconfirmed transactions may not exceed 101,000 vbytes
+in size, have more than 25 unconfirmed ancestors, or have more than 25
+descendants.  This size limit restricts batches to a maximum size of
+about 3,000 outputs and the descendant limit is easily reached if just a
+tiny percentage of those receiving a large batch respend their unconfirmed
+outputs.  It's also easy for any of the receivers to deliberately create
+transactions that reach one of these limits and prevent fee bumping if
+they know that you're relying on that capability.
+
+## Implementation
+
+Payment batching is extremely easy with certain existing wallet
+implementations, such as Bitcoin Core's [sendmany][] RPC.  Check your software
+documentation for a function that allows you to send multiple payments.
+
+```bash
+bitcoin-cli sendmany "" '{
+  "bc1q5c2d2ue7x38hcw2ugk5q7y4ae7nw4r6vxcptu8": 0.1,
+  "bc1qztjzd7hpf2xmngr7zkgkxsvdqcv2jpyfgwgtsv": 0.2,
+  "bc1qsul9emtnz0kks939egx2ssa6xnjpsvgwq9chrw": 0.3
+}'
+```
+
+<!-- for max standard tx size: src/policy/policy.h:static const unsigned int MAX_STANDARD_TX_WEIGHT = 400000; -->
+
+If using your own implementation, you are probably already creating
+transactions with two outputs in most cases (a payment output and a
+change output), so it should be easy to add
+support for additional outputs.  The only notable consideration is that
+Bitcoin Core nodes (and most other nodes) will refuse to accept or relay
+transactions over 100,000 vbytes, so you should not attempt to send
+batched payments larger than this.
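
As a sanity check on that limit, the sketch below estimates how many P2WPKH outputs fit in a single-input transaction before it exceeds 400,000 weight units (100,000 vbytes). It assumes 71-byte signatures and P2WPKH for every input and output; the helper names are illustrative.

```python
MAX_STANDARD_TX_WEIGHT = 400_000  # 100,000 vbytes


def compact_size_len(n: int) -> int:
    """Bytes needed to serialize a count as a Bitcoin compact-size integer."""
    if n < 253:
        return 1
    if n <= 0xFFFF:
        return 3
    if n <= 0xFFFFFFFF:
        return 5
    return 9


def p2wpkh_weight(inputs: int, outputs: int) -> int:
    """Weight of an all-P2WPKH transaction (non-witness bytes count four times)."""
    base = (4 + compact_size_len(inputs) + 41 * inputs
            + compact_size_len(outputs) + 31 * outputs + 4)
    # marker + flag, then per input: item count, sig length, sig, key length, pubkey
    witness = 2 + (1 + 1 + 71 + 1 + 33) * inputs
    return 4 * base + witness


def max_outputs(inputs: int = 1) -> int:
    """Largest output count (including change) that stays within the limit."""
    n = 1
    while p2wpkh_weight(inputs, n + 1) <= MAX_STANDARD_TX_WEIGHT:
        n += 1
    return n


if __name__ == "__main__":
    print(max_outputs())
```

Under these assumptions the ceiling is a little over 3,200 outputs, one of which is the change output. Other script types or additional inputs lower it.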
+
+## Recommendations summary
+
+1. Try to create systems where your users and customers don't expect
+   their payments immediately but are willing to wait for some time
+   (the longer the better).
+
+2. Use low-feerate consolidations to keep some large inputs available
+   for spending.
+
+3. Within each time window, send all payments together in the same
+   transaction.  For example, create an hourly [cronjob][] that sends all pending payments.
+   Ideally, your prior consolidations should allow the
+   transaction to contain only a single input.
+
+4. Don't depend on being able to fee bump the batched payments.  This
+   means using a high-enough feerate on the initial transaction to
+   ensure it has a high probability of confirming within your desired
+   time window. For example, use the `CONSERVATIVE` mode of Bitcoin
+   Core's `estimatesmartfee` RPC.
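
A periodic job along the lines of recommendation 3 might look something like the following sketch. It only builds the `bitcoin-cli sendmany` command shown earlier and does not run it. The pending-payment list format and the function name are hypothetical. Amounts for a repeated address are merged because a JSON object can carry each address only once.

```python
import json
from collections import defaultdict
from decimal import Decimal


def build_sendmany_args(pending):
    """Turn queued (address, amount_btc) pairs into a sendmany command line.

    `pending` is a hypothetical queue of withdrawal requests collected during
    the current time window; amounts are strings to avoid float rounding.
    """
    totals = defaultdict(Decimal)
    for address, amount_btc in pending:
        totals[address] += Decimal(amount_btc)
    # Amounts are passed as fixed-point strings with 8 decimal places.
    outputs = {address: f"{total:.8f}" for address, total in totals.items()}
    return ["bitcoin-cli", "sendmany", "", json.dumps(outputs)]


if __name__ == "__main__":
    queue = [
        ("bc1q5c2d2ue7x38hcw2ugk5q7y4ae7nw4r6vxcptu8", "0.1"),
        ("bc1qztjzd7hpf2xmngr7zkgkxsvdqcv2jpyfgwgtsv", "0.2"),
        ("bc1q5c2d2ue7x38hcw2ugk5q7y4ae7nw4r6vxcptu8", "0.05"),
    ]
    print(build_sendmany_args(queue))
```

The resulting argument list could be handed to a process runner from the scheduled job. Fee selection, error handling, and clearing the queue after a successful send are left out of this sketch.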
+
+## Footnotes
+
+¹Optech believes that almost all nodes use the default Bitcoin
+Core policy for transaction group limits.  However, those defaults
+may change over time, so the example below provides a command that
+lists each limit option along with its current default
+value.
+
+```bash
+$ bitcoind -help-debug | grep -A3 -- -limit
+  -limitancestorcount=<n>
+       Do not accept transactions if number of in-mempool ancestors is <n> or
+       more (default: 25)
+
+  -limitancestorsize=<n>
+       Do not accept transactions whose size with all in-mempool ancestors
+       exceeds <n> kilobytes (default: 101)
+
+  -limitdescendantcount=<n>
+       Do not accept transactions if any ancestor would have <n> or more
+       in-mempool descendants (default: 25)
+
+  -limitdescendantsize=<n>
+       Do not accept transactions if any ancestor would have more than <n>
+       kilobytes of in-mempool descendants (default: 101).
+```
+
+
+[chapter consolidation]: #FIXME_not_written_yet
+[coinjoin sudoku]: http://www.coinjoinsudoku.com/
+[coin selection strategies]: #FIXME_not_written_yet
+[fee bumping]: ../1.fee_bumping/fee_bumping.md
+[cronjob]: https://en.wikipedia.org/wiki/Cronjob
+[sendmany]: https://bitcoincore.org/en/doc/0.17.0/rpc/wallet/sendmany/