Hash Cache #8259

NicolasDorier · 2016-06-24T21:00:51Z

Some notes about the implementation:

It calculates the three midstate hashes as soon as a segwit CheckSig operation happens, whatever the SigHash type.
It is possible for two different threads to calculate the midstate hashes of the same transaction, so the work may be done twice.

Benefits are:

hashcashes map access is limited so we don't have too much lock contention
Fewer conditional branches in consensus code
Simple to review

This commit is only meant to provide a cache that is simple to review and understand. It is probably possible to fix the first two points above, but the code overhead is not worth it when our goal is only to fix the O(n²) issue.
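The design described above can be sketched roughly as follows. This is a minimal illustration, not the code from this pull request: `Hash256` is a stand-in for `uint256`, and `HashCache`, `TryGet`, and `ExampleHashCache` are hypothetical names. It assumes the three hashes are computed together by the caller and stored once, so the mutex only ever guards the map.

```cpp
#include <array>
#include <cassert>
#include <cstdint>
#include <map>
#include <mutex>

// Stand-in for Bitcoin Core's uint256 (assumption: a plain 32-byte value).
using Hash256 = std::array<std::uint8_t, 32>;

// The three midstate hashes, filled in together and then treated as read-only.
struct CachedHashes {
    Hash256 hashPrevouts{};
    Hash256 hashSequence{};
    Hash256 hashOutputs{};
};

// Hypothetical cache keyed by txid. The mutex guards only the map itself.
class HashCache {
public:
    // Copies the entry for txId into out; returns false on a cache miss.
    bool TryGet(const Hash256& txId, CachedHashes& out) const {
        std::lock_guard<std::mutex> lock(cs_);
        auto it = map_.find(txId);
        if (it == map_.end()) return false;
        out = it->second;
        return true;
    }

    // Keeps the first entry stored for a txid. If two threads race, both
    // compute the hashes, but only the first insert is kept.
    void TrySet(const Hash256& txId, const CachedHashes& hashes) {
        std::lock_guard<std::mutex> lock(cs_);
        map_.emplace(txId, hashes);
    }

private:
    mutable std::mutex cs_;
    std::map<Hash256, CachedHashes> map_;
};

// Usage sketch: a miss, then a store, then a hit.
inline void ExampleHashCache() {
    HashCache cache;
    Hash256 txId{};
    txId[0] = 1;
    CachedHashes computed;  // in real code: the three midstates of the transaction
    computed.hashOutputs[0] = 7;
    CachedHashes found;
    assert(!cache.TryGet(txId, found));
    cache.TrySet(txId, computed);
    assert(cache.TryGet(txId, found));
    assert(found.hashOutputs[0] == 7);
}
```

Because entries are copied out under the lock and never modified after insertion, readers do not need any per-entry synchronization in this sketch.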

(rebased version of sipa#70)

jl2012 · 2016-06-25T20:01:32Z

Hopefully this one gets merged in time for the versions that define segwit on mainnet.

dcousens · 2016-06-27T03:33:09Z

src/script/sigcache.h

+    bool TrySet(uint256 txId, const CachedHashes& hashes)
+    {
+        LOCK(cs);
+        if(map.count(txId))


You could avoid two lookups (IIRC) by comparing the size of the container before and after instead.

Aka:

auto sizeBefore = map.size();
map.insert(std::make_pair(txId, hashes));
return map.size() != sizeBefore;
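The single-lookup idea can also be sketched using the return value of `std::map::insert`, which reports in `.second` whether the element was actually inserted. The free function `TrySet` and `ExampleTrySet` below are hypothetical helpers for illustration only, and this is not necessarily what the follow-up commit did.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <utility>

// Inserts (key, value) only if the key is absent and returns true when the
// element was inserted. std::map::insert does one lookup and reports the
// outcome in the .second member of its result.
template <typename K, typename V>
bool TrySet(std::map<K, V>& m, const K& key, const V& value) {
    return m.insert(std::make_pair(key, value)).second;
}

// Usage sketch: the second call for the same key leaves the map unchanged.
inline void ExampleTrySet() {
    std::map<int, std::string> m;
    assert(TrySet(m, 1, std::string("first")));
    assert(!TrySet(m, 1, std::string("second")));
    assert(m.at(1) == "first");
}
```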

NicolasDorier · 2016-06-27

Actually, I don't think I really need TrySet to return a bool.

dcousens · 2016-06-27T03:35:46Z

hashcashes

hash caches?, had me confused for a second haha

utACK c2ea4dd

dcousens · 2016-06-27T03:37:23Z

src/script/interpreter.h

+class CachedHashes
+{
+public:
+    uint256 hashPrevouts,hashSequence,hashOutputs;


trivial: maybe spacing between names?

NicolasDorier · 2016-06-28T09:25:03Z

@dcousens addressed your nits in dc188d8.

dcousens · 2016-06-29T02:18:00Z

utACK dc188d8

jtimon · 2016-07-28T20:31:47Z

src/script/interpreter.cpp

@@ -1110,35 +1110,46 @@ class CTransactionSignatureSerializer {

 } // anon namespace

-uint256 SignatureHash(const CScript& scriptCode, const CTransaction& txTo, unsigned int nIn, int nHashType, const CAmount& amount, SigVersion sigversion)
+uint256 SignatureHash(const CScript& scriptCode, const CTransaction& txTo, unsigned int nIn, int nHashType, const CAmount& amount, SigVersion sigversion, CachedHashes* cache)


Perhaps here we can do the same trick we did with the script/sigcache to keep script/interpreter simpler and more reusable.
ping @sipa

Or maybe move SignatureHash to BaseSignatureChecker. Just brainstorming.

sipa · 2016-07-28T22:41:50Z

Assume a transaction has many signatures. One is SIGHASH_SINGLE, all the others are SIGHASH_ALL | SIGHASH_ANYONECANPAY.

The first computes and stores hashPrevouts. All the others will compute hashOutputs. However, after the first call, all TrySets will not do anything, as there is already a result in the cache, so it gets computed over and over again.
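To make the scenario concrete, here is a small sketch of which transaction-wide midstates a segwit signature hash reads for a given hash type, following the BIP143 rules. `MidstateNeeds`, `NeededMidstates`, and `ExampleNeededMidstates` are hypothetical names introduced for illustration; the flag values are the standard sighash constants.

```cpp
#include <cassert>

// Standard sighash flag values.
constexpr int SIGHASH_ALL = 1;
constexpr int SIGHASH_NONE = 2;
constexpr int SIGHASH_SINGLE = 3;
constexpr int SIGHASH_ANYONECANPAY = 0x80;

// Which transaction-wide midstates a BIP143 signature hash depends on.
struct MidstateNeeds {
    bool prevouts;
    bool sequence;
    bool outputs;
};

MidstateNeeds NeededMidstates(int nHashType) {
    const bool anyoneCanPay = (nHashType & SIGHASH_ANYONECANPAY) != 0;
    const int base = nHashType & 0x1f;
    const bool singleOrNone = (base == SIGHASH_SINGLE || base == SIGHASH_NONE);

    MidstateNeeds needs;
    // hashPrevouts is used unless ANYONECANPAY is set.
    needs.prevouts = !anyoneCanPay;
    // hashSequence is used only without ANYONECANPAY, SINGLE, and NONE.
    needs.sequence = !anyoneCanPay && !singleOrNone;
    // The hash over all outputs is used unless the type is SINGLE or NONE
    // (SINGLE hashes only the matching output, which is not transaction-wide).
    needs.outputs = !singleOrNone;
    return needs;
}

// Usage sketch for the two hash types in the scenario above.
inline void ExampleNeededMidstates() {
    const MidstateNeeds single = NeededMidstates(SIGHASH_SINGLE);
    assert(single.prevouts && !single.sequence && !single.outputs);

    const MidstateNeeds acp = NeededMidstates(SIGHASH_ALL | SIGHASH_ANYONECANPAY);
    assert(!acp.prevouts && !acp.sequence && acp.outputs);
}
```

Under a lazy scheme, the SIGHASH_SINGLE input would fill only hashPrevouts, while every SIGHASH_ALL | SIGHASH_ANYONECANPAY input would need hashOutputs.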

I think CachedHashes needs a Merge method like this:

void Merge(const CachedHashes& hashes) {
    if (hashPrevouts.IsNull()) hashPrevouts = hashes.hashPrevouts;
    if (hashSequence.IsNull()) hashSequence = hashes.hashSequence;
    if (hashOutputs.IsNull()) hashOutputs = hashes.hashOutputs;
}

which can then be called from TrySet (instead of insert, use map[txid].Merge(hashes)).
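A self-contained sketch of that suggestion, under stated assumptions: `Hash256` stands in for `uint256`, an all-zero value plays the role of `IsNull()`, and `MergingHashCache`, `Get`, and `ExampleMerge` are hypothetical names. The map lock is held across both the merge and the read.

```cpp
#include <array>
#include <cassert>
#include <cstdint>
#include <map>
#include <mutex>

// Stand-in for uint256; an all-zero value is treated as "not computed yet".
using Hash256 = std::array<std::uint8_t, 32>;

inline bool IsNull(const Hash256& h) {
    for (std::uint8_t b : h) {
        if (b != 0) return false;
    }
    return true;
}

struct CachedHashes {
    Hash256 hashPrevouts{};
    Hash256 hashSequence{};
    Hash256 hashOutputs{};

    // Fills in only the fields that are still unset.
    void Merge(const CachedHashes& hashes) {
        if (IsNull(hashPrevouts)) hashPrevouts = hashes.hashPrevouts;
        if (IsNull(hashSequence)) hashSequence = hashes.hashSequence;
        if (IsNull(hashOutputs)) hashOutputs = hashes.hashOutputs;
    }
};

// Hypothetical cache where TrySet merges into the existing entry.
class MergingHashCache {
public:
    void TrySet(const Hash256& txId, const CachedHashes& hashes) {
        std::lock_guard<std::mutex> lock(cs_);
        map_[txId].Merge(hashes);  // operator[] creates an empty entry if needed
    }

    // Returns a copy so callers never touch an entry outside the lock.
    CachedHashes Get(const Hash256& txId) const {
        std::lock_guard<std::mutex> lock(cs_);
        auto it = map_.find(txId);
        return it == map_.end() ? CachedHashes() : it->second;
    }

private:
    mutable std::mutex cs_;
    std::map<Hash256, CachedHashes> map_;
};

// Usage sketch: two partial results for the same txid end up combined.
inline void ExampleMerge() {
    MergingHashCache cache;
    Hash256 txId{};
    txId[0] = 1;
    CachedHashes a;
    a.hashPrevouts[0] = 11;
    CachedHashes b;
    b.hashOutputs[0] = 22;
    cache.TrySet(txId, a);
    cache.TrySet(txId, b);
    const CachedHashes merged = cache.Get(txId);
    assert(merged.hashPrevouts[0] == 11);
    assert(merged.hashOutputs[0] == 22);
    assert(IsNull(merged.hashSequence));
}
```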

NicolasDorier · 2016-07-28T23:08:55Z

so it gets computed over and over again

That's not true: I calculate all three hashes at the same time during the SIGHASH_SINGLE check.
Take a look at the SignatureHash method; I changed it to calculate the three hashes eagerly.

I prefer not doing a merge. Without a merge, my lock only has to protect the internal map, as CachedHashes instances are read-only. If we calculate the midstates lazily, I also need to be careful about locking at the CachedHashes instance level.

sipa · 2016-07-28T23:11:35Z

@NicolasDorier Oh, I see. I agree that the current code is fine in that case.

I don't understand the argument about the lock. The Merge function would also grab the lock, and be the only code that touches the map.

NicolasDorier · 2016-07-28T23:18:47Z

@sipa The Merge function as you wrote it here can't grab the lock, because the lock is at the cache map level, not at the CachedHashes instance level.

But basically, if done that way, I would need to grab the map lock around the Merge, as well as around any hash read of the CachedHashes instance (so one can't read and call Merge on the same CachedHashes at the same time).
This would create a lot of contention on a single lock. An alternative is to have one lock per CachedHashes instance. I am not sure it is worth the complexity though, knowing that the only transactions which do not need all three midstate hashes are those with no SIGHASH_ALL signature, which is a very marginal case.
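The per-instance lock alternative mentioned above might look roughly like this. It is only a sketch under assumptions: `LockedEntry`, `PerEntryLockCache`, `GetOrCreate`, `SetOutputs`, `ReadOutputs`, and `ExamplePerEntryLock` are hypothetical names, and `Hash256` stands in for `uint256`. The map lock is held just long enough to find or create an entry, and each entry carries its own mutex for reads and writes.

```cpp
#include <array>
#include <cassert>
#include <cstdint>
#include <map>
#include <memory>
#include <mutex>

using Hash256 = std::array<std::uint8_t, 32>;

// One entry per transaction, with its own lock for the three fields.
struct LockedEntry {
    std::mutex cs;
    Hash256 hashPrevouts{};
    Hash256 hashSequence{};
    Hash256 hashOutputs{};
};

class PerEntryLockCache {
public:
    // The map lock is held only while looking up or creating the entry.
    std::shared_ptr<LockedEntry> GetOrCreate(const Hash256& txId) {
        std::lock_guard<std::mutex> lock(csMap_);
        std::shared_ptr<LockedEntry>& slot = map_[txId];
        if (!slot) slot = std::make_shared<LockedEntry>();
        return slot;
    }

private:
    std::mutex csMap_;
    std::map<Hash256, std::shared_ptr<LockedEntry>> map_;
};

// Writes and reads of one entry contend only with work on the same transaction.
inline void SetOutputs(LockedEntry& entry, const Hash256& h) {
    std::lock_guard<std::mutex> lock(entry.cs);
    entry.hashOutputs = h;
}

inline Hash256 ReadOutputs(LockedEntry& entry) {
    std::lock_guard<std::mutex> lock(entry.cs);
    return entry.hashOutputs;
}

// Usage sketch: two lookups of the same txid share one entry.
inline void ExamplePerEntryLock() {
    PerEntryLockCache cache;
    Hash256 txId{};
    txId[0] = 1;
    std::shared_ptr<LockedEntry> a = cache.GetOrCreate(txId);
    std::shared_ptr<LockedEntry> b = cache.GetOrCreate(txId);
    Hash256 h{};
    h[0] = 5;
    SetOutputs(*a, h);
    assert(a == b);
    assert(ReadOutputs(*b) == h);
}
```

The trade-off is the one raised in the comment: less contention on the map lock, at the cost of a second mutex per entry and more code to review.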

NicolasDorier · 2016-07-29T01:58:22Z

Closing this one in favor of #8422, which calculates hashes lazily.

NicolasDorier mentioned this pull request Jun 24, 2016

Cached Hashes sipa/bitcoin#101

Closed

Cache hashes

c2ea4dd

NicolasDorier force-pushed the cachedhashes branch from 95f8516 to c2ea4dd Compare June 24, 2016 22:06

NicolasDorier changed the title ~~Cache hashes~~ Hash Cache Jun 25, 2016

dcousens reviewed Jun 27, 2016

laanwj added the Validation label Jun 27, 2016

nits and micro optimisation for the cache hash map

dc188d8

laanwj added the Needs backport label Jul 28, 2016

jtimon reviewed Jul 28, 2016

NicolasDorier mentioned this pull request Jul 28, 2016

Mempool: Use Consensus::CheckTxInputs direclty over main::CheckInputs #8346

Merged

NicolasDorier mentioned this pull request Jul 29, 2016

Cache hashes #8422

Closed

NicolasDorier closed this Jul 29, 2016

maflcko removed the Needs backport label Jul 31, 2016

bitcoin locked as resolved and limited conversation to collaborators Sep 8, 2021
