This PR addresses one of the biggest performance issues with binary
tries: storing each internal node individually bloats the index and the
disk, and causes a lot of write amplification. To fix this, this PR
serializes groups of nodes together.
Because we are still looking for the ideal group size, the "depth" of
the group tree is made a parameter, but that will be removed in the
future, once the perfect size is known.
This is a rebase of #33658
---------
Co-authored-by: Copilot <copilot@github.com>
The nodes were named using the byte representation of the path, instead
of the binary representation. This was confusing to other client devs
trying to achieve interop.
## Summary
Replace the `BinaryNode` interface with `NodeRef uint32` indices into
typed arena pools, eliminating GC-scanned pointers from binary trie
nodes.
Inspired by [fjl's
observation](https://github.com/ethereum/go-ethereum/pull/34034#issuecomment-4075176446):
> *"if the binary trie produces such a large graph, it should probably
be changed so that the trie node type does not contain pointers. The
runtime does not scan objects that do not contain pointers, so it can
really help with the performance to build it this way."*
### The problem
CPU profiling of the binary trie (EIP-7864) showed **44% of CPU time in
garbage collection**. Each `InternalNode` held two `BinaryNode`
interface values (2 pointer-words each), and the GC scanned every one.
With ~25K `InternalNode`s in memory during block processing, this
created enormous GC pressure.
### The solution
`NodeRef` is a compact `uint32` (2-bit kind tag + 30-bit pool index).
`NodeStore` manages chunked typed pools per node kind:
- **InternalNode pool**: ZERO Go pointers (children are `NodeRef`, hash
is `[32]byte`) → noscan spans
- **HashedNode pool**: ZERO Go pointers → noscan spans
- **StemNode pool**: retains `Values [][]byte` (matching existing
format)
The serialization format is unchanged — flat InternalNode
`[type][leftHash][rightHash]` = 65 bytes.
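For illustration, a minimal sketch of the packing described above; the kind values and helper names are assumptions, not the PR's exact code:

```go
package bintrie

// NodeRef packs a node reference into a single pointer-free word:
// the low 2 bits carry the node kind, the high 30 bits index into the
// per-kind pool. The concrete kind values below are illustrative.
type NodeRef uint32

const (
	kindBits = 2
	kindMask = (1 << kindBits) - 1

	kindEmpty    = 0
	kindInternal = 1
	kindStem     = 2
	kindHashed   = 3
)

// makeRef builds a reference from a kind tag and a pool index.
func makeRef(kind, index uint32) NodeRef {
	return NodeRef(index<<kindBits | kind&kindMask)
}

// kind and index unpack the reference; neither touches the heap, so the
// structs holding NodeRefs stay pointer-free and land in noscan spans.
func (r NodeRef) kind() uint32  { return uint32(r) & kindMask }
func (r NodeRef) index() uint32 { return uint32(r) >> kindBits }
```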
## Benchmark: Apple M4 Pro (`--benchtime=10s --count=3`, on top of
#34021)
| Metric | Baseline | Arena | Delta |
|--------|----------|-------|-------|
| Approve (Mgas/s) | 374 | 382 | **+2.1%** |
| BalanceOf (Mgas/s) | 885 | 901 | **+1.8%** |
| Approve allocs/op | 775K | **607K** | **-21.7%** |
| BalanceOf allocs/op | 265K | **228K** | **-14.0%** |
## Benchmark: AMD EPYC 48-core (50GB state, execution-specs ERC-20, on
top of #34021 + #34032)
| Benchmark | Baseline | Arena | Delta |
|-----------|----------|-------|-------|
| erc20_approve (write) | 22.4 Mgas/s | **27.0 Mgas/s** | **+20.5%** |
| mixed_sload_sstore | 62.9 Mgas/s | **97.3 Mgas/s** | **+54.7%** |
| erc20_balanceof (read) | 180.8 Mgas/s | 167.6 Mgas/s | -7.3% (cold cache variance) |
The arena benefit scales with heap size — the EPYC (larger heap, more GC
pressure) shows much larger gains than the M4 Pro (efficient unified
memory). The mixed workload baseline was unstable (62.9 vs 16.3 Mgas/s
between runs due to GC-induced throughput collapse); the arena
eliminates this entirely (95-97 Mgas/s, stable).
## Dependencies
Benchmarked with #34021 (H01 N+1 fix) + #34032 (R14 parallel hashing).
No code dependency — applies independently to master.
All test suites pass (`trie/bintrie` with `-race`, `core/state`,
`triedb/pathdb`, `cmd/geth`).
---------
Co-authored-by: Guillaume Ballet <3272758+gballet@users.noreply.github.com>
## Problem
`BinaryTrie.Commit` unconditionally walked every resolved in-memory node
and flushed it into the `NodeSet`, producing one Pebble write per
resolved internal + stem node on every block — even when the node's
on-disk blob was bitwise identical to the previous commit. On a warm
400M-state workload this meant tens of thousands of redundant 65-byte
writes per block, compounding Pebble compaction pressure on every
commit.
The existing `mustRecompute` flag tracks *hash* staleness, not
*disk-blob* staleness: after `Hash()` completes, `mustRecompute` is
cleared even though the fresh blob has not been persisted. It is
therefore insufficient for a skip-flush optimization.
## Fix
Mirror the MPT committer pattern (`trie/committer.go:51-56`) by adding a
`dirty` flag on `InternalNode` and `StemNode` with the semantics *the
on-disk blob is stale*. The flag is:
- set to `true` wherever the node is created or structurally modified
(the same call sites that already set `mustRecompute = true`);
- set to `false` only after the node has been passed to the `flushfn`
inside `CollectNodes`;
- left `false` on nodes produced by `DeserializeNodeWithHash`, matching
the *loaded from disk, already persisted* semantics.
`CollectNodes` short-circuits on `!dirty` subtrees. The propagation
invariant (an ancestor of any dirty node is itself dirty) is already
maintained by the existing `InsertValuesAtStem` / `Insert` paths, which
now mirror every `mustRecompute = true` setter with a `dirty = true`
setter.
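For illustration, a hedged sketch of the skip-flush walk; the method name, node fields, and the `serialize` helper are assumptions based on the description above:

```go
// collectNodes flushes only subtrees whose on-disk blob is stale.
func (n *InternalNode) collectNodes(path []byte, flushfn func(path, blob []byte)) {
	if !n.dirty {
		return // the persisted blob for this whole subtree is still current
	}
	if n.left != nil {
		n.left.collectNodes(append(append([]byte(nil), path...), 0), flushfn)
	}
	if n.right != nil {
		n.right.collectNodes(append(append([]byte(nil), path...), 1), flushfn)
	}
	flushfn(path, n.serialize()) // hypothetical serializer producing the node blob
	n.dirty = false              // blob handed to the committer, no longer stale
}
```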
## Benchmark
New `BenchmarkCollectNodes_SparseWrite` measures commit cost when only
one leaf changes between blocks — the common case for state updates.
10,000-stem trie, one-leaf modification + Commit per iteration, Apple M4
Pro:
| | before | after | delta |
|---|---|---|---|
| time / op | 12,653,000 ns | 7,336 ns | **~1,725×** |
| bytes / op | 107,224,740 B | 37,774 B | **~2,839×** |
| allocs / op | 80,953 | 134 | **~604×** |
End-to-end impact on a real workload depends on the
resolved-footprint-to-dirty-path ratio; the new
`TestBinaryTrieCommitIncremental` provides a structural regression guard
(asserts that a Commit following a single-leaf modification flushes a
root-to-leaf path, not the whole tree).
---
Found all of this stuff while bloating my #34706 DB to run some
benchmarks, and saw we were spending A LOT OF TIME on hashing.
Hope this helps the perf a bit. I will rebase the flat-state PR on top of
this once merged.
This PR implements some prerequisite changes for #34004: split the
`CachingDB` into a `MerkleDB` and a `UBTDB`, so that their very different
behaviors don't clash as much.
The transition isn't handled by this PR, but after talking to Gary we
agreed that `UBTDB` should receive another `triedb`, which will only be
loaded if the `Ended` flag is set to false in the conversion contract.
If this is too hard to achieve, it makes sense to load it regardless,
and then loading can be prevented at a later stage by adding a
`UBTTransitionFinalizationTime` in `ChainConfig`.
---------
Co-authored-by: Gary Rong <garyrong0905@gmail.com>
Fix `GetAccount` returning **wrong account data** for non-existent
addresses when the trie root is a `StemNode` (single-account trie) — the
`StemNode` branch returned `r.Values` without verifying the queried
address's stem matches.
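For illustration, a hedged sketch of the added check (the type assertion, the 31-byte stem length, and the `decodeAccount` helper are assumptions):

```go
if stem, ok := root.(*StemNode); ok {
	if !bytes.Equal(stem.Stem, key[:31]) {
		return nil, nil // different stem: the queried account does not exist
	}
	return decodeAccount(stem.Values), nil // hypothetical decoder for the stem's values
}
```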
Co-authored-by: Guillaume Ballet <3272758+gballet@users.noreply.github.com>
`BinaryTrie.DeleteAccount` was a no-op, silently ignoring the caller's
deletion request and leaving the old `BasicData` and `CodeHash` in the
trie.
Co-authored-by: Guillaume Ballet <3272758+gballet@users.noreply.github.com>
Fix three issues in the binary trie NodeIterator:
1. Empty nodes now properly backtrack to parent and continue iteration
instead of terminating the entire walk early.
2. `HashedNode` resolver handles `nil` data (all-zeros hash) gracefully
by treating it as Empty rather than panicking.
3. Parent update after node resolution guards against stack underflow
when resolving the root node itself.
---------
Co-authored-by: tellabg <249254436+tellabg@users.noreply.github.com>
## Summary
At tree depths below `log2(NumCPU)` (clamped to [2, 8]), hash the left
subtree in a goroutine while hashing the right subtree inline. This
exploits available CPU cores for the top levels of the tree where
subtree hashing is most expensive. On single-core machines, the parallel
path is disabled entirely.
Deeper nodes use sequential hashing with the existing `sync.Pool` hasher
where goroutine overhead would exceed the hash computation cost. The
parallel path uses `sha256.Sum256` with a stack-allocated buffer to
avoid pool contention across goroutines.
**Safety:**
- Left/right subtrees are disjoint — no shared mutable state
- `sync.WaitGroup` provides happens-before guarantee for the result
- `defer wg.Done()` + `recover()` prevents goroutine panics from
crashing the process
- `!bt.mustRecompute` early return means clean nodes never enter the
parallel path
- Hash results are deterministic regardless of computation order — no
consensus risk
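For illustration, a hedged sketch of the split (panic recovery omitted; the node fields and exact hashing rule are assumptions):

```go
package bintrie

import (
	"crypto/sha256"
	"math/bits"
	"runtime"
	"sync"
)

// Illustrative node type; the real InternalNode carries more state.
type InternalNode struct {
	left, right *InternalNode
}

// parallelDepthLimit approximates log2(NumCPU) clamped to [2, 8].
var parallelDepthLimit = min(8, max(2, bits.Len(uint(runtime.NumCPU()))-1))

func (n *InternalNode) hashSubtree(depth int) [32]byte {
	if n == nil {
		return [32]byte{}
	}
	if runtime.NumCPU() == 1 || depth >= parallelDepthLimit {
		// Deep (or single-core): sequential hashing, goroutine overhead
		// would exceed the hash computation cost here.
		left := n.left.hashSubtree(depth + 1)
		right := n.right.hashSubtree(depth + 1)
		return sha256.Sum256(append(left[:], right[:]...))
	}
	var (
		wg   sync.WaitGroup
		left [32]byte
	)
	wg.Add(1)
	go func() {
		defer wg.Done()
		left = n.left.hashSubtree(depth + 1) // disjoint subtree, no shared state
	}()
	right := n.right.hashSubtree(depth + 1) // hash the right subtree inline
	wg.Wait()                               // happens-before edge for `left`
	return sha256.Sum256(append(left[:], right[:]...))
}
```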
## Benchmark (AMD EPYC 48-core, 500K entries, `--benchtime=10s
--count=3`, post-H01 baseline)
| Metric | Baseline | Parallel | Delta |
|--------|----------|----------|-------|
| Approve (Mgas/s) | 224.5 ± 7.1 | **259.6 ± 2.4** | **+15.6%** |
| BalanceOf (Mgas/s) | 982.9 ± 5.1 | 954.3 ± 10.8 | -2.9% (noise, clean nodes skip parallel path) |
| Allocs/op (approve) | ~810K | ~700K | -13.6% |
Binary tree hashing is quite slow, owing to many factors. One of them is
the GC pressure caused by allocating many hashers, since a binary tree is
4x the size of an MPT. This PR introduces an optimization that already
exists for the MPT: keep a pool of hashers, in order to reduce the number
of allocations.
This is an optimization that existed for verkle and the MPT, but that
got dropped during the rebase.
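For illustration, the pattern looks roughly like this (a sketch; the real pooled hasher carries more per-node state):

```go
package bintrie

import (
	"crypto/sha256"
	"hash"
	"sync"
)

// hasherPool recycles hashers across nodes instead of allocating one per node.
var hasherPool = sync.Pool{
	New: func() any { return sha256.New() },
}

// hashData hashes a serialized node with a pooled hasher.
func hashData(data []byte) [32]byte {
	h := hasherPool.Get().(hash.Hash)
	defer func() {
		h.Reset()
		hasherPool.Put(h)
	}()
	h.Write(data)
	var out [32]byte
	h.Sum(out[:0])
	return out
}
```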
Mark the nodes that were modified as needing recomputation, and skip the
hash computation if this is not needed. Otherwise, the whole tree is
hashed, which kills performance.
The computation of `MAIN_STORAGE_OFFSET` was incorrect, causing the last
byte of the stem to be dropped. This means that there would be a
collision in the hash computation (at the preimage level, not a hash
collision of course) if two keys differed only at byte 31.
Pebble maintains a batch pool to recycle batch objects. Unfortunately,
a batch must be explicitly returned to that pool via the `batch.Close`
function. This PR extends the batch interface by adding a `Close`
function and also invokes `batch.Close` in some critical code paths.
Memory allocation must be measured before merging this change. What's
more, it's an open question whether we should call `batch.Close` as much
as possible in every invocation.
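A hedged usage sketch of the new method (`db`, `key` and `value` are placeholders; only `Close` is the addition described here):

```go
func writeOne(db ethdb.KeyValueStore, key, value []byte) error {
	batch := db.NewBatch()
	defer batch.Close() // return the underlying pebble.Batch to its pool

	if err := batch.Put(key, value); err != nil {
		return err
	}
	return batch.Write()
}
```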
I removed `Iterator.Count` in #33840, because it appeared to be unused
and did not provide the documented invariant: the returned count should
always be an upper bound on the number of iterations allowed by `Next`.
In order to make `Count` work, the semantics of `CountValues` have to
change to return the number of items up to and including the invalid
one. I have reviewed all call sites of `CountValues` to assess whether
changing this is safe. There aren't that many, and the only call that
doesn't check the error and return is in the trie node parser,
`trie.decodeNodeUnsafe`. There, we distinguish the node type based on
the number of items, and it previously returned an error for item count
zero. In order to avoid any potential issue that could result from this
change, I'm adding an error check in that function, though it isn't
necessary.
GetStorage and DeleteStorage used GetBinaryTreeKey to compute the tree
key, while UpdateStorage used GetBinaryTreeKeyStorageSlot. The latter
applies storage slot remapping (header offset for slots <64, main
storage prefix for the rest), so reads and deletes were targeting
different tree locations than writes.
Replace GetBinaryTreeKey with GetBinaryTreeKeyStorageSlot in both
GetStorage and DeleteStorage to match UpdateStorage. Add a regression
test that verifies the write→read→delete→read round-trip for main
storage slots.
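For illustration, a hedged sketch of the change; the receiver and lookup helpers are assumptions, only the two key-derivation functions are named above:

```go
func (t *BinaryTrie) GetStorage(addr common.Address, key []byte) ([]byte, error) {
	// Before: GetBinaryTreeKey(addr, key), which applies no slot remapping,
	// so reads targeted a different tree location than UpdateStorage wrote to.
	k := GetBinaryTreeKeyStorageSlot(addr, key)
	return t.get(k) // hypothetical lookup helper
}

func (t *BinaryTrie) DeleteStorage(addr common.Address, key []byte) error {
	k := GetBinaryTreeKeyStorageSlot(addr, key) // now matches UpdateStorage
	return t.del(k) // hypothetical delete helper
}
```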
The `decodeRef` function used `size > hashLen` to reject oversized
embedded nodes, but this incorrectly allowed nodes of exactly 32 bytes
through. The encoding side (hasher.go, stacktrie.go) consistently uses
`len(enc) < 32` to decide whether to embed a node inline, meaning nodes
of 32+ bytes are always hash-referenced. The error message itself
already stated `want size < 32`, confirming the intended threshold.
Changed `size > hashLen` to `size >= hashLen` in `decodeRef` to align
the decoding validation with the encoding logic, the Yellow Paper spec,
and the surrounding comments.
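The changed bound in context, roughly (a sketch; the surrounding `decodeRef` code is abbreviated):

```go
// Embedded nodes must encode to fewer than 32 bytes; a reference of exactly
// hashLen bytes is always a hash, so it is rejected here as well.
if size >= hashLen { // previously: size > hashLen
	return nil, buf, fmt.Errorf("oversized embedded node (size is %d bytes, want size < %d)", size, hashLen)
}
```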
The upstream library has removed the assembly-based implementation of
keccak. We need to maintain our own library to avoid a performance
regression.
---------
Co-authored-by: lightclient <lightclient@protonmail.com>
The `Witness` method was not implemented for the binary tree, which
caused `debug_executionWitness` to panic. This PR fixes that.
Note that the `TransitionTrie` version isn't implemented, and that's on
purpose: more thought must be given to what should go in the global
witness.
Based on [EIP-7864](https://eips.ethereum.org/EIPS/eip-7864), the tree
index should be 32 bytes instead of 31 bytes.
```python
def get_tree_key(address: Address32, tree_index: int, sub_index: int):
    # Assumes STEM_SUBTREE_WIDTH = 256
    return tree_hash(address + tree_index.to_bytes(32, "little"))[:31] + bytes(
        [sub_index]
    )
```
This PR optimizes memory allocation in StateTrie.PrefetchAccount() and
StateTrie.PrefetchStorage() by preallocating slice capacity when the
final size is known.
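A hedged illustration of the pattern with placeholder names:

```go
// The number of keys is known up front, so the slice is allocated with its
// final capacity instead of being grown append by append.
keys := make([][]byte, 0, len(slots))
for _, slot := range slots {
	keys = append(keys, append([]byte(nil), slot...))
}
```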
This pull request introduces a mechanism to compress trienode history by
storing only the node diffs between consecutive versions.
- For full nodes, only the modified children are recorded in the history;
- For short nodes, only the modified value is stored;
If the node type has changed, or if the node is newly created or
deleted, the entire node value is stored instead.
To mitigate the overhead of reassembling nodes from diffs during history
reads, checkpoints are introduced by periodically storing full node values.
The current checkpoint interval is set to every 16 mutations, though
this parameter may be made configurable in the future.
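For illustration, a hedged sketch of the write-side decision; only the checkpoint-every-16-mutations rule comes from the description above, the function and parameter names are hypothetical:

```go
const checkpointInterval = 16

// storeFullValue reports whether the history entry for a mutation should carry
// the complete node value rather than a diff against the previous version.
func storeFullValue(version uint64, prevExists, curExists, typeChanged bool) bool {
	switch {
	case version%checkpointInterval == 0:
		return true // periodic checkpoint to bound diff replay cost on reads
	case !prevExists || !curExists:
		return true // newly created or deleted node: no meaningful diff
	case typeChanged:
		return true // full/short node type changed between versions
	default:
		return false // otherwise record only the modified children or value
	}
}
```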
In order to reduce the amount of code that is embedded into the keeper
binary, I am removing all the verkle code that uses go-verkle and
go-ipa. This will be followed by further PRs that are more like stubs to
replace code when the keeper build is detected.
I'm keeping the binary tree of course. This means that you will still
see `isVerkle` variables all over the codebase, but they will be renamed
when code is touched (i.e. this is not an invitation for 30+ AI slop
PRs).
---------
Co-authored-by: Gary Rong <garyrong0905@gmail.com>
This is broken off of #31730 to only focus on testing networks that
start with verkle at genesis.
The PR has seen a lot of work since its creation, and it now targets
creating and re-executing tests for a binary tree testnet without the
transition (so it starts at genesis). The transition tree has been moved
to its own package. It also replaces verkle with the binary tree for
this specific application.
---------
Co-authored-by: Gary Rong <garyrong0905@gmail.com>
This change addresses critical issues in the state object duplication
process specific to Verkle trie implementations. Without these
modifications, updates to state objects fail to propagate correctly
through the trie structure after a statedb copy operation, leading to
inaccuracies in the computation of the state root hash.
---------
Co-authored-by: Guillaume Ballet <3272758+gballet@users.noreply.github.com>
- Correct the error message in TestOneElementProof to expect 'v' instead
of 'k'.
- The trie is updated with key "k" and value "v"; on mismatch the
expected value must be 'v'.
- Aligns the message with the actual test logic and other similar checks
in this file, reducing confusion during test failures. No behavioral
changes.
- Adds `NodeIteratorWithPrefix()` method to support iterating only nodes
within a specific key prefix
- Adds `NodeIteratorWithRange()` method to support iterating only nodes
within a specific key range
Current `NodeIterator` always traverses the entire remaining trie from a
start position. For non-ethereum applications using the trie implementation,
there's no way to limit iteration to just a subtree with a specific prefix.
**Usage:**
```go
// Only iterate nodes with prefix "key1"
iter, err := trie.NodeIteratorWithPrefix([]byte("key1"))
```
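A hedged companion example for the range variant (the exact parameter form, start and end keys, is an assumption):

```go
// Only iterate nodes whose keys fall within the given range
iter, err := trie.NodeIteratorWithRange([]byte("key1"), []byte("key5"))
```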
Testing: Comprehensive test suite covering edge cases and boundary conditions.
Closes #32484
---------
Co-authored-by: gballet <guillaume.ballet@gmail.com>
Co-authored-by: Gary Rong <garyrong0905@gmail.com>
This PR is the first step in the trienode history series.
It introduces the `nodeWithOrigin` struct in the path database, which tracks
the original values of dirty nodes to support trienode history construction.
Note that the original value is always empty in this PR, so it won't break the
existing journal encoding and decoding. Journal compatibility will be
handled in a follow-up PR.
Implement the binary tree as specified in [eip-7864](https://eips.ethereum.org/EIPS/eip-7864).
This will gradually replace verkle trees in the codebase. For now it only
runs in tests and will not be executed in production, but it will help
me rebase some of my work, so that it doesn't bitrot as much.
---------
Signed-off-by: Guillaume Ballet
Co-authored-by: Parithosh Jayanthi <parithosh.jayanthi@ethereum.org>
Co-authored-by: rjl493456442 <garyrong0905@gmail.com>
This pull request introduces a `Prefetch` operation in the trie to prefetch trie
nodes in parallel. It is used by the `triePrefetcher` to accelerate state
loading and improve overall chain processing performance.
This adds some of the changes that were missing from #31634. It
introduces the `TransitionTrie`, which is a façade pattern between the
current MPT trie and the overlay tree.
---------
Signed-off-by: Guillaume Ballet <3272758+gballet@users.noreply.github.com>
Co-authored-by: rjl493456442 <garyrong0905@gmail.com>
The main changes made in this PR are highlighted below.
The trie tracer is split into two distinct structs: opTracer and prevalueTracer.
The former is specific to MPT, while the latter is generic and applicable to all
trie implementations.
The original values of dirty nodes are tracked in a NodeSet. This serves
as the foundation for both full archive node implementations and the state live
tracer.
This pull request optimizes trie hashing by reducing memory allocation
overhead. Specifically:
- define a fullNodeEncoder pool to reuse encoders and avoid memory
allocations.
- simplify the encoding logic for shortNode and fullNode by getting rid
of the Go interfaces.
The optimization tried to defer allocating the cache map until it was used for the
first time. It's a relic from earlier times, when tries were copied often. This seems
unnecessary now, so we can just create the map when the trie is created.
---------
Co-authored-by: Felix Lange <fjl@twurst.com>
As the preimage will only be stored if `t.preimages != nil`, there is no need
to save it into the local cache when preimages are not enabled. This reduces
the memory wasted on copying the bytes.
---------
Signed-off-by: jsvisa <delweng@gmail.com>