go-ethereum

mirror of https://github.com/ethereum/go-ethereum.git synced 2026-06-12 09:51:36 +00:00

Author	SHA1	Message	Date
CPerezz	21f243ff8a	triedb/pathdb,core/state: fix disklayer.storage fail-open gate and historicStateReader rlp.Split bug Addresses review finding C4 + Opus agent audit secondary bug. Bug 1 — fail-open gate in disklayer.storage: disklayer.storage() compared a 64-byte merkle-shaped combinedKey (accountHash \|\| storageHash) against the 32-byte bintrie generator marker via codec.MarkerCompare. For bintrie, accountHash is always common.Hash{} (since bintrieFlatCodec.StorageKey returns zero for the account key), so the combinedKey started with 32 zero bytes. The sha256-derived marker's first byte is essentially never 0x00, so bytes.Compare returned -1, the > 0 branch never fired, and the generator-progress gate was silently DISABLED. During active generation, disklayer.storage served whatever was on disk (nil or stale) without returning errNotCoveredYet. Fix: add StorageMarkerKey(accountHash, storageHash) to the flatStateCodec interface. Merkle returns the 64-byte concatenation (preserving existing behavior); bintrie returns storageHash[:] (the 32-byte stem\|\|offset key matching the generator marker shape). disklayer.storage now uses the codec method. Bug 2 — rlp.Split on raw bintrie storage leaves in historicStateReader: historicStateReader.Storage at core/state/database_history.go:87 calls rlp.Split(blob) on whatever bytes the pathdb historical reader returns. Merkle storage values are RLP-encoded (trimmed-left-zeros); bintrie leaves are raw 32 bytes. rlp.Split on raw 32-byte input either errors or decodes garbage. Even after fixing Bug 1, bintrie historical storage reads were broken end-to-end. Fix: add isVerkle bool to historicStateReader; when true, bypass rlp.Split and copy the raw 32-byte blob directly. The flag is set from db.triedb.IsVerkle() at construction time.	2026-04-15 15:00:41 +02:00
CPerezz	78f785e4ff	core/state: fix (nil,nil) shadowing trie reader fallback in bintrieFlatReader Addresses review finding C3. Before this commit, bintrieFlatReader.Account returned (nil, nil) when both the BasicData and CodeHash leaves were absent from the flat state. multiStateReader.Account treats (nil, nil) as "confirmed absent" and short-circuits — the trie reader never runs. This silently hid every corruption mode the other A-commits are fixing (C1 mid-stem resume loss, C2 disk-layer shape mismatch, in-transition stale data, etc.): the flat state said "not present" and nobody checked. Fix: introduce errBintrieFlatStateMiss as a local sentinel. When both leaves are absent, the flat reader returns (nil, errBintrieFlatStateMiss) instead of (nil, nil). The multiStateReader falls through on any non-nil error, so the trie reader now runs and serves as the authoritative gatekeeper. If the flat state genuinely has no data (and the trie reader also returns nil), the end result is the same — but any case where the flat state is wrong and the trie is right is now caught by the fallthrough. Same treatment for Storage: absent blob returns errBintrieFlatStateMiss. Known limitation: BinaryTrie.GetAccount does not verify stem membership (a characteristic of verkle-style tries where non-membership proofs are handled externally). A truly non-existent account returns the closest stem's data, not nil. The TestBintrieFlatReaderMissingAccountSentinel test therefore verifies the flat reader's sentinel in isolation rather than the end-to-end multiStateReader result.	2026-04-15 15:00:40 +02:00
CPerezz	fcc0587ec3	core/state,triedb/pathdb: fix bintrieFlatReader disk-layer shape via per-offset extraction Addresses review finding C2 (+ I5, S5, T2, T3, T12). Before this commit, bintrieFlatCodec.ReadAccount returned the FULL variable-length stem blob from disk while the in-memory diff-layer buffer stored per-offset 32-byte values. The consumer, bintrieFlatReader.Account, enforced len(basicBlob)!=32 → error, so every disk-layer hit produced "bintrie BasicData leaf invalid length" in production the moment the write buffer flushed. TestBintrieFlatReaderEndToEnd did not catch this because it never forced a buffer → disk flush. Fix: make bintrieFlatCodec.ReadAccount extract the offset from the stem blob (mirroring ReadStorage), so the disk path and the buffer path return the same 32-byte per-offset shape. Update AccountCacheKey/StorageCacheKey to embed the full 32-byte key (prefix + 31-byte stem + 1-byte offset), since caching under a stem-only key would collapse BasicData and CodeHash into the same slot and return the wrong value on the second hit. Update Flush's cache-update loop to store per-offset entries from the aggregated write set. Design note: I considered the alternative of introducing a new StemBlob(stem) interface method that returns the full blob synthesized from a stem-level lookup index. Rejected because (a) the index is a new data structure with its own consistency invariants, (b) the per-offset approach is strictly local to the codec + reader, and (c) the "1 Pebble read per Account" locality benefit is preserved at the OS page cache level — both offsets at the same stem live in the same Pebble block, so the second read is effectively free. bintrieFlatReader.Account still does two AccountRLP lookups; the torn-read hazard is gated by a new load-bearing invariant test, TestBinaryHasherWritesBothBasicAndCodeHash, which asserts that binaryHasher.updateAccount always emits both BasicData and CodeHash leaves together. 
A future code-only update that broke this invariant would fail the test. Tests added: * TestBintrieFlatReaderEndToEndAfterFlush — explicitly flushes via tdb.Commit(root, false) and re-reads through a fresh StateReader. This is the smoking-gun regression for C2. * TestBintrieFlatReaderMultipleOffsetsPerStem — multiple offsets at the same stem (BasicData, CodeHash, header storage slots) all round-trip post-flush. * TestBintrieCodecCrossFlushRMW — two Flush calls to the same stem from different "blocks" correctly merge on disk, with prior offsets preserved. * TestBinaryHasherWritesBothBasicAndCodeHash — locks down the hasher co-write invariant that bintrieFlatReader.Account relies on. Existing tests updated to match the new per-offset ReadAccount semantics: * TestBintrieCodecAccountRoundTrip, TestBintrieCodecMultipleWritesSameStem, TestBintrieCodecDeleteAccount — now read per-offset rather than calling extractStemOffset on the raw blob. * TestBintrieCodecCacheKeysDisjoint — additionally verifies two offsets at the same stem produce distinct cache keys. Error messages in bintrieFlatReader now include address and length context (S5).	2026-04-15 15:00:40 +02:00
CPerezz	bfb77d98f6	core/state,triedb/pathdb: enable bintrie flat state reads end-to-end Wires the pieces from Commits 1-9 into a running system: * triedb/pathdb.New: install the bintrieFlatCodec when isVerkle is set, backed by the same verkle-namespaced db used for trie nodes. * triedb/pathdb.database.go: drop isVerkle from the noBuild guard so the bintrie generator (Commit 9) runs on startup, and remove it from the generateSnapshot call path for the same reason. * triedb/pathdb.disklayer.revert: hard-fail on bintrie because the reorg path would replay merkle-shaped origin records against a per-stem layout. Tracked in BINTRIE_FLAT_STATE_REORG_GAP.md. * triedb/pathdb.journal: add IsBintrie to journalGenerator (rlp:"optional" so v3 journals still decode) and make journalProgress a method on generator so it stamps the active scheme; loadGenerator discards any journal whose scheme does not match the database, forcing a fresh regeneration. * triedb/pathdb.reader: export RawStateReader, a small extension of database.StateReader that exposes AccountRLP so callers outside the package can reach the raw flat-state bytes without going through the slim-RLP decode path that assumes merkle shape. * core/state.reader: add bintrieFlatReader, the bintrie equivalent of flatReader. It derives the EIP-7864 stem keys from (addr, slot), performs two AccountRLP lookups per Account call (BasicData + CodeHash), and decodes via bintrie.UnpackBasicData. Storage reads go through a single AccountRLP lookup at the slot's full bintrie key. * core/state.database.StateReader: dispatch to bintrieFlatReader when the path database is in verkle mode; merkle path unchanged. Depends on the lookup sentinel fix in the previous commit; without it missing-account reads on bintrie misreport as "layer stale".	2026-04-15 15:00:40 +02:00
CPerezz	a1ff36d9e1	core/state,triedb/pathdb: wire bintrie leaves through stateUpdate Drains the binaryHasher's LeafProducer side-channel in StateDB.commit and threads the stem writes through stateUpdate.encodeBinary into the pathdb state set as per-offset accountData entries (key = stem\|\|offset, value = 32-byte leaf or nil for clears). The flat-state codec gains a Flush method that owns the in-memory→disk write path, replacing the codec-agnostic per-entry loop in writeStates. The merkle codec preserves its historical per-entry behavior verbatim; the bintrie codec aggregates per-offset writes by stem so each stem hits disk via a single read-modify-write, satisfying the codec's pre-aggregation requirement and updating the clean cache with the merged blob it just produced (no extra disk read). stateUpdate.encodeBinary returns empty origin maps for the bintrie path: state-history rollback for bintrie is deferred to a follow-up PR (see BINTRIE_FLAT_STATE_REORG_GAP.md), and the diskLayer.revert path will panic before consuming origins anyway.	2026-04-15 15:00:40 +02:00
CPerezz	29ef7576d9	core/state: hook leaf production in binaryHasher binaryHasher now implements the new LeafProducer optional extension to the Hasher interface. Every UpdateAccount, UpdateStorage, and delete path records the corresponding (stem, offset, value) write into an internal buffer, which the caller drains once per block via DrainStemWrites() and hands to the pathdb flat-state layer through the stateUpdate (wired up in the next commit). Three kinds of writes are recorded: - Account create/update: two writes (BasicData at offset 0, CodeHash at offset 1), sharing the same 31-byte stem. BasicData is produced via bintrie.PackBasicData so the flat-state blob is bit-identical to what the trie layer packs internally. - Storage update: one write per slot. Non-zero values become right-justified 32-byte blobs; the zero value (the bintrie's "delete" convention) becomes 32 zero bytes, matching the trie's tombstone-with-zero semantics so the flat-state mirror stays bit-identical to the StemNode.Values entry. - Account delete: two clear writes (nil Value) for offsets 0 and 1. Storage slots and code chunks at the same or other stems are NOT touched; pre-EIP-6780 full-wipe is a documented scope limitation. The LeafProducer interface lives on Hasher and is strictly opt-in — merkleHasher does not implement it, and callers detect capability via a type assertion. This keeps the read-side/write-side split of the existing Hasher cleanly extended: hashers that have a concept of flat-state leaves can expose them; hashers that don't (MPT) are unaffected. Tests cover: - TestBinaryHasherLeafProduction: account update produces 2 writes at offsets 0+1 with matching stem; drain is destructive; storage update emits one matching write; zero-value storage writes 32 zero bytes; delete emits 2 clear writes. - TestMerkleHasherNoLeafProducer: merkleHasher does NOT satisfy the LeafProducer interface (the capability is opt-in per hasher). 
The collected stem writes are not yet propagated anywhere — a later commit wires DrainStemWrites into StateDB.IntermediateRoot so the writes flow through stateUpdate and the pathdb stateSet into the flat-state layer.	2026-04-15 15:00:40 +02:00
CPerezz	64d185616c	core/state: plumb CodeSize through AccountMut for binaryHasher binaryHasher.updateAccount computed codeLen from len(account.Code.Code), which is only non-zero when the code itself was modified in the current block. For balance- or nonce-only updates account.Code is nil and the computed codeLen was 0, silently overwriting the code_size field packed into the bintrie BasicData leaf (EIP-7864 bytes 5-7) with zero every time a contract was touched without a code write. The TODO(rjl493456442) on updateAccount acknowledged this. Fix it by adding a CodeSize field to AccountMut and having the caller at StateDB.IntermediateRoot populate it via stateObject.CodeSize(), which returns len(obj.code) when the bytes are loaded, otherwise falls back to a code-size lookup via the reader. The binary hasher then passes account.CodeSize straight to BinaryTrie.UpdateAccount as its codeLen argument, and the TODO is removed. Rationale for placing CodeSize on AccountMut rather than Account: AccountMut already carries Code *CodeMut — the new bytecode, which is not a field of Account — because code is write-time data that is not persisted in the flat-state format (SlimAccountRLP). CodeSize has the identical lifecycle: it is not in SlimAccountRLP, it is not populated by any reader, and it is only consumed by the hasher at write time. Mirroring Code's placement keeps the read-side/write-side split honest (Account models the persisted flat-state record; AccountMut adds the code-related write-time parameters). If the bintrie flat-state format is later extended to carry code_size, CodeSize can be promoted onto Account at that time. merkleHasher is unaffected: StateTrie.UpdateAccount ignores its codeLen parameter, so the wrapTrie.UpdateAccount shim continues to pass 0 and no state-root divergence is introduced on the MPT path. 
Regression test TestVerkleCodeSizePreserved verifies that the state root produced by "create contract, commit, reload, modify balance, commit" matches the root of a single-step construction of the same final state. Before the fix the roots diverge: path A (reload + balance): 1a675599... path B (fresh, same state): de0cfb03...	2026-04-15 15:00:39 +02:00
Gary Rong	533d2109d5	core: fix memory leaking	2026-04-15 15:00:39 +02:00
Gary Rong	aec9c18432	core/state: improve binary hasher	2026-04-15 15:00:39 +02:00
Gary Rong	a2496465f9	core: fix cross validation	2026-04-15 15:00:39 +02:00
Gary Rong	d57dca07b1	core/state: integrate witness collector	2026-04-15 15:00:39 +02:00
Gary Rong	5e23a29b73	core/state: integrate prefetching into merkle hasher	2026-04-15 15:00:38 +02:00
Gary Rong	91298c8655	core/state: implement binary hasher just for demonstration	2026-04-15 15:00:38 +02:00
Gary Rong	282cece030	core/state: implement merkle hasher	2026-04-15 15:00:38 +02:00
Gary Rong	38c7021c73	core/state: invoke prefetcher	2026-04-15 15:00:38 +02:00
Gary Rong	1ae462f08d	core/state: build hasher skeleton	2026-04-15 15:00:38 +02:00
Gary Rong	9daaef1923	core/state: remove trie prefetcher and witness from stateDB	2026-04-15 14:59:05 +02:00
Gary Rong	e2c00d6c96	core/state: add hasher interface definition	2026-04-15 14:59:05 +02:00
Gary Rong	00c3b6da6c	core/state: rework trie prefetcher	2026-04-15 14:58:57 +02:00
rjl493456442	ef0f1f96f9	core/state: ignore the root returned in Commit function for simplicity (#34723) StateDB.Commit first commits all storage changes into the storage trie, then updates the account metadata with the new storage root into the account trie. Within StateDB.Commit, the new storage trie root has already been computed and applied as the storage root. This PR explicitly skips the redundant storage trie root assignment for readability.	2026-04-15 11:15:43 +08:00
rjl493456442	eb67d61933	cmd/geth, core/state, tests: rework EIP7610 check (#34718) This PR simplifies the implementation of EIP-7610 by eliminating the need to check storage emptiness during contract deployment. EIP-7610 specifies that contract creation must be rejected if the destination account has a non-zero nonce, non-empty runtime code, or non-empty storage. After EIP-161, all newly deployed contracts are initialized with a nonce of one. As a result, such accounts are no longer eligible as deployment targets unless they are explicitly cleared. However, prior to EIP-161, contracts were initialized with a nonce of zero. This made it possible to end up with accounts that have: - zero nonce - empty runtime code - non-empty storage (created during constructor execution) - non-zero balance These edge-case accounts complicate the storage emptiness check. In practice, contract addresses are derived using one of the following formulas: - `Keccak256(rlp({sender, nonce}))[12:]` - `Keccak256([]byte{0xff}, sender, salt[:], initHash)[12:]` As such, an existing address is not selected as a deployment target unless a collision occurs, which is extremely unlikely. --- Previously, verifying storage emptiness relied on GetStorageRoot. However, with the transition to the block-based access list (BAL), the storage root is no longer available, as computing it would require reconstructing the full storage trie from all mutations of preceding transactions. To address this, this PR introduces a simplified approach: it hardcodes the set of known accounts that have zero nonce, empty runtime code, but non-empty storage and non-zero balance. During contract deployment, if the destination address belongs to this set, the deployment is rejected. This check is applied retroactively back to genesis. Since no address collision events have occurred in Ethereum’s history, this change does not alter existing behavior. Instead, it serves as a safeguard for future state transitions.	2026-04-14 15:54:36 +02:00
cui	2414861d36	core/state: optimize transient storage (#33695) Optimizes the transient storage. Turns it from a map of maps into a single map keyed by <account,slot>.	2026-04-14 15:39:42 +02:00
Felföldi Zsolt	21b19362c2	core/state: fix tracer hook for EIP-7708 burn logs (#34688) This PR fixes https://github.com/ethereum/go-ethereum/issues/34623 by changing the `vm.StateDB` interface: Instead of `EmitLogsForBurnAccounts()` emitting burn logs, `LogsForBurnAccounts() []*types.Log` just returns these logs which are then emitted by the caller. This way when tracing is used, `hookedStateDB.AddLog` will be used automatically and there is no need to duplicate either the burn log logic or the `OnLog` tracing hook.	2026-04-09 09:12:35 +08:00
rjl493456442	0ba4314321	core/state: introduce state iterator interface (#33102 ) In this PR, the Database interface in `core/state` has been extended with one more function: ```go // Iteratee returns a state iteratee associated with the specified state root, // through which the account iterator and storage iterator can be created. Iteratee(root common.Hash) (Iteratee, error) ``` With this additional abstraction layer, the implementation details can be hidden behind the interface. For example, state traversal can now operate directly on the flat state for Verkle or binary trees, which do not natively support traversal. Moreover, state dumping will now prefer using the flat state iterator as the primary option, offering better efficiency. Edit: this PR also fixes a tiny issue in the state dump, marshalling the next field in the correct way.	2026-04-03 10:35:32 +08:00
CPerezz	3da517e239	core/state: fix storage counters in binary trie IntermediateRoot (#34110) Add missing `StorageUpdated` and `StorageDeleted` counter increments in the binary trie fast path of `IntermediateRoot()`.	2026-03-31 15:47:07 +02:00
rjl493456442	c3467dd8b5	core, miner, trie: relocate witness stats (#34106) This PR relocates the witness statistics into the witness itself, making it more self-contained.	2026-03-27 17:06:46 +01:00
Felföldi Zsolt	b87340a856	core, core/vm: implement EIP-7708 (#33645 ) This PR implements EIP-7708 according to the latest "rough consensus": https://github.com/ethereum/EIPs/pull/9003 https://github.com/etan-status/EIPs/blob/fl-ethlogs/EIPS/eip-7708.md --------- Co-authored-by: Jared Wasinger <j-wasinger@hotmail.com> Co-authored-by: raxhvl <raxhvl@users.noreply.github.com> Co-authored-by: Gary Rong <garyrong0905@gmail.com>	2026-03-23 22:29:53 +08:00
CPerezz	77779d1098	core/state: bypass per-account updateTrie in IntermediateRoot for binary trie (#34022 ) ## Summary In binary trie mode, `IntermediateRoot` calls `updateTrie()` once per dirty account. But with the binary trie there is only one unified trie (`OpenStorageTrie` returns `self`), so each call redundantly does per-account trie setup: `getPrefetchedTrie`, `getTrie`, slice allocations for deletions/used, and `prefetcher.used` — all for the same trie pointer. This PR replaces the per-account `updateTrie()` calls with a single flat loop that applies all storage updates directly to `s.trie`. The MPT path is unchanged. The prefetcher trie replacement is guarded to avoid overwriting the binary trie that received updates. This is the phase-1 counterpart to #34021 (H01). H01 fixes the commit phase (`trie.Commit()` called N+1 times). This PR fixes the update phase (`updateTrie()` called N times with redundant setup). Same root cause — unified binary trie operated on per-account — different phases. ## Benchmark (Apple M4 Pro, 500K entries, `--benchtime=10s --count=3`, on top of #34021) \| Metric \| H01 baseline \| H01 + this PR \| Delta \| \|--------\|:------------:\|:-------------:\|:-----:\| \| Approve (Mgas/s) \| 368 \| 414 \| +12.5% \| \| BalanceOf (Mgas/s) \| 870 \| 875 \| +0.6% \| Should be rebased after #34021 is merged.	2026-03-20 15:40:04 +01:00
CPerezz	519a450c43	core/state: skip redundant trie Commit for Verkle in stateObject.commit (#34021 ) ## Summary Bug fix. In Verkle mode, all state objects share a single unified trie (`OpenStorageTrie` returns `self`). During `stateDB.commit()`, the main account trie is committed via `s.trie.Commit(true)`, which calls `CollectNodes` to traverse and serialize the entire tree. However, each dirty account's `obj.commit()` also calls `s.trie.Commit(false)` on the same trie object, redundantly traversing and serializing the full tree once per dirty account. With N dirty accounts per block, this causes N+1 full-tree traversals instead of 1. On a write-heavy workload (2250 SSTOREs), this produces ~131 GB of allocations per block from duplicate NodeSet creation and serialization. It also causes a latent data race from N+1 goroutines concurrently calling `CollectNodes` on shared `InternalNode` objects. This commit adds an `IsVerkle()` early return in `stateObject.commit()` to skip the redundant `trie.Commit()` call. ## Benchmark (AMD EPYC 48-core, 500K entries, `--benchtime=10s --count=3`) \| Metric \| Baseline \| Fixed \| Delta \| \|--------\|----------\|-------\|-------\| \| Approve (Mgas/s) \| 4.16 ± 0.37 \| 220.2 ± 10.1 \| +5190% \| \| BalanceOf (Mgas/s) \| 966.2 ± 8.1 \| 971.0 ± 3.0 \| +0.5% \| \| Allocs/op (approve) \| 136.4M \| 792K \| -99.4% \| Resolves the TODO in statedb.go about the account trie commit being "very heavy" and "something's wonky". --------- Co-authored-by: Guillaume Ballet <3272758+gballet@users.noreply.github.com>	2026-03-17 12:27:29 +01:00
CPerezz	4b915af2c3	core/state: avoid Bytes() allocation in flatReader hash computations (#34025 ) ## Summary Replace `addr.Bytes()` and `key.Bytes()` with `addr[:]` and `key[:]` in `flatReader`'s `Account` and `Storage` methods. The former allocates a copy while the latter creates a zero-allocation slice header over the existing backing array. ## Benchmark (AMD EPYC 48-core, 500K entries, screening `--benchtime=1x`) \| Metric \| Baseline \| Slice syntax \| Delta \| \|--------\|----------\|--------------\|-------\| \| Approve (Mgas/s) \| 4.13 \| 4.22 \| +2.2% \| \| BalanceOf (Mgas/s) \| 168.3 \| 190.0 \| +12.9% \|	2026-03-17 11:42:42 +01:00
rjl493456442	91cec92bf3	core, miner, tests: introduce codedb and simplify cachingDB (#33816)	2026-03-10 08:29:21 +01:00
rjl493456442	dd202d4283	core, ethdb, triedb: add batch close (#33708) Pebble maintains a batch pool to recycle batch objects. Unfortunately, a batch object must be explicitly returned via the `batch.Close` function. This PR extends the batch interface by adding the close function and also invokes batch.Close in some critical code paths. Memory allocation must be measured before merging this change. What's more, it is an open question whether we should apply batch.Close as much as possible in every invocation.	2026-03-04 11:17:47 +01:00
rjl493456442	e636e4e3c1	core/state: track slot reads for empty storage (#33743 ) From the https://eips.ethereum.org/EIPS/eip-7928 > SELFDESTRUCT (in-transaction): Accounts destroyed within a transaction MUST be included in AccountChanges without nonce or code changes. However, if the account had a positive balance pre-transaction, the balance change to zero MUST be recorded. Storage keys within the self-destructed contracts that were modified or read MUST be included as a storage_reads entry. The storage read against the empty contract (zero storage) should also be recorded in the BAL's readlist.	2026-02-24 21:57:50 +08:00
Felix Lange	8e1de223ad	crypto/keccak: vendor in golang.org/x/crypto/sha3 (#33323) The upstream library has removed the assembly-based implementation of keccak. We need to maintain our own library to avoid a performance regression. --------- Co-authored-by: lightclient <lightclient@protonmail.com>	2026-02-03 14:55:27 -07:00
Marius van der Wijden	16a6531ac2	core: miner: reduce allocations in block building (#33375 ) I recently went on a longer flight and started profiling the geth block production pipeline. This PR contains a bunch of individual fixes split into separate commits. I can drop some if necessary. Benchmarking is not super easy, the benchmark I wrote is a bit non-deterministic. I will try to write a better benchmark later ``` goos: linux goarch: amd64 pkg: github.com/ethereum/go-ethereum/miner cpu: Intel(R) Core(TM) Ultra 7 155U │ /tmp/old.txt │ /tmp/new.txt │ │ sec/op │ sec/op vs base │ BuildPayload-14 141.5µ ± 3% 146.0µ ± 6% ~ (p=0.346 n=200) │ /tmp/old.txt │ /tmp/new.txt │ │ B/op │ B/op vs base │ BuildPayload-14 188.2Ki ± 4% 177.4Ki ± 4% -5.71% (p=0.018 n=200) │ /tmp/old.txt │ /tmp/new.txt │ │ allocs/op │ allocs/op vs base │ BuildPayload-14 2.703k ± 4% 2.453k ± 5% -9.25% (p=0.000 n=200) ```	2026-02-03 08:19:16 +01:00
Noisy	a179ccf6f0	core/state: add bounds check in heap eviction loop (#33712) Add len(h) > 0 check before accessing h[0] to prevent potential panic and align with existing heap access patterns in txpool, p2p, and mclock packages.	2026-01-29 21:08:04 +08:00
CPerezz	1e9dfd5bb0	core: standardize slow block JSON output for cross-client metrics (#33655) Implement standardized JSON format for slow block logging to enable cross-client performance analysis and protocol research. This change is part of the Cross-Client Execution Metrics initiative proposed by Gary Rong: https://hackmd.io/dg7rizTyTXuCf2LSa2LsyQ The standardized metrics enabled data-driven analysis like the EIP-7907 research: https://ethresear.ch/t/data-driven-analysis-on-eip-7907/23850 JSON format includes: - block: number, hash, gas_used, tx_count - timing: execution_ms, total_ms - throughput: mgas_per_sec - state_reads: accounts, storage_slots, bytecodes, code_bytes - state_writes: accounts, storage_slots, bytecodes - cache: account/storage/code hits, misses, hit_rate This should come after merging #33522 --------- Co-authored-by: Gary Rong <garyrong0905@gmail.com>	2026-01-28 20:58:41 +08:00
rjl493456442	c2595381bf	core: extend the code reader statistics (#33659) This PR extends the statistics of contract code read by adding these fields: - CacheHitBytes: the total number of bytes served by cache - CacheMissBytes: the total number of bytes read on cache miss - CodeReadBytes: the total number of bytes for contract code read	2026-01-26 11:25:53 +01:00
rjl493456442	1022c7637d	core, eth, internal, triedb/pathdb: enable eth_getProofs for history (#32727) This PR enables the `eth_getProofs` endpoint against the historical states.	2026-01-22 09:19:27 +08:00
forkfury	2eb1ccc6c4	core/state: ensure deterministic hook emission order in Finalise (#33644) Fixes #33630 Sort self-destructed addresses before emitting hooks in Finalise() to ensure deterministic ordering and fix flaky test TestHooks_OnCodeChangeV2. --------- Co-authored-by: jwasinger <j-wasinger@hotmail.com>	2026-01-20 20:36:07 +08:00
jwasinger	715bf8e81e	core: invoke selfdestruct tracer hooks during finalisation (#32919) The core part of this PR that we need to adopt is to move the code and nonce change hook invocations to occur at tx finalization, instead of when the selfdestruct opcode is called. Additionally: * remove `SelfDestruct6780` now that it is essentially the same as `SelfDestruct` just gated by `is new contract` * don't duplicate `BalanceIncreaseSelfdestruct` (transfer to recipient of selfdestruct) in the hooked statedb and in the opcode handler for the selfdestruct opcode. * balance is burned immediately when the beneficiary of the selfdestruct is the sender, and the contract was created in the same transaction. Previously we emit two balance increases to the recipient (see above point), and a balance decrease from the sender. --------- Co-authored-by: Sina Mahmoodi <itz.s1na@gmail.com> Co-authored-by: Gary Rong <garyrong0905@gmail.com> Co-authored-by: lightclient <lightclient@protonmail.com>	2026-01-16 15:10:08 -07:00
rjl493456442	9623dcbca2	core/state: add cache statistics of contract code reader (#33532)	2026-01-08 11:48:45 +08:00
Ng Wei Han	01b39c96bf	core/state, core/tracing: new state update hook (#33490) ### Description Add a new `OnStateUpdate` hook which gets invoked after state is committed. ### Rationale For our particular use case, we need to obtain the state size metrics at every single block when fully syncing from genesis. With the current state sizer, whenever the node is stopped, the background process must be freshly initialized. During this re-initialization, it can skip some blocks while the node continues executing blocks, causing gaps in the recorded metrics. Using this state update hook allows us to customize our own data persistence logic, and we would never skip blocks upon node restart. --------- Co-authored-by: Gary Rong <garyrong0905@gmail.com>	2026-01-08 11:07:19 +08:00
Bashmunta	25439aac04	core/state/snapshot: fix storageList memory accounting (#33505)	2025-12-31 09:40:43 +08:00
Guillaume Ballet	3f641dba87	trie, go.mod: remove all references to go-verkle and go-ipa (#33461 ) In order to reduce the amount of code that is embedded into the keeper binary, I am removing all the verkle code that uses go-verkle and go-ipa. This will be followed by further PRs that are more like stubs to replace code when the keeper build is detected. I'm keeping the binary tree of course. This means that you will still see `isVerkle` variables all over the codebase, but they will be renamed when code is touched (i.e. this is not an invitation for 30+ AI slop PRs). --------- Co-authored-by: Gary Rong <garyrong0905@gmail.com>	2025-12-30 20:44:04 +08:00
rjl493456442	ffe9dc97e5	core: add code read statistics (#33442)	2025-12-18 17:24:02 +08:00
Ng Wei Han	15f52a2937	core/state: fix code existence not marked correctly (#33415) When iterating over a map with value types in Go, the loop variable is a copy. In `markCodeExistence`, assigning to `code.exists` modified only the local copy, not the actual map entry, causing the existence flag to always remain false. This resulted in overcounting contract codes in state size statistics, as codes that already existed in the database were incorrectly counted as new. Fix by changing `codes` from `map[common.Address]contractCode` to `map[common.Address]*contractCode`, so mutations apply directly to the struct.	2025-12-15 13:54:26 +08:00
Daniel Liu	3a5560fa98	core/state: make test output message readable (#33400)	2025-12-13 11:27:00 +08:00
Ng Wei Han	9a346873b8	core/state: fix incorrect contract code state metrics (#33376) ## Description This PR fixes incorrect contract code state metrics by ensuring duplicate codes are not counted towards the reported results. ## Rationale The contract code metrics don't consider database deduplication. The current implementation assumes that the results are only slightly inaccurate, but this is not true, especially for data collection efforts that started from the genesis block.	2025-12-10 11:33:59 +08:00
rjl493456442	d3679c2f2e	core/state: export statistics to metrics (#33254) This PR exposes the state size statistics to the metrics, making them easier to demonstrate. Note that the contract code included in the metrics is not de-duplicated, so the reported size will appear larger than the actual storage footprint.	2025-12-02 16:28:51 +01:00
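
Note on commit 15f52a2937 (core/state: fix code existence not marked correctly): the message describes a general Go pitfall, namely that ranging over a map of struct values yields a copy of each value. The sketch below reproduces that behaviour in isolation. It is not the go-ethereum code: the `contractCode` type here is a cut-down stand-in, the `string` key replaces `common.Address`, and `markByValue` / `markByPointer` are hypothetical helper names chosen for illustration.

```go
package main

import "fmt"

// contractCode is a simplified stand-in for the struct named in the
// commit message; only the flag relevant to the bug is kept.
type contractCode struct {
	exists bool
}

// markByValue mirrors the buggy shape: the range variable is a copy of
// the map value, so the assignment never reaches the map entry.
func markByValue(codes map[string]contractCode) {
	for _, code := range codes {
		code.exists = true // modifies the local copy only
	}
}

// markByPointer mirrors the fixed shape: the map stores pointers, so the
// range variable points at the same struct the map entry refers to.
func markByPointer(codes map[string]*contractCode) {
	for _, code := range codes {
		code.exists = true // modifies the shared struct
	}
}

func main() {
	byValue := map[string]contractCode{"a": {}}
	markByValue(byValue)
	fmt.Println(byValue["a"].exists) // false: the map entry is unchanged

	byPtr := map[string]*contractCode{"a": {}}
	markByPointer(byPtr)
	fmt.Println(byPtr["a"].exists) // true: the mutation is visible
}
```

An alternative that keeps value types would be to write the modified copy back with `codes[key] = code` inside the loop; the commit message states that the pointer-valued map was the approach taken.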


606 commits