Execution tracing: GraphQL query to get storage inputs for past blocks #2491

Dentosal · 2024-12-11T06:48:27Z

Description

To support local debugging and execution tracing, we need to add an API that returns the state that VM needs to execute transactions. This is done by recording all database accesses during tx validation, called storage read replay (bikeshedding in progress). This data is then exposed as-is through GraphQL. The API is non-trivial to consume, but a seperate client is proviced, see https://github.com/FuelLabs/execution-trace

The implementation exposes database table names and representations directly. Maintaining backwards compatibility with this could turn out to be quite hard.

Requires --historical-execution flag to enable, as this is otherwise quite expensive.

Follow-ups:

Execution tracing: dry run support #2625

Example query:

mutation {
  storageReadReplay(height:"4") { column, key, value }
}

Open questions:

What should the query cost be?

Checklist

Breaking changes are clearly marked as such in the PR description and changelog: No breaking changes!
New behavior is reflected in tests
The specification matches the implemented behavior: This is outside spec scope.

Before requesting review

I have reviewed the code myself
I have created follow-up issues caused by this PR and linked them here

After merging, notify other teams

See the VM PR FuelLabs/fuel-vm#881.

- Related to #1432 # Release notes In this release, we: - Changed `ABIDecoder` methods to take `std::io::Read` instead of `&[u8]`, allowing it to be used in a streaming manner. # Summary `ABIDecoder` methods take `bytes: impl std::io::Read` instead of `bytes: &[u8]`. This allows decoding abi types without having to know the size in advance. This is particularly useful when reading them directly from VM memory, which will be used by the indexer after FuelLabs/fuel-core#2491 is done. # Breaking Changes `ABIDecoder` methods take `bytes: impl std::io::Read` instead of `bytes: &[u8]`. Callers using arrays or `Vec` must change the argument from `&value` to `value.as_slice()`. # Checklist - [x] All **changes** are **covered** by **tests** (or not applicable) - [x] All **changes** are **documented** (or not applicable) - [x] I **reviewed** the **entire PR** myself (preferably, on GH UI) - [x] I **described** all **Breaking Changes** (or there's none) --------- Co-authored-by: hal3e <git@hal3e.io> Co-authored-by: Ahmed Sagdati <37515857+segfault-magnet@users.noreply.github.com> Co-authored-by: segfault-magnet <ahmed.sagdati.ets@gmail.com>

crates/client/assets/schema.sdl

xgreenx · 2025-02-12T20:57:25Z

crates/client/assets/schema.sdl

@@ -762,6 +762,10 @@ type Mutation {
 	"""
 	dryRun(txs: [HexString!]!, utxoValidation: Boolean, gasPrice: U64, blockHeight: U32): [DryRunTransactionExecutionStatus!]!
 	"""
+	Get execution trace for an already-executed transaction.
+	"""
+	storageReadReplay(height: U32!): [StorageReadReplayEvent!]!


I don't know, maybe @Voxelot knows. But I think it should be just Query since it doesn't modify the internal state and is used on the state at height, so it is deterministic.

crates/services/producer/src/block_producer.rs

xgreenx · 2025-02-12T21:01:02Z

crates/services/producer/src/ports.rs

+    fn get_full_block(&self, height: &BlockHeight) -> StorageResult<Block> {
+        let block = self.get_block(height)?;
+        let transactions = block
+            .transactions()
+            .iter()
+            .map(|id| self.get_transaction(id).map(|tx| tx.into_owned()))
+            .collect::<Result<Vec<_>, _>>()?;
+        Ok(block.into_owned().uncompress(transactions))
+    }


In the implementation of the BlockProducerDatabase port in fuel-core.

crates/services/upgradable-executor/src/storage_access_recorder.rs

crates/types/src/services/executor.rs

tests/tests/storage_read_replay.rs

Co-authored-by: Green Baneling <XgreenX9999@gmail.com>

xgreenx · 2025-02-14T04:36:06Z

crates/services/upgradable-executor/src/executor.rs

+        let output = instance.run(module)?;
+
+        match output {
+            ReturnType::ExecutionV0(result) => {
+                let _ = convert_from_v0_execution_result(result)?;
+            }
+            ReturnType::ExecutionV1(result) => {
+                let _ = convert_from_v1_execution_result(result)?;
+            }
+            ReturnType::Validation(result) => {
+                let _ = result?;
+            }
+        }


I think for simplicity and performance we can avoid decoding the output type

Also, maybe for performance we want to avoid deserialization of the output in the instance.run, but it is up to you. Because it required a new method like run_without_result

I think we still need to ensure the execution was successful, and my understanding is that this is the way to do that. I expect ther performance impact of decoding the type to be neglible anyway.

Usually you re-execute blocks that already were executed and validated in the past, so the result should be correct, unless storage has different state.

xgreenx · 2025-02-14T04:38:31Z

crates/services/upgradable-executor/src/storage_access_recorder.rs

+    S: KeyValueInspect,
+{
+    pub storage: S,
+    pub record: Arc<Mutex<Vec<StorageReadReplayEvent>>>,


Instead of Arc<Mutex<_>> I think you could use RefCell. I will make StorageAccessRecorder non Send, but I don't think that we have cases where you need it to be Send. But maybe I'm not right

We require it to be Send here:

fuel-core/crates/services/upgradable-executor/src/instance.rs

Line 304 in 686f8ce

S: KeyValueInspect<Column = Column> + Send + Sync + 'static,

tests/tests/lib.rs

crates/services/executor/src/executor.rs

crates/fuel-core/src/service/config.rs

crates/fuel-core/src/graphql_api.rs

crates/services/producer/src/block_producer.rs

crates/services/upgradable-executor/src/executor.rs

acerone85

Thanks for explanations and reply to all comments.
Happy to approve

xgreenx · 2025-02-18T15:13:32Z

crates/services/upgradable-executor/src/executor.rs

+        let output = instance.run(module)?;
+
+        match output {
+            ReturnType::ExecutionV0(result) => {
+                let _ = convert_from_v0_execution_result(result)?;
+            }
+            ReturnType::ExecutionV1(result) => {
+                let _ = convert_from_v1_execution_result(result)?;
+            }
+            ReturnType::Validation(result) => {
+                let _ = result?;
+            }
+        }


Usually you re-execute blocks that already were executed and validated in the past, so the result should be correct, unless storage has different state.

Dentosal added 2 commits December 11, 2024 08:39

Initial support for historical execution tracing

676a08b

Update schema.sdl

1e018f1

Dentosal added enhancement New feature or request fuel-block-executor wasm WASM-based block execution labels Dec 11, 2024

Dentosal self-assigned this Dec 11, 2024

Dentosal mentioned this pull request Dec 11, 2024

Execution trace recording FuelLabs/fuel-vm#881

Closed

8 tasks

Dentosal and others added 7 commits December 11, 2024 08:50

Fix typo exection => execution

5f6c079

Make execution trace endpoint to work on block level

7c8548f

Fix cli arguments

6ebe050

Various fixes and work towards Rust client support

cd0704a

Merge branch 'master' into dento/execution-trace

63a0016

WIP

f76c96f

Change from tracing to storage read replay recording

3ba2bc7

Dentosal force-pushed the dento/execution-trace branch from 8d80d7b to 3ba2bc7 Compare December 19, 2024 15:18

Dentosal added 2 commits December 30, 2024 14:40

Merge branch 'master' into dento/execution-trace

88a2730

Merge branch 'master' into dento/execution-trace

f558296

Dentosal mentioned this pull request Jan 6, 2025

feat!: streaming decoder FuelLabs/fuels-rs#1564

Merged

4 tasks

rafal-ch added the release label Jan 10, 2025

xgreenx removed the release label Jan 10, 2025

Add wasm executor support

5abde1c

Dentosal changed the title ~~Execution tracing for past blocks~~ Execution tracing: add GraphQL endpoint to get storage inputs for past blocks Jan 15, 2025

Dentosal changed the title ~~Execution tracing: add GraphQL endpoint to get storage inputs for past blocks~~ Execution tracing: GraphQL query to get storage inputs for past blocks Jan 15, 2025

Move behind --debug flag

6827e0d

Dentosal mentioned this pull request Jan 23, 2025

Execution tracing: dry run support #2625

Open

Dentosal added 4 commits January 23, 2025 11:35

Adjust query cost

acb9bab

Add debug flag decription

bb6c9dd

Approve snapshot changes

75ff219

Add a test case

73cb7a6

Dentosal added 6 commits February 12, 2025 12:07

Use u32 column id instead of column name string

1871919

Update rest of the column fields to U32

a23c2bb

Change the test to use column number

684e8cc

fmt

233d29a

clippy

b255c46

Fix missing import in test

de6b170

xgreenx reviewed Feb 12, 2025

View reviewed changes

Dentosal and others added 7 commits February 13, 2025 22:23

Update tests/tests/storage_read_replay.rs

35fa7df

Co-authored-by: Green Baneling <XgreenX9999@gmail.com>

Limit debug print size of replay event keys and values to

9c78b11

Address PR review comments

b4a1d08

Merge branch 'master' into dento/execution-trace

3659ff0

Fix tests to use query instead of mutation

075e960

Move get_full_block to fuel-core

582a30c

Fix tests

c080532

Dentosal requested a review from xgreenx February 14, 2025 01:24

xgreenx previously approved these changes Feb 14, 2025

View reviewed changes

Merge branch 'master' into dento/execution-trace

8ea800b

Dentosal dismissed xgreenx’s stale review via 8ea800b February 17, 2025 14:26