feat: implement search_transactions_before and search_transactions_after #13621

Open · wants to merge 11 commits into main

Conversation


@caglaryucekaya caglaryucekaya commented Jan 2, 2025

Will close #13499

@caglaryucekaya caglaryucekaya (Contributor, Author) commented

@mattsse I have some questions. In the issue you wrote that we should enforce a page size limit of something like 100 blocks. However, according to the documentation the page size argument is not a number of blocks but a number of transactions, so we can put the limit on that instead.

One problem with my current implementation is that it processes blocks one by one until all the traces are fetched, because we don't know in advance in which block we will reach page_size transactions. Since the blocks are not processed in parallel, if the user requests a search starting from the genesis block and the searched addresses only appear much later, the search can take hours.

I will also look into your suggestion to use AccountReader for optimization.
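For reference, a minimal sketch of the sequential shape described above; every name here (Tx, trace_block_for_address) is a hypothetical placeholder, not the actual reth API:

type Tx = u64; // stand-in for a traced transaction

// Placeholder for running the block tracer and filtering by address.
async fn trace_block_for_address(_block: u64, _address: &str) -> Vec<Tx> {
    Vec::new()
}

// Walk forward one block at a time; we cannot know in advance which block
// will finally bring the result count up to page_size, and with no
// parallelism a search that starts at genesis can take hours.
async fn search_sequential(address: &str, start_block: u64, tip: u64, page_size: usize) -> Vec<Tx> {
    let mut results = Vec::new();
    let mut block = start_block;
    while block <= tip && results.len() < page_size {
        results.extend(trace_block_for_address(block, address).await);
        block += 1;
    }
    results
}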

@mattsse mattsse (Collaborator) left a comment

cool, this is a great start

I left a few questions because the page settings are a bit confusing to me

page size argument is not blocks but the number of transactions

I see, then we should perhaps try to perform tracing of multiple blocks in parallel by spawning jobs?

if the user e.g. request the search beginning from the genesis block and the searched addresses appear far later, the search can take hours.

yeah we can definitely look into processing block tracing in parallel after we have the initial draft

let mut txs_with_receipts = TransactionsWithReceipts {
    txs: Vec::default(),
    receipts: Vec::default(),
    first_page: false,

mattsse (Collaborator)

unclear what first_page means

caglaryucekaya (Contributor, Author)

First page means it's the page with the most recent transactions, meaning we have traced up to the tip of the chain.
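A hedged reading of that flag as a tiny helper (names are illustrative, not taken from the codebase):

// The "first page" is the newest one: the search has already traced up to
// the chain tip, so there are no more recent transactions to return.
fn is_first_page(highest_block_traced: u64, chain_tip: u64) -> bool {
    highest_block_traced >= chain_tip
}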

Comment on lines +341 to +342
    from_block: None,
    to_block: None,

mattsse (Collaborator)

I think this is where we'd need to configure the block_number and perhaps the page_size?

because worst case this would trace the entire chain
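A sketch of that concern, assuming a simplified stand-in for the filter type (the real reth type and any fields beyond from_block/to_block are not shown): leaving both bounds as None spans the whole chain, while filling them in per batch keeps each tracing pass bounded.

struct BlockRangeFilter {
    from_block: Option<u64>,
    to_block: Option<u64>,
}

// Bound one batch: trace at most `batch_len` blocks starting at `batch_start`.
fn filter_for_batch(batch_start: u64, batch_len: u64) -> BlockRangeFilter {
    BlockRangeFilter {
        from_block: Some(batch_start),
        to_block: Some(batch_start + batch_len - 1),
    }
}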

caglaryucekaya (Contributor, Author)

So do you think we should put a limit on the number of blocks traced, or on the number of transactions? If we limit the number of transactions, there's still a chance of tracing the entire chain, e.g. if the account has 5 transactions in total and the user requests 10.

mattsse (Collaborator)

unfortunately, we don't have another way to determine this, so this will always be possible. We should do some chunking instead so we limit how many blocks we trace at once, something like a https://docs.rs/futures/latest/futures/stream/struct.FuturesUnordered.html with limited capacity where we push new block tasks
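A minimal sketch of that idea, assuming a hypothetical trace_block helper (none of these names are the actual reth APIs): keep a bounded number of block-tracing futures in a FuturesUnordered, refill it as tasks complete, and stop scheduling once enough matches have been collected. Results arrive in completion order, so a real implementation would also need to keep them ordered by block number.

use futures::stream::{FuturesUnordered, StreamExt};

const MAX_IN_FLIGHT: usize = 16; // illustrative concurrency limit

// Placeholder for spawning the real block-tracing job.
async fn trace_block(_block: u64) -> Vec<u64> {
    Vec::new()
}

async fn search_with_bounded_tracing(start: u64, end: u64, page_size: usize) -> Vec<u64> {
    let mut results = Vec::new();
    let mut blocks = start..=end;
    let mut in_flight = FuturesUnordered::new();

    // Prime the set with the first MAX_IN_FLIGHT blocks.
    for block in blocks.by_ref().take(MAX_IN_FLIGHT) {
        in_flight.push(trace_block(block));
    }

    while let Some(matches) = in_flight.next().await {
        results.extend(matches);
        if results.len() >= page_size {
            break;
        }
        // Refill: schedule the next block as soon as one finishes.
        if let Some(block) = blocks.next() {
            in_flight.push(trace_block(block));
        }
    }
    results
}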

@caglaryucekaya caglaryucekaya (Contributor, Author) Jan 6, 2025

What you mean is processing blocks in batches of something like 100 or 1000, right? E.g. processing 1000 blocks, waiting until they're all complete, and continuing with another 1000 blocks if we haven't reached the page size yet. In that case using try_join_all looks like a better idea since we have to wait for all 1000 blocks to complete anyway. It's also how it's done in trace_filter.
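A rough sketch of that batch shape with try_join_all, using placeholder names (not the actual trace_filter code): trace a fixed-size chunk of blocks concurrently, wait for the whole chunk, then decide whether to continue.

use futures::future::try_join_all;

// Placeholder for the real per-block tracing future.
async fn trace_block(block: u64) -> Result<Vec<u64>, ()> {
    let _ = block;
    Ok(Vec::new())
}

async fn search_in_batches(start: u64, end: u64, page_size: usize) -> Result<Vec<u64>, ()> {
    const BATCH: u64 = 1000;
    let mut results = Vec::new();
    let mut from = start;
    while from <= end && results.len() < page_size {
        let to = (from + BATCH - 1).min(end);
        // Every block in the batch is traced concurrently and awaited together.
        let batch = try_join_all((from..=to).map(trace_block)).await?;
        results.extend(batch.into_iter().flatten());
        from = to + 1;
    }
    Ok(results)
}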

@caglaryucekaya caglaryucekaya (Contributor, Author) Jan 6, 2025

On second thought, FuturesUnordered will be useful

caglaryucekaya (Contributor, Author)

Okay, I changed the implementation to use FuturesUnordered and it works much faster now. I used 1000-block batches for now but we can change that. I think we can also put a limit on page_size to prevent requesting an unreasonable number of transactions. What do you think would be a good number for that?

@github-actions github-actions bot added the S-stale This issue/PR is stale and will close with no further activity label Jan 31, 2025
@github-actions github-actions bot closed this Feb 7, 2025
@mattsse mattsse added C-enhancement New feature or request A-rpc Related to the RPC implementation M-prevent-stale Prevents old inactive issues/PRs from being closed due to inactivity and removed S-stale This issue/PR is stale and will close with no further activity labels Feb 7, 2025
@mattsse mattsse reopened this Feb 7, 2025

sealer3 commented Feb 11, 2025

@mattsse I have some questions. In the issue you wrote that we should enforce a page size limit of something like 100 blocks. However, according to the documentation the page size argument is not a number of blocks but a number of transactions, so we can put the limit on that instead.

One problem with my current implementation is that it processes blocks one by one until all the traces are fetched, because we don't know in advance in which block we will reach page_size transactions. Since the blocks are not processed in parallel, if the user requests a search starting from the genesis block and the searched addresses only appear much later, the search can take hours.

I will also look into your suggestion to use AccountReader for optimization.

Note the gotcha from the Otterscan RPC API docs on this one:

There is a small gotcha regarding pageSize. If there are fewer results than pageSize, they are just returned as is.

But if there are more than pageSize results, they are capped by the last found block. For example, let's say you are searching for Uniswap Router address with a pageSize of 25, and it already found 24 matches. It then looks at the next block containing this address's occurrences and there are 5 matches inside the block. They are all returned, so it returns 30 transaction results. The caller code should be aware of this.

And feel free to reference the Anvil implementation, which was correct (at least the last time I went through the code): https://github.com/foundry-rs/foundry/blob/master/crates/anvil/src/eth/otterscan/api.rs

The default Otterscan page size is 25, so make sure not to set a limit below that.
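For illustration only, a tiny sketch of the capping rule described in the gotcha above (not taken from any of the implementations mentioned): the loop only stops at a block boundary, so the block that crosses page_size is still returned in full and the response may contain more than page_size transactions.

// `blocks` holds, per block, the matching transactions found in it.
fn collect_with_gotcha(blocks: &[Vec<u64>], page_size: usize) -> Vec<u64> {
    let mut results = Vec::new();
    for block_matches in blocks {
        if results.len() >= page_size {
            break; // stop only between blocks, never in the middle of one
        }
        results.extend(block_matches.iter().copied());
    }
    results
}

With page_size = 25 and 24 matches already collected, a next block containing 5 matches is still appended whole, so the caller gets more than 25 results back.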

@caglaryucekaya caglaryucekaya (Contributor, Author) commented

Note the gotcha from the Otterscan RPC API docs on this one:

I was aware of the gotcha and implemented accordingly.

And feel free to reference the Anvil implementation, which was correct (at least the last time I went through the code): https://github.com/foundry-rs/foundry/blob/master/crates/anvil/src/eth/otterscan/api.rs

The default Otterscan page size is 25, so make sure not to set a limit below that.

I wasn't aware of the Foundry implementation; I will check it and make changes if necessary. And I will use a limit above 25 transactions. Thank you very much!

@caglaryucekaya caglaryucekaya (Contributor, Author) commented

@sealer3 Do you know why the Anvil implementation only considers post-fork blocks?

…plement-search-transactions-before-and-after

sealer3 commented Feb 13, 2025

According to the original pull request foundry-rs/foundry#5414, the stated reason is to be able to iterate through all blocks in memory for ots_searchTransactionsBefore, ots_searchTransactionsAfter, ots_getContractCreator, and other Otterscan RPC methods that would otherwise require special indexing from the node.

Since Anvil is designed to work on any backend node (for forking), it would make sense for them not to have to make potentially thousands of RPC requests to a backend node that might be far away on a remote machine to gather the required information for a response.

@caglaryucekaya caglaryucekaya (Contributor, Author) commented

@mattsse I also added the implementation for search_transactions_before. This is waiting for your review now.

Labels
A-rpc Related to the RPC implementation
C-enhancement New feature or request
M-prevent-stale Prevents old inactive issues/PRs from being closed due to inactivity
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Implement searchTransactionsBefore and searchTransactionsAfter
3 participants