Introduce ThreadBlockBuildingContext, remove old CachedReads. #544

dvush · 2025-04-08T14:58:29Z

The idea behind ThreadBlockBuildingContext is to have a place that is owned by a particular thread (for example, max profit ordering builder) that can be used as a scratchpad for caching without doing global caching shared between all threads.

Currently we have only one place for caches: BlockBuildingContext. It is global and shared between all builders, the top-of-block simulation, and the finalization thread. This approach solves many caching problems; for example, we use it for eth-sparse-mpt root hash caching.

But sometimes we would like a cache that is local to the current thread, to avoid the overhead of a mutex and of sharing state between multiple threads.

I've seen the need for caches like this multiple times. This commit adds support for local caches like this.

The first thing moved to this new two-cache setup is cached reads. We had a lot of trouble passing the CachedReads struct around, and it leaked into places where it does not belong, such as the BlockBuildingHelper trait or backtesting code. Here we move CachedReads into this uniform setup, which simplifies cached reads state handling.
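To make the split between the two kinds of cache concrete, here is a minimal sketch. It is not the rbuilder code: `SharedContext`, `LocalContext`, `slow_read`, `read_cached`, and `run_builders` are hypothetical names invented for illustration, and plain `u64` keys and values stand in for real state reads. It only shows the general pattern: the shared cache needs a lock, while the per-thread cache is accessed through `&mut`.

```rust
use std::collections::HashMap;
use std::sync::{Arc, Mutex};
use std::thread;

/// Hypothetical stand-in for the global context: shared by all threads,
/// so its cache must sit behind a lock.
#[derive(Default)]
struct SharedContext {
    root_hash_cache: Mutex<HashMap<u64, u64>>,
}

/// Hypothetical stand-in for the per-thread context: owned by one builder
/// thread, so `&mut` access is enough and no lock is needed.
#[derive(Default)]
struct LocalContext {
    cached_reads: HashMap<u64, u64>,
}

/// Pretend "expensive" state read (assumption: any pure function works here).
fn slow_read(key: u64) -> u64 {
    key.wrapping_mul(31).wrapping_add(7)
}

/// Read through the thread-local cache; only misses reach `slow_read`.
fn read_cached(local: &mut LocalContext, key: u64) -> u64 {
    *local.cached_reads.entry(key).or_insert_with(|| slow_read(key))
}

/// Spawns `n_threads` builder threads. Each one owns its own `LocalContext`
/// and returns how many entries its local cache ended up with.
fn run_builders(n_threads: usize) -> Vec<usize> {
    let shared = Arc::new(SharedContext::default());
    let handles: Vec<_> = (0..n_threads)
        .map(|_| {
            let shared = Arc::clone(&shared);
            thread::spawn(move || {
                let mut local = LocalContext::default();
                for key in 0..10 {
                    // Local cache: no synchronization involved.
                    let value = read_cached(&mut local, key % 5);
                    // Global cache: every access takes the lock.
                    shared.root_hash_cache.lock().unwrap().insert(key % 5, value);
                }
                local.cached_reads.len()
            })
        })
        .collect();
    handles.into_iter().map(|h| h.join().unwrap()).collect()
}
```

Calling `run_builders(3)` gives each thread its own five-entry local cache, while all three threads contend on the single shared map.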

📝 Summary

💡 Motivation and Context

✅ I have completed the following steps:

Run make lint
Run make test
Added tests (if applicable)

Copilot

Copilot reviewed 26 out of 26 changed files in this pull request and generated no comments.

Comments suppressed due to low confidence (2)

crates/rbuilder/src/building/builders/ordering_builder.rs:166

[nitpick] Consider standardizing the placement and naming of the 'local_ctx' parameter across functions (e.g., always placing it immediately after the BlockBuildingContext) and adding a brief inline comment describing its purpose to improve code consistency and readability.

let mut local_ctx = ThreadBlockBuildingContext::default();

crates/rbuilder/src/building/order_commit.rs:296

[nitpick] Consider adding inline documentation for the new lifetime parameters ('c' and 'd') in PartialBlockFork so that their purpose and relation to the thread-local context are clearer for future maintainers.

pub struct PartialBlockFork<'a, 'b, 'c, 'd, Tracer: SimulationTracer> {


ZanCorDX · 2025-04-08T19:37:00Z

Is it possible to hide the ThreadBlockBuildingContext inside the BlockBuildingHelperFromProvider so we don't have to pass it to edit the block since it's always the same?

dvush · 2025-04-08T20:36:39Z

I decided against it because we move the block building helper to another thread for sealing. That would go against the idea of having a fixed cache that is:

tied to some thread
never cloned during the slot
only filled with data, so it only grows

If you put the thread-local cache inside the block building helper, we would need to clone it every time we finish a block building attempt.

dvush · 2025-04-08T20:45:18Z

The only way I see to make it work would be to put a Mutex<Option<Arc<ThreadBlockBuildingContext>>> into the block building helper and add the following flow: you set the context while the helper is used on the current thread, and you remove it when you send the helper to another thread. This is error-prone and ugly. Passing &mut to every method that requires it seems much better, and you are actually protected by the borrow checker, since &mut is the correct way to hold this context in your code.
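The trade-off discussed above can be sketched as follows. This is an illustrative example under stated assumptions, not the actual rbuilder types: `LocalContext`, `Helper`, `commit_order`, `seal`, and `build_and_seal` are hypothetical names. The helper holds no cache and is moved to a sealing thread after each attempt, while the scratchpad stays on the building thread and is only ever borrowed with `&mut`.

```rust
use std::collections::HashMap;
use std::thread;

/// Hypothetical per-thread scratchpad; it is never cloned and only grows.
#[derive(Default)]
struct LocalContext {
    cached_reads: HashMap<u64, u64>,
}

/// Hypothetical block building helper. It holds no cache, so it can be
/// moved to another thread without touching the builder's scratchpad.
#[derive(Default)]
struct Helper {
    orders: Vec<u64>,
}

impl Helper {
    /// Methods that need the cache borrow it mutably for the call only.
    fn commit_order(&mut self, local: &mut LocalContext, order: u64) {
        let key = order % 4;
        let value = *local.cached_reads.entry(key).or_insert(key * 100);
        self.orders.push(value);
    }

    /// Consumes the helper, as sealing on another thread would.
    fn seal(self) -> usize {
        self.orders.len()
    }
}

/// Runs `attempts` block building attempts on the current thread and seals
/// each one elsewhere. Returns (sealed block sizes, final local cache size).
fn build_and_seal(attempts: usize) -> (Vec<usize>, usize) {
    let mut local = LocalContext::default();
    let mut sealed = Vec::new();
    for _ in 0..attempts {
        let mut helper = Helper::default();
        for order in 0..8 {
            helper.commit_order(&mut local, order);
        }
        // The helper is moved to the sealing thread; `local` stays here
        // and is reused, uncloned, by the next attempt.
        let handle = thread::spawn(move || helper.seal());
        sealed.push(handle.join().unwrap());
    }
    (sealed, local.cached_reads.len())
}
```

Because `local` is never captured by the spawned closure, the compiler guarantees it cannot leave the building thread, which is the protection the `&mut` approach relies on.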

Copilot AI review requested due to automatic review settings April 8, 2025 14:58

dvush requested review from ZanCorDX and ferranbt as code owners April 8, 2025 14:58

Copilot AI reviewed Apr 8, 2025


dvush force-pushed the local_caches branch 2 times, most recently from f316ecf to d809ab8 Compare April 8, 2025 16:35

dvush force-pushed the local_caches branch from d809ab8 to c81d23c Compare April 8, 2025 16:58

ZanCorDX approved these changes Apr 9, 2025


dvush merged commit dc7d3fc into develop Apr 9, 2025
4 checks passed

dvush deleted the local_caches branch April 9, 2025 12:51
