feat: add `StateSync` component #650

tomyrd · 2025-01-02T21:46:40Z

This PR adds a new StateSync component that encompasses all the logic for the client's synchronization with the node. The idea is that this component can be instantiated separately from the client and can be run without modifying the state of the client. The component exposes a sync_state method that will do the necessary RPC calls and return a StateSyncUpdate with all the changes that should be applied to the store.

`state_sync` method

The new state_sync method no longer exposes the "steps" of the rpc response. It returns all the necessary updates to be up to date with the node in a single call.

Additionally, the component doesn't have direct access to the store. This is because the sync process is now independent of the state of the store: users can pass the information they want the component to use and simulate syncs without the store changing. For this reason, the client will gather all the needed information up front and give it to the component, including the headers of every tracked account, all the unspent notes and the current partial MMR. The only way the store can be accessed during the sync process is through the callbacks explained in the next section.
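The decoupling described above could be sketched roughly as follows. This is a simplified, synchronous illustration and not the code in this PR: `NodeRpc`, `SyncResponse`, `SyncInputs`, `MockNode` and all field names are hypothetical stand-ins (the real RPC API is async and the real types carry far more data). Only the names `StateSync`, `StateSyncUpdate` and `sync_state` come from the description.

```rust
use std::sync::Arc;

/// Hypothetical stand-in for the node RPC API.
pub trait NodeRpc {
    /// Returns the next sync step after `block_num`.
    fn sync_state(&self, block_num: u32) -> SyncResponse;
}

/// Hypothetical, heavily simplified response of a single sync step.
pub struct SyncResponse {
    pub block_num: u32,
    pub chain_tip: u32,
    pub committed_note_ids: Vec<u64>,
}

/// Everything the caller gathers up front. The component never reads the store.
pub struct SyncInputs {
    pub current_block_num: u32,
    pub tracked_note_ids: Vec<u64>,
}

/// All the changes the caller should apply to its store afterwards.
#[derive(Debug, Default, PartialEq)]
pub struct StateSyncUpdate {
    pub block_num: u32,
    pub committed_note_ids: Vec<u64>,
}

pub struct StateSync {
    rpc_api: Arc<dyn NodeRpc>,
}

impl StateSync {
    pub fn new(rpc_api: Arc<dyn NodeRpc>) -> Self {
        Self { rpc_api }
    }

    /// Runs sync steps until the chain tip is reached and returns one combined update.
    pub fn sync_state(&self, inputs: &SyncInputs) -> StateSyncUpdate {
        let mut update = StateSyncUpdate {
            block_num: inputs.current_block_num,
            ..Default::default()
        };
        loop {
            let response = self.rpc_api.sync_state(update.block_num);
            // Nothing new if the chain has not advanced.
            if response.block_num == update.block_num {
                break;
            }
            update.block_num = response.block_num;
            // Keep only the notes the caller told us it is tracking.
            update.committed_note_ids.extend(
                response
                    .committed_note_ids
                    .iter()
                    .copied()
                    .filter(|id| inputs.tracked_note_ids.contains(id)),
            );
            if response.block_num >= response.chain_tip {
                break;
            }
        }
        update
    }
}

/// Mock node for illustration: advances two blocks per step and commits one
/// note per step whose ID equals the block number.
pub struct MockNode {
    pub chain_tip: u32,
}

impl NodeRpc for MockNode {
    fn sync_state(&self, block_num: u32) -> SyncResponse {
        let next = (block_num + 2).min(self.chain_tip);
        SyncResponse {
            block_num: next,
            chain_tip: self.chain_tip,
            committed_note_ids: vec![u64::from(next)],
        }
    }
}
```

Because `sync_state` only borrows its inputs and returns a value, a caller can run it against a mock node or discard the returned update, which is the "simulate syncs without the store changing" property mentioned above.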

Callbacks

The component receives two callbacks that will be called in specific events during the sync:

OnNoteReceived: This callback will be called on each new committed note that is received in the sync state endpoint response. It receives a CommittedNote that contains the ID of the note being committed and an optional InputNoteRecord that corresponds to the state of the note in the node (only if the note is public). This allows the callback to check whether the new public note is relevant to the client or not. The callback should return a boolean indicating whether this new CommittedNote event should be applied to the store. The default implementation is on_note_received.
OnNullifierReceived: This callback will be called on each new nullifier that is received in the sync state endpoint response. It receives a NullifierUpdate that contains the new nullifier along with its block number. The callback should return a boolean indicating whether this new NullifierUpdate event should be applied to the store. There's no default implementation; the client will simply return true for every nullifier update.
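As a rough illustration of the two callbacks, the sketch below models them as boxed closures that return a boolean. The struct fields and the `filter_notes` and `accept_all_nullifiers` helpers are hypothetical simplifications, and the sketch is synchronous while the PR's callbacks are presumably async. Only the names `OnNoteReceived`, `OnNullifierReceived`, `CommittedNote`, `InputNoteRecord` and `NullifierUpdate` come from the description.

```rust
/// Simplified stand-in for the committed note passed to the callback.
pub struct CommittedNote {
    pub note_id: u64,
}

/// Simplified stand-in for the public note details returned by the node.
pub struct InputNoteRecord {
    pub tag: u32,
}

/// Simplified stand-in for a nullifier and the block in which it appeared.
pub struct NullifierUpdate {
    pub nullifier: u64,
    pub block_num: u32,
}

/// Returns `true` if the committed note should be applied to the store.
pub type OnNoteReceived = Box<dyn Fn(&CommittedNote, Option<&InputNoteRecord>) -> bool>;

/// Returns `true` if the nullifier update should be applied to the store.
pub type OnNullifierReceived = Box<dyn Fn(&NullifierUpdate) -> bool>;

/// Hypothetical helper: keeps only the IDs of the notes accepted by the callback.
pub fn filter_notes(
    notes: &[(CommittedNote, Option<InputNoteRecord>)],
    on_note_received: &OnNoteReceived,
) -> Vec<u64> {
    let mut accepted = Vec::new();
    for (note, public_record) in notes {
        if on_note_received(note, public_record.as_ref()) {
            accepted.push(note.note_id);
        }
    }
    accepted
}

/// Mirrors the behaviour described above: every nullifier update is accepted.
pub fn accept_all_nullifiers() -> OnNullifierReceived {
    Box::new(|_update| true)
}
```

A user-defined `OnNoteReceived` could, for example, accept only public notes carrying a particular tag and reject everything else.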

Other changes

The rpc_api in the client was changed from Box to Arc so it can be shared with the component. This also means that I had to change the base struct to give it interior mutability.
I moved most of the code from crates/rust-client/src/sync/mod.rs to the new crates/rust-client/src/sync/state_sync.rs.
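The Box-to-Arc change and the need for interior mutability can be illustrated with a small sketch. Everything here is an assumption made for illustration: the trait method, the `CountingRpcClient` type and the use of `Mutex` are not taken from the PR, which does not show how the real client achieves interior mutability.

```rust
use std::sync::{Arc, Mutex};

/// Hypothetical RPC trait. Methods take `&self` instead of `&mut self`,
/// because an `Arc<dyn NodeRpcClient>` only hands out shared references.
pub trait NodeRpcClient {
    /// Pretends to send a request and returns how many were sent so far.
    fn send_request(&self) -> u32;
}

/// Hypothetical implementation whose mutable state lives behind a `Mutex`.
#[derive(Default)]
pub struct CountingRpcClient {
    requests_sent: Mutex<u32>,
}

impl NodeRpcClient for CountingRpcClient {
    fn send_request(&self) -> u32 {
        // Interior mutability: mutate through a shared reference.
        let mut count = self.requests_sent.lock().expect("lock poisoned");
        *count += 1;
        *count
    }
}

/// Both owners hold the same RPC client.
pub struct Client {
    pub rpc_api: Arc<dyn NodeRpcClient>,
}

pub struct StateSync {
    pub rpc_api: Arc<dyn NodeRpcClient>,
}
```

With a `Box`, only one of `Client` and `StateSync` could own the RPC client. With an `Arc`, both can hold it, at the cost of moving any mutable state behind a lock or similar wrapper.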

TODOs

Rebase the branch to bring the changes from the RPC functions (some conflicts will probably emerge but they shouldn't be too hard to solve)
Since I'm already refactoring the sync process, I want to move the apply_mmr_changes part inside the component. I feel like it could be better documented and it should fit nicely inside the note_state_sync logic (it's just a different note state change).
Remove all the temporary TODOs in the code.
Once I'm finished with the changes above, I need to modify the web store to be compatible with the new functionality.
Fix onchain tag integration test.

crates/rust-client/src/sync/state_sync.rs

tomyrd · 2025-01-08T02:44:45Z

I need to tidy up the code and update the documentation, but the basic idea with this updated version is that we have callback functions (defined in state_sync.rs) that receive the unprocessed sync response and return the updates needed to be made in the store. These callbacks can be defined as closures that have access to the rpc_api and store and could be redefined by the user to affect how the changes are applied to the store at the end.

These callbacks could also work as a sort of event reaction, as the user could redefine them and they would be called on each sync with all the new and updated node information.

crates/rust-client/src/sync/state_sync.rs

crates/rust-client/src/notes/mod.rs

crates/rust-client/src/store/web_store/sync/mod.rs

crates/rust-client/src/sync/block_headers.rs

bobbinth

Thank you! Not a full review but I left some comments inline. Overall, I think the approach works - but we need to clean this up quite a bit and make sure the comments are accurate.

Also, I would suggest moving out some "auxiliary changes" (e.g., implementing interior mutability) into another PR so that this PR could focus on the state sync related logic.

crates/rust-client/src/sync/mod.rs

crates/rust-client/src/sync/state_sync.rs

bobbinth · 2025-02-11T09:47:54Z

crates/rust-client/src/sync/state_sync.rs

+/// Callback to be executed when a nullifier is received in the sync response. It receives the
+/// nullifier update received from the node and the list of transaction updates that were committed
+/// in the block.
+///
+/// It returns two optional notes (one input and one output) that should be updated in the store and
+/// an optional transaction ID if a transaction should be discarded.
+pub type OnNullifierReceived =


I think the comments are outdated here. Also, why do we need this callback?

We don't need it, the Client just assumes all nullifiers are relevant (i.e. the default implementation always returns true). The purpose of this callback is so that the sync component is more extensible and so that users can react to nullifiers arriving in the sync process.

We could remove it if we want a simplified version of the component on this first iteration.

I would probably get rid of this for now as I'm not 100% sure we'll need it. It'll simplify the PR a bit and if we do need it, should be pretty simple to add it in the future.

crates/rust-client/src/sync/state_sync.rs

crates/rust-client/src/rpc/domain/transaction.rs

crates/rust-client/src/rpc/tonic_client/mod.rs

crates/rust-client/src/sync/state_sync.rs

tomyrd · 2025-02-11T19:29:30Z

Just created two auxiliary PRs (#726 and #727) that will merge into the new-state-sync branch. Once this happens I'll merge those changes (along with the updates from next) into this branch. This will result in a better and smaller PR diff that will be easier to review.

You can review the new PRs first and wait to review this one until those are merged, so that it is easier.

tomyrd · 2025-02-12T21:10:31Z

The number of files to review has been reduced after merging the auxiliary PRs; the remaining changes are more relevant to the addition of the component.

igamigo

Very nice! I left some comments mostly about docs (feel free to disregard the more stylistic ones if you think the current version is better) and code structure, but the overall refactor and functionality looks good to me

crates/rust-client/src/account.rs

crates/rust-client/src/store/sqlite_store/sync.rs

crates/rust-client/src/sync/block_header.rs

crates/rust-client/src/sync/state_sync.rs

bobbinth

Thank you! Looks good! Not a full review still, but I left some comments inline. The main ones are about bringing the code up to date with the latest changes (e.g., not having nullifiers included in the state sync request).

crates/rust-client/src/sync/state_sync.rs

bobbinth · 2025-02-23T00:55:12Z

crates/rust-client/src/sync/state_sync.rs

+            nullifiers_tags.append(
+                &mut state_sync_update
+                    .note_updates
+                    .updated_input_notes()
+                    .filter(|note| {
+                        note.is_committed()
+                            && !nullifiers_tags.contains(&get_nullifier_prefix(&note.nullifier()))
+                    })
+                    .map(|note| get_nullifier_prefix(&note.nullifier()))
+                    .collect::<Vec<_>>(),


Could this be converted into a helper method on StateSyncUpdate? Maybe something like:

impl StateSyncUpdate {
    pub fn append_nullifier_tags(&self, target: &mut Vec<u16>) {
        ...
    }
}

But also, with some of the latest updates in miden-node, we no longer need to send nullifier tags with the state sync requests but rather do them at the end of the sync process via the GetNullifiersByPrefix endpoint.

We should probably integrate these changes here as it may simplify a few things.

Yes! This will be removed in #751 and merged into this PR.

Ended up changing it in this PR in case #751 gets closed after this discussion
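For reference, the helper suggested in this thread might have looked something like the sketch below before the nullifier tags were dropped. The `InputNote` struct and its fields are hypothetical simplifications of the real note record. Unlike the diff hunk above, which only checks against tags already in the target, this loop also deduplicates prefixes within the same update.

```rust
/// Hypothetical, simplified view of an updated input note.
pub struct InputNote {
    pub nullifier_prefix: u16,
    pub committed: bool,
}

#[derive(Default)]
pub struct StateSyncUpdate {
    pub updated_input_notes: Vec<InputNote>,
}

impl StateSyncUpdate {
    /// Appends the nullifier prefixes of committed notes to `target`,
    /// skipping prefixes that are already present.
    pub fn append_nullifier_tags(&self, target: &mut Vec<u16>) {
        for note in &self.updated_input_notes {
            if note.committed && !target.contains(&note.nullifier_prefix) {
                target.push(note.nullifier_prefix);
            }
        }
    }
}
```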

bobbinth · 2025-02-23T00:59:00Z

crates/rust-client/src/sync/state_sync.rs

+        let current_block_num = (u32::try_from(current_partial_mmr.num_leaves() - 1)
+            .expect("The number of leaves in the MMR should be greater than 0 and less than 2^32"))
+        .into();


Could we not get current_block_num here simply as:

let current_block_num = state_sync_update.block_num;

Yes, but I have to move this conversion to the state_sync_update construction so that the block num is up to date in the first call.
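The conversion under discussion derives the latest block number from the number of leaves in the partial MMR. A minimal sketch of that arithmetic, assuming leaf `i` corresponds to block `i` and that the leaf count is a `usize` (both inferred from the diff hunk, not confirmed), and returning `None` instead of panicking:

```rust
use std::convert::TryFrom;

/// Latest block number tracked by an MMR with `num_leaves` leaves.
/// Returns `None` for an empty MMR or if the index does not fit in a `u32`.
pub fn current_block_num(num_leaves: usize) -> Option<u32> {
    let last_leaf_index = num_leaves.checked_sub(1)?;
    u32::try_from(last_leaf_index).ok()
}
```

The diff hunk uses `expect` for the same two failure cases, which is reasonable if an empty MMR is considered impossible at that point.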

bobbinth · 2025-02-23T01:03:04Z

crates/rust-client/src/sync/state_sync.rs

+        state_sync_update.block_num = response.block_header.block_num();
+
+        // We don't need to continue if the chain has not advanced, there are no new changes
+        if response.block_header.block_num() == current_block_num {
+            return Ok(false);
+        }


If we can do what I described in the last comment, we could change this to be something like:

if response.block_header.block_num() == state_sync_update.block_num {
    return Ok(false);
}
state_sync_update.block_num = response.block_header.block_num();

And then we may not even need the current_block_num variable.

The current_block_num variable is also used in the request before this line so I think we need it anyways.

Edit: Sorry, I think I misunderstood the suggestion the first time I responded. I was just doing an overview of all comments and this is totally possible. I'll be removing this variable.

bobbinth · 2025-02-23T01:09:49Z

crates/rust-client/src/sync/state_sync.rs

+        let account_updates =
+            self.account_state_sync(accounts, &response.account_hash_updates).await?;
+
+        state_sync_update.account_updates = account_updates;


I may be missing something, but wouldn't this overwrite account updates received in the previous step?

But also, is there a reason to fetch updated public account states on every step rather than do that at the end of the sync for all updated public accounts?

> I may be missing something, but wouldn't this overwrite account updates received in the previous step?

Yes, I should have used extend here.

> But also, is there a reason to fetch updated public account states on every step rather than do that at the end of the sync for all updated public accounts?

Yes, this would be better. I'll change it so that the public account fetch is done at the end.
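The overwrite problem raised in this thread comes from assigning each step's result to the field instead of accumulating it. A minimal sketch of the accumulating version, using a hypothetical simplified `AccountUpdates` with plain integers in place of accounts, account IDs and hashes:

```rust
/// Hypothetical, simplified account updates collected during a sync.
#[derive(Debug, Default, PartialEq)]
pub struct AccountUpdates {
    /// IDs of public accounts whose state changed.
    pub updated_public_accounts: Vec<u64>,
    /// (account ID, new hash) pairs for private accounts that no longer match.
    pub mismatched_private_accounts: Vec<(u64, u64)>,
}

impl AccountUpdates {
    /// Merges the updates of another sync step into this one.
    pub fn extend(&mut self, other: AccountUpdates) {
        self.updated_public_accounts
            .extend(other.updated_public_accounts);
        self.mismatched_private_accounts
            .extend(other.mismatched_private_accounts);
    }
}
```

Calling `extend` once per step keeps the updates from earlier steps, whereas `state_sync_update.account_updates = account_updates;` would keep only the last step.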

bobbinth · 2025-02-23T02:00:39Z

crates/rust-client/src/sync/mod.rs

        self.store
            .apply_state_sync(state_sync_update)
            .await
            .map_err(ClientError::StoreError)?;

-        if response.chain_tip == response.block_header.block_num() {
-            Ok(SyncStatus::SyncedToLastBlock(sync_summary))
-        } else {
-            Ok(SyncStatus::SyncedToBlock(sync_summary))
-        }
-    }
-
-    // HELPERS
-    // --------------------------------------------------------------------------------------------
-
-    /// Returns the [`NoteUpdates`] containing new public note and committed input/output notes and
-    /// a list or note tag records to be removed from the store.
-    async fn committed_note_updates(
-        &mut self,
-        committed_notes: Vec<CommittedNote>,
-        block_header: &BlockHeader,
-    ) -> Result<(NoteUpdates, Vec<NoteTagRecord>), ClientError> {
-        // We'll only pick committed notes that we are tracking as input/output notes. Since the
-        // sync response contains notes matching either the provided accounts or the provided tag
-        // we might get many notes when we only care about a few of those.
-        let relevant_note_filter =
-            NoteFilter::List(committed_notes.iter().map(CommittedNote::note_id).copied().collect());
-
-        let mut committed_input_notes: BTreeMap<NoteId, InputNoteRecord> = self
-            .store
-            .get_input_notes(relevant_note_filter.clone())
-            .await?
-            .into_iter()
-            .map(|n| (n.id(), n))
-            .collect();
-
-        let mut committed_output_notes: BTreeMap<NoteId, OutputNoteRecord> = self
-            .store
-            .get_output_notes(relevant_note_filter)
-            .await?
-            .into_iter()
-            .map(|n| (n.id(), n))
-            .collect();
-
-        let mut new_public_notes = vec![];
-        let mut committed_tracked_input_notes = vec![];
-        let mut committed_tracked_output_notes = vec![];
-        let mut removed_tags = vec![];
-
-        for committed_note in committed_notes {
-            let inclusion_proof = NoteInclusionProof::new(
-                block_header.block_num(),
-                committed_note.note_index(),
-                committed_note.merkle_path().clone(),
-            )?;
-
-            if let Some(mut note_record) = committed_input_notes.remove(committed_note.note_id()) {
-                // The note belongs to our locally tracked set of input notes
-
-                let inclusion_proof_received = note_record
-                    .inclusion_proof_received(inclusion_proof.clone(), committed_note.metadata())?;
-                let block_header_received = note_record.block_header_received(block_header)?;
-
-                removed_tags.push((&note_record).try_into()?);
-
-                if inclusion_proof_received || block_header_received {
-                    committed_tracked_input_notes.push(note_record);
-                }
-            }
-
-            if let Some(mut note_record) = committed_output_notes.remove(committed_note.note_id()) {
-                // The note belongs to our locally tracked set of output notes
-
-                if note_record.inclusion_proof_received(inclusion_proof.clone())? {
-                    committed_tracked_output_notes.push(note_record);
-                }
-            }
-
-            if !committed_input_notes.contains_key(committed_note.note_id())
-                && !committed_output_notes.contains_key(committed_note.note_id())
-            {
-                // The note is public and we are not tracking it, push to the list of IDs to query
-                new_public_notes.push(*committed_note.note_id());
-            }
-        }
-
-        // Query the node for input note data and build the entities
-        let new_public_notes =
-            self.fetch_public_note_details(&new_public_notes, block_header).await?;
+        self.update_mmr_data().await?;


Why do we separate update_mmr_data() from apply_state_sync()? I would have expected that whatever is being done inside update_mmr_data() is done as a part of apply_state_sync().

It's one of the followup points but I was planning to do it in a future PR as this one was getting pretty big.

bobbinth · 2025-02-23T02:01:01Z

crates/rust-client/src/sync/mod.rs

+pub(crate) fn get_nullifier_prefix(nullifier: &Nullifier) -> u16 {
+    (nullifier.inner()[3].as_int() >> FILTER_ID_SHIFT) as u16
+}


As mentioned in one of the previous comments, we should be able to get rid of this function now.
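For context, the function above takes the most significant bits of the last element of the nullifier. The sketch below reproduces that bit manipulation on a plain `[u64; 4]`. It assumes `FILTER_ID_SHIFT` is 48, which leaves exactly 16 bits of a 64-bit value. The constant's actual value is not shown in the diff, and the real `Nullifier` holds field elements rather than raw integers.

```rust
/// Assumed value: shifting a 64-bit value right by 48 leaves its top 16 bits.
const FILTER_ID_SHIFT: u8 = 48;

/// Returns the 16 most significant bits of the last element as the prefix.
pub fn nullifier_prefix(nullifier: &[u64; 4]) -> u16 {
    // The cast cannot truncate: after the shift, at most 16 bits remain.
    (nullifier[3] >> FILTER_ID_SHIFT) as u16
}
```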

bobbinth · 2025-02-23T02:03:58Z

crates/rust-client/src/sync/state_sync_update.rs

 // STATE SYNC UPDATE
 // ================================================================================================

 /// Contains all information needed to apply the update in the store after syncing with the node.
+#[derive(Default)]
 pub struct StateSyncUpdate {


I would probably move all child structs of this struct (e.g., BlockUpdates, NoteUpdates, AccountUpdates, TransactionUpdates) into this file. Unless they are being used in multiple places?

bobbinth · 2025-02-23T02:08:44Z

crates/rust-client/src/sync/block_header.rs

+#[derive(Debug, Clone, Default)]
+pub struct BlockUpdates {
+    /// New block headers to be stored, along with a flag indicating whether the block contains
+    /// notes that are relevant to the client and the MMR peaks for the block.
+    block_headers: Vec<(BlockHeader, bool, MmrPeaks)>,
+    /// New authentication nodes that are meant to be stored in order to authenticate block
+    /// headers.
+    new_authentication_nodes: Vec<(InOrderIndex, Digest)>,
+}


Not for this PR, and this is probably fine for now, but for a client that is online and syncs very frequently with the network, we may end up storing a lot of extra data. Ideally, if a block header doesn't have relevant notes, we shouldn't store it, unless it is the last block in the chain. Though, I wonder if this has any implications.

I agree, but wouldn't we need to store some minimal data for the chain MMR, even if the block isn't relevant?

bobbinth · 2025-02-23T02:09:50Z

crates/rust-client/src/store/web_store/sync/flattened_vec.rs

Why are these changes needed in this PR?

The change from u32 to usize is because of a cast_possible_truncation clippy error. This file wasn't being included in previous versions of the web client, so clippy didn't check it.
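The lint fires on casts such as `len() as u32`, where a `usize` may not fit in the target type. A hedged sketch of how storing lengths as `usize` avoids the cast altogether is shown below. The struct layout and function name are assumptions based on the file name, not the file's actual contents.

```rust
/// Hypothetical flattened representation of a `Vec<Vec<u8>>`: all bytes in
/// one buffer, plus the length of each inner vector.
pub struct FlattenedU8Vec {
    pub data: Vec<u8>,
    pub lengths: Vec<usize>,
}

/// Flattens nested byte vectors without any narrowing cast.
pub fn flatten_nested_u8_vec(nested: Vec<Vec<u8>>) -> FlattenedU8Vec {
    // `Vec::len` already returns `usize`, so no `as u32` is needed here.
    let lengths: Vec<usize> = nested.iter().map(Vec::len).collect();
    let data: Vec<u8> = nested.into_iter().flatten().collect();
    FlattenedU8Vec { data, lengths }
}
```

If a `u32` were required at some boundary, `u32::try_from(len)` would make the possible failure explicit instead of silently truncating.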

tomyrd · 2025-03-05T13:38:36Z

I'll be updating this PR with the new state sync rpc request to fix the CI tests. Originally I wanted to wait for #751 but the stream functionality is currently being discussed.

tomyrd added 8 commits December 30, 2024 17:17

feat: give interior mutability to NodeRpcClient

6a7579f

refactor: move client's rpc_api to Arc

991779e

feat: add StateSync component (wip)

d7c7fb6

feat: remove old sync structs

0812700

feat: update output notes on sync

c1b8c4f

feat: check for locked accounts in state sync

435682b

doc: improve documentation

b61262d

refactor: revert unnecessary changes

d54e186

tomyrd force-pushed the tomyrd-sync-component-alt branch from 8731b2b to d54e186 Compare January 2, 2025 23:05

Merge branch 'next' into tomyrd-sync-component-alt

922b862

bobbinth reviewed Jan 7, 2025

crates/rust-client/src/sync/state_sync.rs (outdated)

bobbinth reviewed Jan 7, 2025

crates/rust-client/src/sync/state_sync.rs

tomyrd added 2 commits January 7, 2025 19:43

refactor: move state transitions outside StateSync

a499748

feat: add update callbacks

b0930cf

bobbinth reviewed Jan 8, 2025

crates/rust-client/src/sync/state_sync.rs (outdated)

bobbinth reviewed Jan 8, 2025

crates/rust-client/src/sync/state_sync.rs (outdated)

bobbinth reviewed Jan 8, 2025

crates/rust-client/src/sync/state_sync.rs (outdated)

tomyrd added 2 commits January 8, 2025 17:55

refactor: change callbacks to deal with individual elements

170cc85

chore: improve code structure and documentation

599a801

tomyrd force-pushed the tomyrd-sync-component-alt branch from 6baeafa to 50938fb Compare January 9, 2025 02:42

bobbinth reviewed Jan 9, 2025

crates/rust-client/src/notes/mod.rs (outdated)

tomyrd added 2 commits January 9, 2025 11:49

fix: update web store

2645e7a

chore: update CHANGELOG

2f26007

tomyrd force-pushed the tomyrd-sync-component-alt branch from 50938fb to 2f26007 Compare January 9, 2025 14:50

tomyrd mentioned this pull request Jan 10, 2025

refactor: add StateSync component #646

Closed

Merge branch 'next' into tomyrd-sync-component-alt

e86c024

igamigo reviewed Jan 10, 2025

crates/rust-client/src/store/web_store/sync/mod.rs

crates/rust-client/src/sync/block_headers.rs (outdated)

tomyrd marked this pull request as ready for review January 10, 2025 21:12

tomyrd mentioned this pull request Jan 10, 2025

StateSync component refactor follow-ups #663

Open

5 tasks

bobbinth reviewed Feb 11, 2025

tomyrd mentioned this pull request Feb 11, 2025

feat: give interior mutability to rpc client #726

Merged

review: improve StateSync comments

12e4706

tomyrd mentioned this pull request Feb 11, 2025

refactor: update structs for StateSync component #727

Merged

Merge branch 'new-state-sync' into tomyrd-sync-component-alt

fa830a5

tomyrd force-pushed the tomyrd-sync-component-alt branch from 6a4bf21 to fa830a5 Compare February 12, 2025 21:08

igamigo approved these changes Feb 13, 2025

tomyrd added 4 commits February 14, 2025 10:43

review: refactor BlockUpdates

8ea1a15

review: improve docs

a6ac948

review: improve NoteUpdates

b1c0110

remove state_sync_update from component

52c04ab

This was referenced Feb 14, 2025

feat: add prefix to Nullifier 0xPolygonMiden/miden-base#1153

Merged

feat: stream SyncState response 0xPolygonMiden/miden-node#685

Open

tomyrd added 2 commits February 21, 2025 15:34

Merge branch 'new-state-sync' into tomyrd-sync-component-alt

6e3ba27

feat: add state sync component to client constructor

efb9589

bobbinth reviewed Feb 23, 2025

tomyrd added 3 commits February 24, 2025 11:07

fix: use Nullifier::prefix

d1c8490

review: address suggestions

728f3f6

review: remove OnNullifierReceived callback

2090593

igamigo mentioned this pull request Feb 24, 2025

feat: Make sync compatible with node's next #758

Merged

TomasArrachea and others added 2 commits February 25, 2025 16:07

Merge branch 'new-state-sync' into tomyrd-sync-component-alt

9c99579

revert efb9589

807a929

review:address suggestions

54b277f

tomyrd mentioned this pull request Mar 5, 2025

feat: reduce the extra data used for block headers that aren't relevant to the client #773

Open

tomyrd added 2 commits March 5, 2025 17:22

Merge branch 'new-state-sync' into tomyrd-sync-component-alt

02eb0de

feat: add check nullifiers request

3dc16d5

tomyrd force-pushed the tomyrd-sync-component-alt branch from 8a5a6a6 to 3dc16d5 Compare March 5, 2025 20:22
