fix(tree) Fix document corruption due to loss of change info for transient nodes in sequence field #16190

yann-achard-MS · 2023-06-29T20:40:18Z

Motivation

Fixes the fact that we were dropping all nested changes to a subtree when composing the (re)creation of that subtree, changes to it, and the deletion of the subtree. See added/updated tests for details. Note that this is for sequence field only.

Dropping the nested changes used to produce unexpected results as well as potential divergence in state between clients, which eventually leads to document corruption.

Changes

This is accomplished by adding information on Insert and Revive marks to convey whether the content has been deleted (in addition to inserted/revived), and a ChangeAtomId describing the deletion.

The ChangeAtomId info is used to properly order and match up (in terms of cell index) different marks. For example, by detecting that a later revive should be composed with the transient insert.

This PR also disables some composition tests. These tests check that left-associative and right-associative compositions produce the same results. The changes in this PR do not introduce the discrepancy in how those compositions are processed, but they make the existing discrepancy obvious by making the output of the composition different. This reduction in the correctness of compose is considered an acceptable price to pay for the benefit of handling transient node changes better, especially when considering that the prior compose logic, while consistent w.r.t. to associativity, was producing results that are consistently wrong (this is because repair data needs to be created for pairs of inserts and deletes).

msfluid-bot · 2023-06-29T21:05:23Z

⯅ @fluid-example/bundle-size-tests: +2.6 KB

Metric Name	Baseline Size	Compare Size	Size Diff
aqueduct.js	455.09 KB	455.09 KB	■ No change
connectionState.js	680 Bytes	680 Bytes	■ No change
containerRuntime.js	244.52 KB	244.52 KB	■ No change
loader.js	151.41 KB	151.41 KB	■ No change
map.js	47.19 KB	47.19 KB	■ No change
matrix.js	147.86 KB	147.86 KB	■ No change
odspDriver.js	93.68 KB	93.68 KB	■ No change
odspPrefetchSnapshot.js	44.51 KB	44.51 KB	■ No change
sharedString.js	164.77 KB	164.77 KB	■ No change
sharedTree2.js	240.38 KB	242.98 KB	⯅ +2.6 KB
Total Size	1.71 MB	1.71 MB	⯅ +2.6 KB

Baseline commit: b04b832

Generated by 🚫 dangerJS against 02ba46c

alex-pardes · 2023-06-30T16:28:49Z

experimental/dds/tree2/src/test/utils.ts

+/**
+ * @returns `true` iff the given delta has a visible impact on the document tree.
+ */
+export function isDeltaVisible(delta: Delta.MarkList): boolean {


Is it difficult to avoid creating "invisible" deltas?

In general, yes, because it might require following a sequence of moves to see if they span the boundary between transient and non-transient stuff.

For the tests that use this function, we could avoid it in more cases if NodeChangeComposer had the ability to return undefined in order to signify that the outcome of the composition turned out to be a no-op.

alex-pardes · 2023-06-30T16:29:47Z

experimental/dds/tree2/src/test/utils.ts

+				case Delta.MarkType.MoveOut:
+				case Delta.MarkType.MoveIn:
+				case Delta.MarkType.Delete:
+					break;


Why don't these marks count towards visibility?

My bad, they indeed should.

That was my mistake. They do now.

alex-pardes · 2023-06-30T17:37:06Z