CRDT Implementation

Trait-based system for Conflict-free Replicated Data Types enabling deterministic conflict resolution.

Core Concepts

CRDT Trait: Defines merge operation for resolving conflicts between divergent states. Requires Serialize, Deserialize, and Default implementations.

Merkle-CRDT Principles: CRDT state stored in Entry's RawData for deterministic merging across distributed systems.

Multiple CRDT Support: Different CRDT types can be used for different stores within the same database.

Doc Type

Doc: The main CRDT document type

Hierarchical document structure supporting nested data
Provides document-level operations (get, set, merge, etc.)
Handles path-based operations for nested data access (dot notation)
Supports the Value enum for different data types

Value Types:

Text (string)
Int (i64 integer)
Bool (boolean)
Doc (nested document)
List (ordered collection with CRDT positioning)
Deleted (tombstone marker)

CRDT Behavior:

Recursive merging for nested structures
Last-write-wins strategy for conflicting leaf values
Tombstones for deletion tracking
Type-aware conflict resolution

Tombstones

Critical for distributed deletion propagation:

Mark data as deleted instead of physical removal
Retained and synchronized between replicas
Ensure deletions propagate to all nodes
Prevent resurrection of deleted data

Merge Algorithm

LCA-Based Computation: Uses Lowest Common Ancestor for efficient state calculation

Process:

Identify parent entries (tips) for store
Find LCA if multiple parents exist
Merge all paths from LCA to parent tips
Cache results for performance

Caching: Automatic caching of computed states with (Entry_ID, Store) keys for dramatic performance improvements.

Custom CRDT Implementation

Requirements:

Struct implementing Default, Serialize, Deserialize
Data marker trait implementation
CRDT trait with deterministic merge logic
Optional SubTree handle for user-friendly API