Diode Message Canister - Milestone 1 Architecture - Amended

The MS1 architecture has been amended to address the following points:

Each allocated Region carries a fixed 8 MB overhead (https://forum.dfinity.org/t/region-api-8mb-stable-overhead-per-region/35750). With this overhead, having one Region for every key_id index does not scale. Instead, the key_id indexes are now a linked list within the inbox region.

The key_id index points to the first and last message id for each key, so the interface stays the same.

Changed Architecture

The architecture is nearly the same as in the original architecture document. The only difference is that the Key Inbox Regions are removed.

graph TD
    subgraph Canister[Diode Message Canister]
        SM[Store Message]
        GM[Get Messages]
        GK[Get Messages Indexed by Encryption Key]
        IX[Heap Indexes]

        subgraph DS[Data Storage]
            IR[Inbox Region]
            PR[Cipher Text Region]
        end
    end

    C1[Client 1] -->|add_message| SM
    C2[Client 2] -->|get_message_by_id| GM
    C3[Client 3] -->|get_*_message_id_by_key| GK

    SM -->|Write| DS[Stable Storage]
    GM -->|Read| DS
    GK -->|Read| DS


Indexes

Only the Key ID Index is changed:

| Index Name | Key | Value | Description |
| --- | --- | --- | --- |
| Key ID Index | key_id (20 bytes) | (min message id, max message id, message count) | Maps the encryption key id to the lowest and highest message id in the inbox region encrypted with this key, along with the message count. |

Data layout

The inbox region layout changes to add a next_id and prev_id field to each message to allow for a linked list of messages per key_id.
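The following is a minimal sketch of how appending a message could maintain the per-key linked list and the Key ID Index together. It uses an in-memory list as a stand-in for the inbox region. The names `Inbox`, `Entry`, `KeyIndexValue`, the sentinel `NO_ID`, and the choice of ids starting at 0 are assumptions made for illustration and are not specified in this document.

```python
from dataclasses import dataclass

NO_ID = 0xFFFFFFFF  # hypothetical sentinel meaning "no neighbour"; not specified in this document


@dataclass
class Entry:
    id: int
    key_id: bytes
    prev_id: int = NO_ID
    next_id: int = NO_ID


@dataclass
class KeyIndexValue:
    min_id: int   # first message id for this key
    max_id: int   # last message id for this key
    count: int    # number of messages for this key


class Inbox:
    """In-memory stand-in for the inbox region plus the heap Key ID Index."""

    def __init__(self) -> None:
        # Position i holds the entry with id i (assumes ids start at 0).
        self.entries = []
        self.key_index = {}

    def append(self, key_id: bytes) -> int:
        new_id = len(self.entries)
        entry = Entry(id=new_id, key_id=key_id)
        slot = self.key_index.get(key_id)
        if slot is None:
            # First message for this key: it is both head and tail of the list.
            self.key_index[key_id] = KeyIndexValue(new_id, new_id, 1)
        else:
            # Link the new entry behind the current tail, then move the tail.
            entry.prev_id = slot.max_id
            self.entries[slot.max_id].next_id = new_id
            slot.max_id = new_id
            slot.count += 1
        self.entries.append(entry)
        return new_id
```

In this sketch an append touches at most one existing entry (the previous tail for the same key), so the cost of an insert does not depend on how many messages a key already has.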

Binary layout of `inbox`

Each inbox entry has a fixed size of 84 bytes. This allows fast binary search by timestamp and direct calculation of an entry's location from its id field. Ids are guaranteed to increment by one per entry. In the future, the id field might be used to support a ring buffer layout for the inbox to prevent growth beyond limits.
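As a rough illustration of why fixed-size entries help, the sketch below computes an entry's byte offset from its id and runs a binary search over timestamps. It assumes that timestamps are non-decreasing in id order (entries are appended at insertion time) and that the region starts with the entry whose id is `first_id`. The function names and the `read_timestamp` callback are hypothetical and stand in for real region reads.

```python
from typing import Callable

ENTRY_SIZE = 84  # bytes per inbox entry, as stated in this document


def entry_offset(message_id: int, first_id: int = 0) -> int:
    """Byte offset of an entry inside the inbox region (first_id is an assumption)."""
    return (message_id - first_id) * ENTRY_SIZE


def first_index_at_or_after(
    timestamp: int, count: int, read_timestamp: Callable[[int], int]
) -> int:
    """Binary search for the lowest entry index whose timestamp is >= `timestamp`.

    `read_timestamp(i)` returns the timestamp of the i-th entry.
    Returns `count` when no entry qualifies.
    """
    lo, hi = 0, count
    while lo < hi:
        mid = (lo + hi) // 2
        if read_timestamp(mid) < timestamp:
            lo = mid + 1
        else:
            hi = mid
    return lo
```

For example, with the entry timestamps `[100, 100, 105, 110, 120]`, searching for `105` yields index 2, and `entry_offset(2)` gives byte offset 168 for that entry.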

| Field | Type | Description |
| --- | --- | --- |
| id | uint32 | auto incremented integer and unique id of this message |
| timestamp | uint32 | unix timestamp of the insertion into the canister |
| key_id | bytes41 | group_id of the destination group |
| next_id | uint32 | id of the next message in the linked list for this key_id |
| prev_id | uint32 | id of the previous message in the linked list for this key_id |
| hash | bytes32 | sha256 hash of the cipher_text |
| offset | uint64 | offset of the message payload in the `payload_region` |
| len | uint32 | length of the message payload in the `payload_region` |

Visualization of the data relation

erDiagram
    %% HEAP Maps
    KEY_INBOX_MAP ||--o{ INBOX : maps
    KEY_INBOX_MAP {
        bytes41 key_id
        uint32 min_msg_id
        uint32 max_msg_id
    }
    MESSAGE_INDEX ||--o{ INBOX : indexes
    MESSAGE_INDEX {
        bytes32 hash
        uint32 message_id
    }

    %% REGION Data
    INBOX ||--|{ PAYLOAD : references
    INBOX {
        uint32 message_id
        uint32 timestamp
        bytes41 destination
        bytes32 hash
        uint32 next_msg_id
        uint32 prev_msg_id
        uint64 offset
        uint32 len
    }
    PAYLOAD {
        bytes cipher_text
    }

    %% Markers
    HEAP_MARKER[HEAP_MEMORY]
    REGION_MARKER[STABLE_MEMORY]

    %% Relationships to markers
    HEAP_MARKER ||--|{ KEY_INBOX_MAP : contains
    HEAP_MARKER ||--|{ MESSAGE_INDEX : contains
    REGION_MARKER ||--|{ INBOX : contains
    REGION_MARKER ||--|{ PAYLOAD : contains


Remarks

This amendment to the original architecture reduces the number of Regions from (2 + number of unique key_ids) to a fixed 2 Regions.
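To make the saving concrete, the arithmetic can be sketched as below, using the 8 MB per-Region overhead cited at the top of this document. The function names and the figure of 1000 unique key_ids are hypothetical and chosen only for illustration.

```python
REGION_OVERHEAD_MB = 8  # fixed per-Region overhead cited in this document


def original_overhead_mb(unique_key_ids: int) -> int:
    """Overhead of the original design: 2 Regions plus one per unique key_id."""
    return (2 + unique_key_ids) * REGION_OVERHEAD_MB


def amended_overhead_mb() -> int:
    """Overhead of the amended design: a fixed 2 Regions."""
    return 2 * REGION_OVERHEAD_MB
```

With 1000 unique key_ids, the original design would carry 8016 MB of Region overhead, while the amended design stays at 16 MB regardless of the number of keys.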

The downside is that the linked list of messages for a given key_id is embedded within the inbox region. Linear traversal along the list is possible, but random access within a key_id is not, so finding a message by timestamp within a key_id requires linear traversal.
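The linear traversal described here might look like the sketch below, which walks the `next_id` links from the first message of a key until it finds an entry at or after a given timestamp. The `entries` mapping stands in for reads from the inbox region, and `NO_ID` is a hypothetical end-of-list sentinel. Neither is defined by this document.

```python
from typing import Dict, Optional, Tuple

NO_ID = 0xFFFFFFFF  # hypothetical end-of-list sentinel; not specified in this document


def find_in_key(
    entries: Dict[int, Tuple[int, int]], first_id: int, timestamp: int
) -> Optional[int]:
    """Walk one key's linked list and return the first id with timestamp >= `timestamp`.

    `entries` maps message id -> (timestamp, next_id).
    `first_id` is the min message id from the Key ID Index.
    Returns None when no message of this key qualifies.
    """
    current = first_id
    while current != NO_ID:
        entry_timestamp, next_id = entries[current]
        if entry_timestamp >= timestamp:
            return current
        current = next_id
    return None
```

The cost is proportional to the number of messages for that key that precede the match, in contrast to the logarithmic binary search available over the inbox as a whole.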
