Skip to content

External Address Format (SS58)

Alexander Popiak edited this page Nov 30, 2020 · 21 revisions

SS58 is a simple address format designed for Substrate based chains. There's no problem with using other address formats for a chain, but this serves as a robust default. It is heavily based on Bitcoin's Base-58-check format with a few alterations.

The basic idea is a base-58 encoded value which can identify a specific account on the Substrate chain. Different chains have different means of identifying accounts. SS58 is designed to be extensible for this reason.

Basic Format

The basic format conforms to:

base58encode ( concat ( <address-type>, <address>, <checksum> ) )

That is, the concatenated byte series of address type, address and checksum then passed into a base-58 encoder. The base58encode function is exactly as defined in Bitcoin and IPFS, using the same alphabet as both.

Address Type

The <address-type> is one or more bytes that describe the precise format of the following bytes.

Currently, there exist several valid values:

  • 00000000b..=00101111b (0..47 inclusive): Account/address identifiers on networks:
    • 00000000b (0) Polkadot Live (SS58 checksum preimage)
    • 00000001b (1) Polkadot Live (reserved for secondary use)
    • 00000010b (2) Polkadot Canary (SS58 checksum preimage)
    • 00000011b (3) Polkadot Canary (reserved for secondary use)
    • 00000111b (7) Edgeware/Berlin (SS58 checksum preimage)
    • 00010000b (16) Kulupu (SS58 checksum preimage)
    • 00010001b (17) Kulupu (Reserved)
    • 00010100b (20) Dothereum (SS58 checksum preimage)
    • 00101010b (42) Generic Substrate wildcard (SS58 checksum preimage)
    • 00101011b (43) Generic Substrate wildcard (reserved for secondary use)
    • 00101111b (46) Reserved for future address format extensions.
    • 00101111b (47) Reserved for future address format extensions.
  • 00110000b..=00110111b (48..55 inclusive): Bare public keys of primary cryptography
    • 00110000b (48) Schnorr/Ristretto 25519 ("S/R 25519") key
    • 00110001b (49) Edwards Ed25519 key
    • 00110010b (50) ECDSA SECP256k1 key
  • 00111000b..=00111111b (56..63 inclusive): Bare public keys of secondary cryptography (additional byte expected).
  • 01000000b..=01111111b (64-127 inclusive) Reserved for future address format extensions.
  • 10000000b..=11111111b (128-255 inclusive) Secondary address types (additional byte expected).

The latter (42) is a "wildcard" address that is meant to be equally valid on all Substrate networks that support fixed-length addresses. For production networks, however, a network-specific version may be desirable to help avoid the key-reuse between networks and some of the problems that it can cause. Substrate Node will default to printing keys in address type 42, though alternative Substrate-based node implementations (e.g. Polkadot) may elect to default to some other type.

Address Formats for Substrate

There are 16 different address formats, identified by the length (in bytes) of the total payload (i.e. including the checksum).

  • 3 bytes: 1 byte account index, 1 byte checksum
  • 4 bytes: 2 byte account index, 1 byte checksum
  • 5 bytes: 2 byte account index, 2 byte checksum
  • 6 bytes: 4 byte account index, 1 byte checksum
  • 7 bytes: 4 byte account index, 2 byte checksum
  • 8 bytes: 4 byte account index, 3 byte checksum
  • 9 bytes: 4 byte account index, 4 byte checksum
  • 10 bytes: 8 byte account index, 1 byte checksum
  • 11 bytes: 8 byte account index, 2 byte checksum
  • 12 bytes: 8 byte account index, 3 byte checksum
  • 13 bytes: 8 byte account index, 4 byte checksum
  • 14 bytes: 8 byte account index, 5 byte checksum
  • 15 bytes: 8 byte account index, 6 byte checksum
  • 16 bytes: 8 byte account index, 7 byte checksum
  • 17 bytes: 8 byte account index, 8 byte checksum
  • 34 bytes: 32 byte account id, 2 byte checksum

Checksum types

Several potential checksum strategies exist within Substrate, giving different length and longevity guarantees. There are two types of checksum preimage (known as SS58 and AccountID) and many different checksum lengths (1 to 8 bytes).

In all cases for Substrate, the Blake2-256 hash function is used. The variants simply select the preimage used as the input to the hash function and the number of bytes taken from its output.

The bytes used are always the left most bytes. The input to be used is the non-checksum portion of the SS58 byte series used as input to the base-58 function, i.e. concat( <address-type>, <address> ). A context prefix of 0x53533538505245, (the string SS58PRE) is prepended to the input to give the final hashing preimage.

The advantage of using more checksum bytes is simply that more bytes provide a greater degree of protection against input errors and index alteration at the cost of widening the textual address by an extra few characters. For the account ID form, this is insignificant and therefore no 1-byte alternative is provided. For the shorter account-index formats, the extra byte represents a far greater portion of the final address and so it is left for further up the stack (though not necessarily the user themself) to determine the best tradeoff for their purposes.

Primary, secondary and bare types

The table above and, more canonically, the codebase as well as the registry express the status of the first 64 external address formats, known as the primary formats. A set of secondary formats exists that use multibyte addresses, identifiable by the most significant bit of the leftmost byte being set.

In this case, an additional byte is used to form a 15-bit value (0..32767), identifying the address type. These 32 thousand or so types are called secondary address types.

In addition to address types, there are bar cryptographic types that identify only a cryptographic method and not a particular application or subsystem. There are up to eight bare primary types (shown above) as well as a further 2048 secondary bare types, as 11-bit value made up of 3 bits from the first byte and 8 from the second.

In both cases of secondary values, the bitstring (either 11-bit or 15-bit) should be interpreted as Little Endian, with the least significant logical byte being split across the first two concrete bytes of the address. This is consistent with SCALE's dynamic-sized numberic encoding.

Clone this wiki locally