What are Salesforce ID's composed of?
The Id Field Type is a base-62 encoded string.
Each character can be one of 62 possible values:
- a lowercase letter (a-z) - 26 values
- an uppercase letter (A-Z) - 26 values
- a numeric digit (0-9) - 10 values
As there is a combination of lower and upper case letters the casing of the 15 character Id has significance. E.g. 50130000000014c is a different ID from 50130000000014C.
Within a 15 character Id the breakdown is:
- First 3 characters - Key Prefix As per Jon's answer, the first 3 characters are the key prefix that identify the object type. There are a few exceptions to this where multiple objects all share the same key prefix! There are a number of fixed key prefixes that are common across all of Salesforce. Custom objects get a unique key prefix per Org.
I'd need to confirm this, but I'm fairly certain thatCustom objects in managed packages can have a different keyprefix in each installed org. - The 4th and 5th characters - Reserved. Currently used for the instance id (a.k.a. pod identifier) (As per comment from @ca_peterson). Starting with the 4th character and overflowing to the 5th if required. Indicates which pod/instance the record was created on. Note that data may be migrated to other pods over time. Updated based on a separate question that indicated the pod identifier is actually two characters rather than one as initially thought.
- 6th character - Reserved. Will be
0
until such time that Salesforce has a need for it. Source - Steven Tamm - Remaining 9 characters - basically a really big number. Like 62^9 big.
To this you can add an optional 3 character suffix that will make the Id unique case-insensitive. This is useful when working with programs that can't maintain the case of the ID (E.g. Excel VLookup).
Notes about the suffix:
- this is not intended as a check sum to verify the other 12 characters haven't been corrupted.
- you can't just lower/upper case the entire ID. While it helps other case insensitive applications handle the IDs Salesforce is still case sensitive and won't auto correct the casing based on the suffix. E.g. the casing on a KeyPrefix is important with
00t
being OpportunityShare and00T
being Task.
The algorithm to convert from a 15 character Id to an 18 character Id is: (Source - I'm sure there used to be official documentation on how do this.)
Divide the 15 char into 3 chunks of 5 chars each.
For each character give that position a value of 1 if uppercase, 0 otherwise (lowercase or number).
Combine the bits from each chunk into a 5 bit integer where the rightmost bit is the most significant bit. This will yield a number between 0 and 31 for each chunk.
Construct an array that contains the sequence of capital letters A-Z and 0-5 (26 + 6 = 32 possible values).
Use the integer from each chunk to choose a character from the array.
Append the resulting 3 characters, in chunk order, to the end of the 15 char id.
In a formula there is the CASESAFEID function that will perform this algorithm.
You can apply this algorithm to some sample IDs to see how it doesn't really function as a checksum or checkdigit. For example, if you exclude the alpha characters, every ID between 001100000000001 and 001999999999999 will have the suffix AAA. Infact, you get the same suffix if you include any lowercase alpha characters as well. The suffix will only change in the presence of uppercase characters. It is basically encoding which of the 5 characters that each suffix character represents are uppercase.
Sample code to restore the casing from an 18 character ID is in Creating a link using an 18 character ID
If you are working with Data Exports you can also come across the special empty key with the 000 keyprefix.
One area I'm not sure of is the order in which Salesforce increments through the base 62 encoding. E.g. Does it go 0 to 9, then a to z, then A to Z? At this stage I think the sequence looks like '0123456789aAbBcCdDeEfFgGhHiIjJkKlLmMnNoOpPqQrRsStTuUvVwWxXyYzZ'
'0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz'
The first 3 digits are a prefix that specifies the type of sObject, a big list can be found here:
http://www.fishofprey.com/2011/09/obscure-salesforce-object-key-prefixes.html
I believe the rest of the Id is reference to the record itself. I should also add that the ID's are 15 digits long but can be 18 digits long with the last 3 digits for error correction making the the Id case-insensitive.
So:
3 Digits (Object) / 12 Digits (Record) / (Optional) 3 Digits (Error Correction)
While is is true that the last 3 character ads case insensitivity, I believe the algorithm used to generate them is the check digit. Check digits originated in legacy data transmission to alleviate the introduction if errors in the data. So a bit if both I reckon.