Logic table of supersets
Character sets up to 8-bit comparison
Some systems store all characters in up to 8 bits. This is most notable in C and C++ languages where the char type of a single byte, usually 8 bits, stores a single character, and the character and string literals are based on the char type. In Microsoft Windows, the char type could be of CP437 encoding, or CP850, etc. depending on the regional settings. The char type is commonly said to be ASCII characters, however most systems actually use a superset of ASCII such as CP437 or ISO 8859-1.
ASCII | ISO 8859-1 | ISO 8859-14 | CP1252 | Vietnamese | Japanese (halfwidth) | CP437 | |||||
character count: | 95 | 191 | 191 | 218 | 229 | 163 | 255 | ||||
bits in its binary format: | 7 | 8 | 8 | 8 | 8 | 8 | 8 | ||||
Empty set | |||||||||||
extensions (sets not intended to be used on their own) | |||||||||||
IPA-MINI | |||||||||||
character sets up to 8-bit (up to 256 characters) | |||||||||||
ASCII | |||||||||||
ISO 8859-1 | |||||||||||
ISO 8859-14 | |||||||||||
CP1252 | |||||||||||
Vietnamese | |||||||||||
Japanese (halfwidth) | |||||||||||
CP437 | |||||||||||
bits in its binary format: | 7 | 8 | 8 | 8 | 8 | 8 | 8 | ||||
character count: | 95 | 191 | 191 | 218 | 229 | 163 | 255 | ||||
ASCII | ISO 8859-1 | ISO 8859-14 | CP1252 | Vietnamese | Japanese (halfwidth) | CP437 |
Character sets more than 8-bit comparison
Unicode is much larger than any 8-bit codepage, therefore it may be unfeasible to design every single Unicode character. Therefore the Unicode subsets are made to be a point of inclusion in the font, so that a font can be marketted as for instance having Subset2 character set.
The Japanese character set is based on the Shift JIS encoding with 163 8-bit characters and 7326 16-bit characters. For all other character sets more than 8-bit, it is recommended to use the UTF-16 encoding where BMP characters are 16-bit, and non-BMP characters are 32-bit.
MES-1 | VSECS | W1G | WGL4 | Subset1 | SECS | KRA-1.0 | MES-2 | KRA-1.1 | Subset2 | KRA-1.2 | LPTT-1 | LPTT-1&Subset1 | LPTT-1.1 | MES-3B | Subset3 | Subset3+ | LPTT-1.3 | LPTT-1.4 | Japanese | |
character count: | 335 | 361 | 603 | 657 | 678 | 708 | 999 | 1064 | 1098 | 1193 | 1351 | 1999 | 2005 | 2201 | 2821 | 2823 | 3309 | 3501 | 3996 | 7489 |
Empty set | ||||||||||||||||||||
extensions (sets not intended to be used on their own) | ||||||||||||||||||||
IPA-MINI | ||||||||||||||||||||
character sets up to 8-bit (up to 256 characters) | ||||||||||||||||||||
ASCII | ||||||||||||||||||||
ISO 8859-1 | ||||||||||||||||||||
ISO 8859-14 | ||||||||||||||||||||
CP1252 | ||||||||||||||||||||
Vietnamese | ||||||||||||||||||||
Japanese (halfwidth) | ||||||||||||||||||||
CP437 | ||||||||||||||||||||
character sets more than 8-bit (comparison with each other) | ||||||||||||||||||||
MES-1 | ||||||||||||||||||||
VSECS | ||||||||||||||||||||
W1G | ||||||||||||||||||||
WGL4 | ||||||||||||||||||||
Subset1 | ||||||||||||||||||||
SECS | ||||||||||||||||||||
KRA-1.0 | ||||||||||||||||||||
MES-2 | ||||||||||||||||||||
KRA-1.1 | ||||||||||||||||||||
Subset2 | ||||||||||||||||||||
KRA-1.2 | ||||||||||||||||||||
LPTT-1 | ||||||||||||||||||||
LPTT-1&Subset1 | ||||||||||||||||||||
LPTT-1.1 | ||||||||||||||||||||
MES-3B | ||||||||||||||||||||
Subset3 | ||||||||||||||||||||
Subset3+ | ||||||||||||||||||||
LPTT-1.3 | ||||||||||||||||||||
LPTT-1.4 | ||||||||||||||||||||
Japanese | ||||||||||||||||||||
character count: | 335 | 361 | 603 | 657 | 678 | 708 | 999 | 1064 | 1098 | 1193 | 1351 | 1999 | 2005 | 2201 | 2821 | 2823 | 3309 | 3501 | 3996 | 7489 |
MES-1 | VSECS | W1G | WGL4 | Subset1 | SECS | KRA-1.0 | MES-2 | KRA-1.1 | Subset2 | KRA-1.2 | LPTT-1 | LPTT-1&Subset1 | LPTT-1.1 | MES-3B | Subset3 | Subset3+ | LPTT-1.3 | LPTT-1.4 | Japanese |