Logic table of supersets

From Wikicode Subsets
Jump to navigation Jump to search

Character sets up to 8-bit comparison

Some systems store all characters in up to 8 bits. This is most notable in C and C++ languages where the char type of a single byte, usually 8 bits, stores a single character, and the character and string literals are based on the char type. In Microsoft Windows, the char type could be of CP437 encoding, or CP850, etc. depending on the regional settings. The char type is commonly said to be ASCII characters, however most systems actually use a superset of ASCII such as CP437 or ISO 8859-1.

ASCII ISO 8859-1 ISO 8859-14 CP1252 Vietnamese Japanese (halfwidth) CP437
character count: 95 191 191 218 229 163 255
bits in its binary format: 7 8 8 8 8 8 8
Empty set
extensions (sets not intended to be used on their own)
IPA-MINI
character sets up to 8-bit (up to 256 characters)
ASCII
ISO 8859-1
ISO 8859-14
CP1252
Vietnamese
Japanese (halfwidth)
CP437
bits in its binary format: 7 8 8 8 8 8 8
character count: 95 191 191 218 229 163 255
ASCII ISO 8859-1 ISO 8859-14 CP1252 Vietnamese Japanese (halfwidth) CP437

Character sets more than 8-bit comparison

Unicode is much larger than any 8-bit codepage, therefore it may be unfeasible to design every single Unicode character. Therefore the Unicode subsets are made to be a point of inclusion in the font, so that a font can be marketted as for instance having Subset2 character set.

The Japanese character set is based on the Shift JIS encoding with 163 8-bit characters and 7326 16-bit characters. For all other character sets more than 8-bit, it is recommended to use the UTF-16 encoding where BMP characters are 16-bit, and non-BMP characters are 32-bit.

MES-1 VSECS W1G WGL4 Subset1 SECS KRA-1.0 MES-2 KRA-1.1 Subset2 KRA-1.2 LPTT-1 LPTT-1&Subset1 LPTT-1.1 MES-3B Subset3 Subset3+ LPTT-1.3 LPTT-1.4 Japanese
character count: 335 361 601 657 678 708 999 1064 1098 1193 1351 1999 2005 2201 2821 2823 3309 3501 3996 7489
Empty set
extensions (sets not intended to be used on their own)
IPA-MINI
character sets up to 8-bit (up to 256 characters)
ASCII
ISO 8859-1
ISO 8859-14
CP1252
Vietnamese
Japanese (halfwidth)
CP437
character sets more than 8-bit (comparison with each other)
MES-1
VSECS
W1G
WGL4
Subset1
SECS
KRA-1.0
MES-2
KRA-1.1
Subset2
KRA-1.2
LPTT-1
LPTT-1&Subset1
LPTT-1.1
MES-3B
Subset3
Subset3+
LPTT-1.3
LPTT-1.4
Japanese
character count: 335 361 601 657 678 708 999 1064 1098 1193 1351 1999 2005 2201 2821 2823 3309 3501 3996 7489
MES-1 VSECS W1G WGL4 Subset1 SECS KRA-1.0 MES-2 KRA-1.1 Subset2 KRA-1.2 LPTT-1 LPTT-1&Subset1 LPTT-1.1 MES-3B Subset3 Subset3+ LPTT-1.3 LPTT-1.4 Japanese