background preloader

Unicode

Facebook Twitter

Mapping of Unicode characters. UCS has a potential capacity to encode over 1 million characters.

Mapping of Unicode characters

Each UCS character is abstractly represented by a code point, which is an integer between 0 and 1,114,111, used to represent each character within the internal logic of text processing software (1,114,112 = 220 + 216 or 17 × 216, or hexadecimal 110000 code points). As of Unicode 6.2, released in September 2012, 249,764 (22.4%) of these code points are assigned, including 110,182 (9.9%) encoded characters, 137,468 (12.3%) reserved for private use, 2,048 for surrogates, and 66 designated non-characters, leaving 864,348 (77.6%) unassigned. The number of encoded characters is made up as follows:

Code Charts. Specials Controls: C0, C1 Layout Controls Invisible Operators.

Code Charts

Unicode typeface. Background[edit] Computer fonts use various techniques to display characters or glyphs. A bitmap font contains a grid of dots known as pixels forming an image of each glyph in each face and size.