Get flash to fully experience Pearltrees
Unicode ’s Universal Character Set (UCS) has a potential capacity to support over 1 million characters. Each UCS character is mapped to a code point , which is an integer between 0 and 1,114,111, used to represent each character within the internal logic of text processing software (1,114,112 = 2 20 + 2 16 or 17 × 2 16 , or hexadecimal 110000 code points). As of Unicode 6.2, released in September 2012, 249,764 (22.4%) of these code points are assigned, including 110,182 (9.9%) encoded characters, 137,468 (12.3%) reserved for private use , 2,048 for surrogates , and 66 designated noncharacters , leaving 864,348 (77.6%) unassigned. The number of encoded characters is made up as follows: 109,976 graphical characters (some of which are invisible, but are still counted as graphical) 206 special purpose characters for control and formatting.
Scripts | Symbols | Notes
A Unicode font (also known as UCS font and Unicode typeface ) is a computer font that contains a wide range of characters , letters , digits , glyphs , symbols , ideograms , logograms , etc., which are collectively mapped into the standard Universal Character Set , derived from many different languages and scripts from around the world. Unlike most conventional computer fonts, which are specific to a particular language or legacy character set and contain only a small subset of the UCS characters, these fonts attempt to include many thousands of possible glyphs, so that they can be used as a single typeface across multi-lingual documents. [ edit ] Background