Mapping of Unicode Characters - Whitespace Characters

Whitespace Characters

Unicode provides a list of characters it deems whitespace characters for interoperability support. Software Implementations and other standards may use the term to denote a slightly different set of characters. For example, Java does not consider U+00A0 no-break space or U+0085 (NEXT LINE) to be whitespace, even though Unicode does. Whitespace characters are characters typically designated for programming environments. Often they have no syntactic meaning in such programming environments and are ignored by the machine interpreters. Unicode designates the legacy control characters U+0009 through U+000D and U+0085 as whitespace characters, as well as all characters whose General Category property value is Separator. There are 26 total whitespace characters as of Unicode 6.0.0.

Read more about this topic:  Mapping Of Unicode Characters

Famous quotes containing the word characters:

    Of all the characters I have known, perhaps Walden wears best, and best preserves its purity. Many men have been likened to it, but few deserve that honor. Though the woodchoppers have laid bare first this shore and then that, and the Irish have built their sties by it, and the railroad has infringed on its border, and the ice-men have skimmed it once, it is itself unchanged, the same water which my youthful eyes fell on; all the change is in me.
    Henry David Thoreau (1817–1862)