Extended ASCII - Character Set Confusion

Character Set Confusion

Because these ASCII extensions have so many variants, it is necessary to identify which set is being used for a particular text for it to be interpreted correctly. However, because the most-used characters (those in ASCII, the seven-bit code points) are common to all sets—even most proprietary ones—failure to correctly identify a character set often suffers no adverse consequences if the user is typing in English. Further, because many Internet standards use ISO 8859-1, and because Microsoft Windows (using the code page 1252 superset of ISO 8859-1) is the dominant operating system for personal computers today, unannounced use of ISO 8859-1 is quite commonplace, and may generally be assumed without evidence to the contrary.

In many protocols, most importantly e-mail and HTTP, the character encoding of content has to be tagged with IANA-assigned character set identifiers.

Read more about this topic:  Extended ASCII

Famous quotes containing the words character, set and/or confusion:

    The first thing to be done by a biographer in estimating character is to examine the stubs of his victim’s cheque-books.
    Silas Weir Mitchell (1829–1914)

    The whole object of travel is not to set foot on foreign land; it is at last to set foot on one’s own country as a foreign land.
    Gilbert Keith Chesterton (1874–1936)

    There is ... no glamor at banquets—I mean the large formal banquets of big associations and societies. There is only a kind of dignified confusion that gradually unhinges the mind.
    James Thurber (1894–1961)