Extended ASCII - Character Set Confusion

Character Set Confusion

Because these ASCII extensions have so many variants, it is necessary to identify which set is being used for a particular text for it to be interpreted correctly. However, because the most-used characters (those in ASCII, the seven-bit code points) are common to all sets—even most proprietary ones—failure to correctly identify a character set often suffers no adverse consequences if the user is typing in English. Further, because many Internet standards use ISO 8859-1, and because Microsoft Windows (using the code page 1252 superset of ISO 8859-1) is the dominant operating system for personal computers today, unannounced use of ISO 8859-1 is quite commonplace, and may generally be assumed without evidence to the contrary.

In many protocols, most importantly e-mail and HTTP, the character encoding of content has to be tagged with IANA-assigned character set identifiers.

Read more about this topic:  Extended ASCII

Famous quotes containing the words character, set and/or confusion:

    It is a part of the American character to consider nothing as desperate; to surmount every difficulty by resolution and contrivance.
    Thomas Jefferson (1743–1826)

    You have a row of dominoes set up; you knock over the first one, and what will happen to the last one is that it will go over very quickly.
    Dwight D. Eisenhower (1890–1969)

    The LORD will afflict you with madness, blindness, and confusion of mind; you shall grope about at noon as blind people grope in darkness, but you shall be unable to find your way; and you shall be continually abused and robbed, without anyone to help.
    Bible: Hebrew, Deuteronomy 28:28,29.