Extended ASCII - Character Set Confusion

Character Set Confusion

Because these ASCII extensions have so many variants, it is necessary to identify which set is being used for a particular text for it to be interpreted correctly. However, because the most-used characters (those in ASCII, the seven-bit code points) are common to all sets—even most proprietary ones—failure to correctly identify a character set often suffers no adverse consequences if the user is typing in English. Further, because many Internet standards use ISO 8859-1, and because Microsoft Windows (using the code page 1252 superset of ISO 8859-1) is the dominant operating system for personal computers today, unannounced use of ISO 8859-1 is quite commonplace, and may generally be assumed without evidence to the contrary.

In many protocols, most importantly e-mail and HTTP, the character encoding of content has to be tagged with IANA-assigned character set identifiers.

Read more about this topic:  Extended ASCII

Famous quotes containing the words character, set and/or confusion:

    If man is reduced to being nothing but a character in history, he has no other choice but to subside into the sound and fury of a completely irrational history or to endow history with the form of human reason.
    Albert Camus (1913–1960)

    Let us speak, though we show all our faults and weaknesses,—for it is a sign of strength to be weak, to know it, and out with it,—not in a set way and ostentatiously, though, but incidentally and without premeditation.
    Herman Melville (1819–1891)

    There is ... no glamor at banquets—I mean the large formal banquets of big associations and societies. There is only a kind of dignified confusion that gradually unhinges the mind.
    James Thurber (1894–1961)