Extended ASCII - Character Set Confusion

Character Set Confusion

Because these ASCII extensions have so many variants, it is necessary to identify which set is being used for a particular text for it to be interpreted correctly. However, because the most-used characters (those in ASCII, the seven-bit code points) are common to all sets—even most proprietary ones—failure to correctly identify a character set often suffers no adverse consequences if the user is typing in English. Further, because many Internet standards use ISO 8859-1, and because Microsoft Windows (using the code page 1252 superset of ISO 8859-1) is the dominant operating system for personal computers today, unannounced use of ISO 8859-1 is quite commonplace, and may generally be assumed without evidence to the contrary.

In many protocols, most importantly e-mail and HTTP, the character encoding of content has to be tagged with IANA-assigned character set identifiers.

Read more about this topic:  Extended ASCII

Famous quotes containing the words character, set and/or confusion:

    Note too that a faithful study of the liberal arts humanizes character and permits it not to be cruel.
    Ovid (Publius Ovidius Naso)

    Parenthood always comes as a shock. Postpartum blues? Postpartum panic is more like it. We set out to have a baby; what we get is a total take-over of our lives.
    Polly Berrien Berends (20th century)

    There is ... no glamor at banquets—I mean the large formal banquets of big associations and societies. There is only a kind of dignified confusion that gradually unhinges the mind.
    James Thurber (1894–1961)