Character Set Confusion
Because these ASCII extensions have so many variants, it is necessary to identify which set is being used for a particular text for it to be interpreted correctly. However, because the most-used characters (those in ASCII, the seven-bit code points) are common to all sets—even most proprietary ones—failure to correctly identify a character set often suffers no adverse consequences if the user is typing in English. Further, because many Internet standards use ISO 8859-1, and because Microsoft Windows (using the code page 1252 superset of ISO 8859-1) is the dominant operating system for personal computers today, unannounced use of ISO 8859-1 is quite commonplace, and may generally be assumed without evidence to the contrary.
In many protocols, most importantly e-mail and HTTP, the character encoding of content has to be tagged with IANA-assigned character set identifiers.
Read more about this topic: Extended ASCII
Famous quotes containing the words character, set and/or confusion:
“The first thing to be done by a biographer in estimating character is to examine the stubs of his victims cheque-books.”
—Silas Weir Mitchell (18291914)
“The whole object of travel is not to set foot on foreign land; it is at last to set foot on ones own country as a foreign land.”
—Gilbert Keith Chesterton (18741936)
“There is ... no glamor at banquetsI mean the large formal banquets of big associations and societies. There is only a kind of dignified confusion that gradually unhinges the mind.”
—James Thurber (18941961)