Character Set Confusion
Because these ASCII extensions have so many variants, a text can be interpreted correctly only if the character set it uses is identified. However, the most-used characters (those in ASCII, the seven-bit code points) are common to all sets, including most proprietary ones, so misidentifying the character set often has no visible consequences when the text is in English. Further, because many Internet standards use ISO 8859-1, and because Microsoft Windows (which uses code page 1252, a superset of ISO 8859-1) is the dominant operating system for personal computers today, unannounced use of ISO 8859-1 is quite commonplace, and it may generally be assumed in the absence of evidence to the contrary.
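As a minimal sketch of the point above, the following Python snippet decodes the same bytes under three extended-ASCII character sets. The sample bytes and the helper name decode_all are illustrative assumptions, not part of any standard; the codec names are those shipped with Python's standard library.

```python
# Decode identical bytes under three extended-ASCII character sets.
# The sample data and the helper name are hypothetical, chosen for illustration.

ENCODINGS = ["latin_1", "cp1252", "iso8859_15"]

def decode_all(data, encodings=ENCODINGS):
    """Return {encoding: decoded text}; undecodable bytes become U+FFFD."""
    return {enc: data.decode(enc, errors="replace") for enc in encodings}

# Bytes below 0x80 are plain ASCII and shared by all three sets.
ascii_results = decode_all(b"plain English text")

# Bytes at or above 0x80 depend on which set is assumed:
# 0x80 is the euro sign only in cp1252, 0xA4 is the euro sign only in
# ISO 8859-15, and 0xE9 is e-acute in all three.
high_results = decode_all(bytes([0x80, 0xA4, 0xE9]))

for enc in ENCODINGS:
    print(enc, repr(ascii_results[enc]), repr(high_results[enc]))
```

The ASCII-only input decodes identically everywhere, which is why a wrong guess usually goes unnoticed for English text, while the three high bytes yield three different strings.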
In many protocols, most importantly e-mail and HTTP, the character encoding of content must be declared with an IANA-assigned character set identifier.
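In e-mail and HTTP this identifier is typically carried in the charset parameter of the Content-Type header. The sketch below, which assumes Python's standard email package, reads that parameter and decodes a body accordingly. The function names are hypothetical, and falling back to ISO 8859-1 when no charset is declared is an assumption of this example that mirrors the common practice described above.

```python
from email.message import Message

def charset_from_content_type(header_value, default="iso-8859-1"):
    """Extract the charset parameter from a Content-Type header value.

    Returns the declared charset in lowercase, or `default` when the
    header carries no charset parameter (an assumption of this sketch).
    """
    msg = Message()
    msg["Content-Type"] = header_value
    return msg.get_content_charset(failobj=default)

def decode_body(body, header_value):
    """Decode raw body bytes using the charset named in the header."""
    return body.decode(charset_from_content_type(header_value))

declared = charset_from_content_type("text/html; charset=UTF-8")
fallback = charset_from_content_type("text/plain")
text = decode_body(b"caf\xe9", "text/plain; charset=ISO-8859-1")
print(declared, fallback, text)
```

A label that Python does not recognise as a codec name would raise LookupError in decode_body, so real code would need to handle that case.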