Byte pair encoding or digram coding is a simple form of data compression in which the most common pair of consecutive bytes of data is replaced with a byte that does not occur within that data. A table of the replacements is required to rebuild the original data. The algorithm was first described publicly by Philip Gage in a February 1994 article "A New Algorithm for Data Compression" in the C Users Journal.
Read more about Byte Pair Encoding: Byte Pair Encoding Example
Famous quotes containing the word pair:
“I well recall my horror when I heard for the first time, of a journalist who had laid in a pair of what were then called bicycle pants and taken to golf; it was as if I had encountered a studhorse with his hair done up in frizzes, and pink bowknots peeking out of them. It seemed, in some vague way, ignominious, and even a bit indelicate.”
—H.L. (Henry Lewis)