Byte Pair Encoding

Byte pair encoding or digram coding is a simple form of data compression in which the most common pair of consecutive bytes of data is replaced with a byte that does not occur within that data. A table of the replacements is required to rebuild the original data. The algorithm was first described publicly by Philip Gage in a February 1994 article "A New Algorithm for Data Compression" in the C Users Journal.

Read more about Byte Pair Encoding:  Byte Pair Encoding Example

Famous quotes containing the word pair:

    I well recall my horror when I heard for the first time, of a journalist who had laid in a pair of what were then called bicycle pants and taken to golf; it was as if I had encountered a studhorse with his hair done up in frizzes, and pink bowknots peeking out of them. It seemed, in some vague way, ignominious, and even a bit indelicate.
    —H.L. (Henry Lewis)