Byte Pair Encoding

Byte pair encoding or digram coding is a simple form of data compression in which the most common pair of consecutive bytes of data is replaced with a byte that does not occur within that data. A table of the replacements is required to rebuild the original data. The algorithm was first described publicly by Philip Gage in a February 1994 article "A New Algorithm for Data Compression" in the C Users Journal.

Read more about Byte Pair Encoding:  Byte Pair Encoding Example

Famous quotes containing the word pair:

    Firm-style bean curd insoles cushion feet, absorb perspiration and provide more protein than meat or fish innersoles of twice the weight. Tofu compresses with use, becoming more pungent and flavorful. May be removed when not in use to dry or marinate. Innersoles are ready to eat after 1,200 miles of wear. Each pair provides adult protein requirement for 2 meals. Insoles are sized large to allow for snacks. Recipe booklet included.
    Alfred Gingold, U.S. humorist. Items From Our Catalogue, “Tofu Innersoles,” Avon Books (1982)