Universal text preprocessing and postprocessing for PPM using Alphabet Adjustment
Research output: Contribution to conference › Paper
Electronic versions
DOI
In this paper, we introduce several new universal pre-processing techniques to improve Prediction by Partial Matching (PPM) compression of UTF-8 encoded natural language text. These methods essentially 'adjust' the alphabet in some manner (for example, by expanding or reducing it) prior to the compression algorithm then being applied to the amended text.
Original language | English |
---|---|
Pages | 395 |
DOIs | |
Publication status | Published - 26 Mar 2014 |
Event | Proceedings of the Data Compression Conference, Snowbird, Utah, 26 - 28 March 2014 - Duration: 3 Jan 0001 → … |
Conference
Conference | Proceedings of the Data Compression Conference, Snowbird, Utah, 26 - 28 March 2014 |
---|---|
Period | 3/01/01 → … |