Abstract
In this paper, we introduce several new universal pre-processing techniques that improve Prediction by Partial Matching (PPM) compression of UTF-8 encoded natural language text. Each technique adjusts the alphabet in some way (for example, by expanding or reducing it) before the compression algorithm is applied to the amended text.
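To make the idea of expanding the alphabet concrete, the following is a minimal sketch and not the authors' method, which the abstract does not specify. It assumes one possible expansion: the most frequent adjacent character pairs are replaced with single new symbols taken from the Unicode Private Use Area, and the mapping is kept so the step can be undone. The names `expand_alphabet`, `restore_alphabet`, and `max_new_symbols` are hypothetical. The PPM compressor itself is not shown; the amended text would be encoded as UTF-8 and handed to whichever PPM implementation is in use.

```python
from collections import Counter

# Assumption: new symbols come from the Unicode Private Use Area (U+E000..U+F8FF),
# which ordinary natural language text is not expected to contain.
PRIVATE_USE_START = 0xE000
PRIVATE_USE_END = 0xF8FF


def expand_alphabet(text, max_new_symbols=16):
    """Replace the most frequent adjacent character pairs with single new symbols.

    Returns the amended text and the pair -> symbol table needed to reverse it.
    """
    if any(PRIVATE_USE_START <= ord(ch) <= PRIVATE_USE_END for ch in text):
        raise ValueError("text already uses private-use code points")

    # Count every overlapping pair of adjacent characters.
    pair_counts = Counter(text[i:i + 2] for i in range(len(text) - 1))

    # Assign one new symbol to each of the most frequent pairs.
    table = {}
    for offset, (pair, _count) in enumerate(pair_counts.most_common(max_new_symbols)):
        table[pair] = chr(PRIVATE_USE_START + offset)

    # Greedy left-to-right substitution.
    out = []
    i = 0
    while i < len(text):
        pair = text[i:i + 2]
        if pair in table:
            out.append(table[pair])
            i += 2
        else:
            out.append(text[i])
            i += 1
    return "".join(out), table


def restore_alphabet(encoded, table):
    """Undo expand_alphabet by mapping each new symbol back to its pair."""
    inverse = {symbol: pair for pair, symbol in table.items()}
    return "".join(inverse.get(ch, ch) for ch in encoded)
```

A round trip such as `restore_alphabet(*expand_alphabet("the theme of the thesis"))` returns the original string. Whether a substitution of this kind actually helps a given PPM compressor depends on the text and the model order, so it would need to be measured rather than assumed.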
| Original language | English |
|---|---|
| Pages | 395 |
| Publication status | Published - 26 Mar 2014 |
| Event | Proceedings of the Data Compression Conference, Snowbird, Utah, 26 - 28 March 2014 |