Universal text preprocessing and postprocessing for PPM using Alphabet Adjustment
Allbwn ymchwil: Cyfraniad at gynhadledd › Papur
Fersiynau electronig
Dangosydd eitem ddigidol (DOI)
In this paper, we introduce several new universal pre-processing techniques to improve Prediction by Partial Matching (PPM) compression of UTF-8 encoded natural language text. These methods essentially 'adjust' the alphabet in some manner (for example, by expanding or reducing it) prior to the compression algorithm then being applied to the amended text.
Iaith wreiddiol | Saesneg |
---|---|
Tudalennau | 395 |
Dynodwyr Gwrthrych Digidol (DOIs) | |
Statws | Cyhoeddwyd - 26 Maw 2014 |
Digwyddiad | Proceedings of the Data Compression Conference, Snowbird, Utah, 26 - 28 March 2014 - Hyd: 3 Ion 0001 → … |
Cynhadledd
Cynhadledd | Proceedings of the Data Compression Conference, Snowbird, Utah, 26 - 28 March 2014 |
---|---|
Cyfnod | 3/01/01 → … |