Universal text preprocessing and postprocessing for PPM using Alphabet Adjustment
Research output: Contribution to conference › Paper
Standard Standard
2014. 395 Paper presented at Proceedings of the Data Compression Conference, Snowbird, Utah, 26 - 28 March 2014.
Research output: Contribution to conference › Paper
HarvardHarvard
APA
CBE
MLA
VancouverVancouver
Author
RIS
TY - CONF
T1 - Universal text preprocessing and postprocessing for PPM using Alphabet Adjustment
AU - Alhawiti, K.
AU - Teahan, W.J.
PY - 2014/3/26
Y1 - 2014/3/26
N2 - In this paper, we introduce several new universal pre-processing techniques to improve Prediction by Partial Matching (PPM) compression of UTF-8 encoded natural language text. These methods essentially 'adjust' the alphabet in some manner (for example, by expanding or reducing it) prior to the compression algorithm then being applied to the amended text.
AB - In this paper, we introduce several new universal pre-processing techniques to improve Prediction by Partial Matching (PPM) compression of UTF-8 encoded natural language text. These methods essentially 'adjust' the alphabet in some manner (for example, by expanding or reducing it) prior to the compression algorithm then being applied to the amended text.
U2 - 10.1109/DCC.2014.12
DO - 10.1109/DCC.2014.12
M3 - Paper
SP - 395
T2 - Proceedings of the Data Compression Conference, Snowbird, Utah, 26 - 28 March 2014
Y2 - 3 January 0001
ER -