On the compression of search trees

•Space-efficient representation for non-decreasing sequences of integers.•Efficient support for random access and searches.•Proved performance in main and secondary memory.•Results competitive with the state of the art.•Applications to several domains: posting lists, sparse bitmaps, geographic data,...

Full description

Saved in:
Bibliographic Details
Published inInformation processing & management Vol. 50; no. 2; pp. 272 - 283
Main Authors Claude, Francisco, Nicholson, Patrick K., Seco, Diego
Format Journal Article
LanguageEnglish
Published Kidlington Elsevier Ltd 01.03.2014
Elsevier
Elsevier Science Ltd
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:•Space-efficient representation for non-decreasing sequences of integers.•Efficient support for random access and searches.•Proved performance in main and secondary memory.•Results competitive with the state of the art.•Applications to several domains: posting lists, sparse bitmaps, geographic data, etc. Let X=x1,x2,…,xn be a sequence of non-decreasing integer values. Storing a compressed representation of X that supports access and search is a problem that occurs in many domains. The most common solution to this problem uses a linear list and encodes the differences between consecutive values with encodings that favor small numbers. This solution includes additional information (i.e. samples) to support efficient searching on the encoded values. We introduce a completely different alternative that achieves compression by encoding the differences in a search tree. Our proposal has many applications, such as the representation of posting lists, geographic data, sparse bitmaps, and compressed suffix arrays, to name just a few. The structure is practical and we provide an experimental evaluation to show that it is competitive with the existing techniques.
ISSN:0306-4573
1873-5371
DOI:10.1016/j.ipm.2013.11.002