Toggle Main Menu Toggle Search

Open Access padlockePrints

SINC: Semantic approach and enhancement for relational data compression

Lookup NU author(s): Professor Raj Ranjan


Full text for this publication is not currently held within this repository. Alternative links are provided below where available.


© 2022 Elsevier B.V.Data compression has been widely adopted in the industry to reduce storage or bandwidth consumption by removing redundant data or encoding information. Redundancy in semantics implies that some facts in a knowledge base can be inferred from the others. For relational databases, it is possible to remove records due to semantic equivalence. In this paper, we present a purely semantic approach, which losslessly compresses relational data in the first place and also enhances data file compression to further reduce the storage. Our Semantic Inductive Compressor (SINC) works not only for intra-relation patterns but also inter-relation cases. SINC achieves around 1/3 to 2/3 of semantic compression ratios, and the original data can be entirely retrieved with the informative patterns induced by SINC. We apply industrial data compression tools on semantically compressed databases, and the experiment results indicate an enhanced compression ratio up to 35%. Almost all efforts in our technique turn to the enhancement.

Publication metadata

Author(s): Wang R, Sun D, Wong R, Ranjan R, Zomaya AY

Publication type: Article

Publication status: Published

Journal: Knowledge-Based Systems

Year: 2022

Volume: 258

Print publication date: 22/12/2022

Online publication date: 20/10/2022

Acceptance date: 10/10/2022

ISSN (electronic): 0950-7051

Publisher: Elsevier B.V.


DOI: 10.1016/j.knosys.2022.110001


Altmetrics provided by Altmetric