Browse by author
Lookup NU author(s): Professor Raj Ranjan
Full text for this publication is not currently held within this repository. Alternative links are provided below where available.
© 2022 Elsevier B.V.Data compression has been widely adopted in the industry to reduce storage or bandwidth consumption by removing redundant data or encoding information. Redundancy in semantics implies that some facts in a knowledge base can be inferred from the others. For relational databases, it is possible to remove records due to semantic equivalence. In this paper, we present a purely semantic approach, which losslessly compresses relational data in the first place and also enhances data file compression to further reduce the storage. Our Semantic Inductive Compressor (SINC) works not only for intra-relation patterns but also inter-relation cases. SINC achieves around 1/3 to 2/3 of semantic compression ratios, and the original data can be entirely retrieved with the informative patterns induced by SINC. We apply industrial data compression tools on semantically compressed databases, and the experiment results indicate an enhanced compression ratio up to 35%. Almost all efforts in our technique turn to the enhancement.
Author(s): Wang R, Sun D, Wong R, Ranjan R, Zomaya AY
Publication type: Article
Publication status: Published
Journal: Knowledge-Based Systems
Print publication date: 22/12/2022
Online publication date: 20/10/2022
Acceptance date: 10/10/2022
ISSN (electronic): 0950-7051
Publisher: Elsevier B.V.
Altmetrics provided by Altmetric