Browse by author
Lookup NU author(s): Dr Shidong WangORCiD
Full text for this publication is not currently held within this repository. Alternative links are provided below where available.
© 2025 Elsevier Ltd. Compositional Zero-Shot Learning (CZSL) aims to recognize novel compositions of objects and states by transferring knowledge from seen compositions. A critical omission in prior studies is the uniform penalty imposed on all incorrect compositions, ignoring their inherent affinities with the ground-truth labels. This oversight leads to severe overfitting on seen classes and impedes the discovery of genuine visual-semantic relationships. To address this, we propose Clique-based Interclass Affinity (CIA), a framework that introduces hierarchical semantic supervision by grouping compositions into affinity cliques. CIA encodes both semantic affinity and visual affinity to construct multi-level cliques. These cliques guide a one-to-many alignment between visual and semantic features, enabling the model to learn generalizable class prototypes through structured constraints, rather than treating all incorrect classes equally. Unlike prior works focusing on direct classification, CIA emphasizes unveiling intrinsic compositional structures by analyzing inter-semantic and visual relationships. Extensive experiments on MIT-States, UT-Zappos, and C-GQA demonstrate CIA's superiority, showcasing its robustness in both closed-world and open-world settings. Our code is available at https://github.com/LanchJL/CIA-CZSL.
Author(s): Jiang C, Ye Q, Wang S, Wu Z, Zhang H
Publication type: Article
Publication status: Published
Journal: Pattern Recognition
Year: 2026
Volume: 173
Print publication date: 01/05/2026
Online publication date: 08/12/2025
Acceptance date: 02/04/2018
ISSN (print): 0031-3203
ISSN (electronic): 1873-5142
Publisher: Elsevier Ltd
URL: https://doi.org/10.1016/j.patcog.2025.112819
DOI: 10.1016/j.patcog.2025.112819
Altmetrics provided by Altmetric