Toggle Main Menu Toggle Search

Open Access padlockePrints

Laser: Efficient Language-Guided Segmentation in Neural Radiance Fields

Lookup NU author(s): Dr Haoran Duan, Dr Tejal Shah, Dr Yang Long, Professor Raj Ranjan

Downloads


Licence

This work is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0).


Abstract

In this work, we propose a method that leverages CLIP feature distillation, achieving efficient 3D segmentation through language guidance. Unlike previous methods that rely on multi-scale CLIP features and are limited by processing speed and storage requirements, our approach aims to streamline the workflow by directly and effectively distilling dense CLIP features, thereby achieving precise segmentation of 3D scenes using text. To achieve this, we introduce an adapter module and mitigate the noise issue in the dense CLIP feature distillation process through a self-cross-training strategy. Moreover, to enhance the accuracy of segmentation edges, this work presents a low-rank transient query attention mechanism. To ensure the consistency of segmentation for similar colors under different viewpoints, we convert the segmentation task into a classification task through label volume, which significantly improves the consistency of segmentation in color-similar areas. We also propose a simplified text augmentation strategy to alleviate the issue of ambiguity in the correspondence between CLIP features and text. Extensive experimental results show that our method surpasses current state-of-the-art technologies in both training speed and performance. Our code is available on: https://github.com/xingy038/Laser.git.


Publication metadata

Author(s): Miao X, Duan Haoran, Bai Y, Shah T, Song J, Long Y, Ranjan R, Shao Ling

Publication type: Article

Publication status: Published

Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

Year: 2025

Pages: epub ahead of print

Online publication date: 29/01/2025

Acceptance date: 25/01/2025

Date deposited: 10/02/2025

ISSN (print): 0162-8828

ISSN (electronic): 1939-3539

Publisher: IEEE

URL: https://doi.org/10.1109/TPAMI.2025.3535916

DOI: 10.1109/TPAMI.2025.3535916


Altmetrics

Altmetrics provided by Altmetric


Funding

Funder referenceFunder name
MR/S003916/2
MRC

Share