DADO: A Depth-Attention Framework for Object Discovery

Research output: Chapter in Book/Report/Conference proceedingChapterAcademicpeer-review

Abstract

Unsupervised object discovery, the task of identifying and localizing objects in images without human-annotated labels, remains a significant challenge and a growing focus in computer vision. In this work, we introduce a novel model, DADO (Depth-Attention self-supervised technique for Discovering unseen Objects), which combines an attention mechanism and a depth model to identify potential objects in images. To address challenges such as noisy attention maps or complex scenes with varying depth planes, DADO employs dynamic weighting to adaptively emphasize attention or depth features based on the global characteristics of each image. We evaluated DADO on standard benchmarks, where it outperforms state-of-the-art methods in object discovery accuracy and robustness without the need for fine-tuning.
Original languageEnglish
Title of host publicationComputer Analysis of Images and Patterns
Subtitle of host publication21st International Conference, CAIP 2025, Las Palmas de Gran Canaria, Spain, September 22–25, 2025, Proceedings, Part II
EditorsModesto Castrillón-Santana, Carlos M. Travieso-González, David Freire-Obregón, Daniel Hernández-Sosa, Javier Lorenzo-Navarro, Oliverio J. Santana, Oscar Deniz Suarez
PublisherSpringer
Pages281-291
Number of pages11
ISBN (Electronic)978-3-032-05060-1
ISBN (Print)978-3-032-05059-5
DOIs
Publication statusPublished - 2026
Event21st International Conference on Computer Analysis of Images and Patterns, CAIP 2025 - Museo Elder de la Ciencia y la Tecnología, Las Palmas de Gran Canaria, Spain
Duration: 22 Sept 202525 Sept 2025
Conference number: 21
https://caip2025.com/

Publication series

Name Lecture Notes in Computer Science
Volume15622

Conference

Conference21st International Conference on Computer Analysis of Images and Patterns, CAIP 2025
Abbreviated titleCAIP 2025
Country/TerritorySpain
CityLas Palmas de Gran Canaria
Period22/09/2525/09/25
Internet address

Keywords

  • NLA

Fingerprint

Dive into the research topics of 'DADO: A Depth-Attention Framework for Object Discovery'. Together they form a unique fingerprint.

Cite this