Skip to main navigation Skip to search Skip to main content

Do ImageNet-trained models learn shortcuts? The impact of frequency shortcuts on generalization

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

16 Downloads (Pure)

Abstract

Frequency shortcuts refer to specific frequency patterns that models heavily rely on for correct classification. Previous studies have shown that models trained on small image datasets often exploit such shortcuts, potentially impairing their generalization performance. However, existing methods for identifying frequency shortcuts require expensive computations and become impractical for analyzing models trained on large datasets. In this work, we propose the first approach to more efficiently analyze frequency shortcuts at a large scale. We show that both CNN and transformer models learn frequency shortcuts on ImageNet. We also expose that frequency shortcut solutions can yield good performance on out-of-distribution (OOD) test sets which largely retain texture information. However, these shortcuts, mostly aligned with texture patterns, hinder model generalization on rendition-based OOD test sets. These observations suggest that current OOD evaluations often overlook the impact of frequency shortcuts on model generalization. Future benchmarks could thus benefit from explicitly assessing and accounting for these shortcuts to build models that generalize across a broader range of OOD scenarios. Codes are available at https://github.com/nis-research/hfss.
Original languageEnglish
Title of host publication2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
PublisherIEEE Advancing Technology for Humanity
Pages25198-25207
Number of pages10
ISBN (Electronic)979-8-3315-4364-8
ISBN (Print)979-8-3315-4365-5
DOIs
Publication statusPublished - 13 Aug 2025
EventIEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2025: 8th Multimodal Learning and Applications Workshop - Nashville, TN, USA, Nashville, United States
Duration: 11 Jun 202515 Jun 2025

Workshop

WorkshopIEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2025
Abbreviated titleCVPR 2025
Country/TerritoryUnited States
CityNashville
Period11/06/2515/06/25

Keywords

  • 2025 OA procedure

Fingerprint

Dive into the research topics of 'Do ImageNet-trained models learn shortcuts? The impact of frequency shortcuts on generalization'. Together they form a unique fingerprint.

Cite this