Abstract
Frequency shortcuts refer to specific frequency patterns that models heavily rely on for correct classification. Previous studies have shown that models trained on small image datasets often exploit such shortcuts, potentially impairing their generalization performance. However, existing methods for identifying frequency shortcuts require expensive computations and become impractical for analyzing models trained on large datasets. In this work, we propose the first approach to more efficiently analyze frequency shortcuts at a large scale. We show that both CNN and transformer models learn frequency shortcuts on ImageNet. We also expose that frequency shortcut solutions can yield good performance on out-of-distribution (OOD) test sets which largely retain texture information. However, these shortcuts, mostly aligned with texture patterns, hinder model generalization on rendition-based OOD test sets. These observations suggest that current OOD evaluations often overlook the impact of frequency shortcuts on model generalization. Future benchmarks could thus benefit from explicitly assessing and accounting for these shortcuts to build models that generalize across a broader range of OOD scenarios. Codes are available at https://github.com/nis-research/hfss.
| Original language | English |
|---|---|
| Title of host publication | 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) |
| Publisher | IEEE Advancing Technology for Humanity |
| Pages | 25198-25207 |
| Number of pages | 10 |
| ISBN (Electronic) | 979-8-3315-4364-8 |
| ISBN (Print) | 979-8-3315-4365-5 |
| DOIs | |
| Publication status | Published - 13 Aug 2025 |
| Event | IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2025: 8th Multimodal Learning and Applications Workshop - Nashville, TN, USA, Nashville, United States Duration: 11 Jun 2025 → 15 Jun 2025 |
Workshop
| Workshop | IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2025 |
|---|---|
| Abbreviated title | CVPR 2025 |
| Country/Territory | United States |
| City | Nashville |
| Period | 11/06/25 → 15/06/25 |
Keywords
- 2025 OA procedure
Fingerprint
Dive into the research topics of 'Do ImageNet-trained models learn shortcuts? The impact of frequency shortcuts on generalization'. Together they form a unique fingerprint.Activities
- 1 Invited talk
-
Beyond Scaling: Toward Data-Efficient and Reliable Vision Models
Strisciuglio, N. (Speaker)
7 May 2026Activity: Talk or presentation › Invited talk
Research output
- 1 Preprint
-
Do ImageNet-trained models learn shortcuts? The impact of frequency shortcuts on generalization
Wang, S., Veldhuis, R. & Strisciuglio, N., 5 Mar 2025, ArXiv.org.Research output: Working paper › Preprint › Academic
Open AccessFile49 Downloads (Pure)
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver