Abstract
Frequency analysis is useful for understanding the mechanisms of representation learning in neural networks (NNs). Most research in this area focuses on the learning dynamics of NNs for regression tasks, while little for classification. This study empirically investigates the latter and expands the understanding of frequency shortcuts. First, we perform experiments on synthetic datasets, designed to have a bias in different frequency bands. Our results demonstrate that NNs tend to find simple solutions for classification, and what they learn first during training depends on the most distinctive frequency characteristics, which can be either low- or high-frequencies. Second, we confirm this phenomenon on natural images. We propose a metric to measure class-wise frequency characteristics and a method to identify frequency shortcuts. The results show that frequency shortcuts can be texture-based or shape-based, depending on what best simplifies the objective. Third, we validate the transferability of frequency shortcuts on out-of-distribution (OOD) test sets. Our results suggest that frequency shortcuts can be transferred across datasets and cannot be fully avoided by larger model capacity and data augmentation. We recommend that future research should focus on effective training schemes mitigating frequency shortcut learning. Codes and data are available at https://github.com/nis-research/nn-frequency-shortcuts.
| Original language | English |
|---|---|
| Title of host publication | 2023 IEEE/CVF International Conference on Computer Vision (ICCV) |
| Place of Publication | Piscataway, NJ |
| Publisher | IEEE |
| Pages | 1433-1442 |
| Number of pages | 10 |
| ISBN (Electronic) | 979-8-3503-0718-4 |
| ISBN (Print) | 979-8-3503-0719-1 |
| DOIs | |
| Publication status | Published - 6 Oct 2023 |
| Event | 2023 IEEE/CVF International Conference on Computer Vision, ICCV 2023 - Paris, France Duration: 1 Oct 2023 → 6 Oct 2023 |
Publication series
| Name | Proceedings IEEE/CVF International Conference on Computer Vision (ICCV) |
|---|---|
| Publisher | IEEE |
| Volume | 2023 |
| ISSN (Print) | 1550-5499 |
| ISSN (Electronic) | 2380-7504 |
Conference
| Conference | 2023 IEEE/CVF International Conference on Computer Vision, ICCV 2023 |
|---|---|
| Abbreviated title | ICCV |
| Country/Territory | France |
| City | Paris |
| Period | 1/10/23 → 6/10/23 |
Keywords
- Training
- Representation learning
- Systematics
- Artificial neural networks
- Data augmentation
- Data models
- Frequency measurement
- 2024 OA procedure
Fingerprint
Dive into the research topics of 'What do neural networks learn in image classification? A frequency shortcut perspective'. Together they form a unique fingerprint.-
DFM-X: Augmentation by Leveraging Prior Knowledge of Shortcut Learning
Wang, S., Brune, C., Veldhuis, R. & Strisciuglio, N., 6 Oct 2023, 2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW). Piscataway, NJ: IEEE, p. 129-138 10 p. 10350684. (Proceedings IEEE/CVF International Conference on Computer Vision Workshops (ICCVW); vol. 2023).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review
Open AccessFile4 Link opens in a new tab Citations (Scopus)154 Downloads (Pure) -
What do neural networks learn in image classification? A frequency shortcut perspective
Wang, S., Veldhuis, R., Brune, C. & Strisciuglio, N., 19 Jul 2023, ArXiv.org, 17 p.Research output: Working paper › Preprint › Academic
Open AccessFile39 Downloads (Pure)
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver