TY - GEN
T1 - Fourier-Basis Functions to Bridge Augmentation Gap: Rethinking Frequency Augmentation in Image Classification
AU - Vaish, Puru
AU - Wang, Shunxin
AU - Strisciuglio, Nicola
PY - 2024/9/16
Y1 - 2024/9/16
N2 - Computer vision models normally witness degraded performance when deployed in real-world scenarios, due to unexpected changes in inputs that were not accounted for during training. Data augmentation is commonly used to address this issue, as it aims to increase data variety and reduce the distribution gap between training and test data. However, common visual augmentations might not guaran-tee extensive robustness of computer vision models. In this paper, we propose Auxiliary Fourier-basis Augmentation (AFA), a complementary technique targeting augmentation in the frequency domain and filling the robustness gap left by visual augmentations. We demonstrate the utility of augmentation via Fourier-basis additive noise in a straightforward and efficient adversarial setting. Our results show that AFA benefits the robustness of models against common corruptions, OOD generalization, and consistency of performance of models against increasing perturbations, with negligible deficit to the standard performance of models. It can be seamlessly integrated with other augmentation techniques to further boost performance. Codes and models are available at https://github.com/nis-research/afa-augment.
AB - Computer vision models normally witness degraded performance when deployed in real-world scenarios, due to unexpected changes in inputs that were not accounted for during training. Data augmentation is commonly used to address this issue, as it aims to increase data variety and reduce the distribution gap between training and test data. However, common visual augmentations might not guaran-tee extensive robustness of computer vision models. In this paper, we propose Auxiliary Fourier-basis Augmentation (AFA), a complementary technique targeting augmentation in the frequency domain and filling the robustness gap left by visual augmentations. We demonstrate the utility of augmentation via Fourier-basis additive noise in a straightforward and efficient adversarial setting. Our results show that AFA benefits the robustness of models against common corruptions, OOD generalization, and consistency of performance of models against increasing perturbations, with negligible deficit to the standard performance of models. It can be seamlessly integrated with other augmentation techniques to further boost performance. Codes and models are available at https://github.com/nis-research/afa-augment.
KW - 2024 OA procedure
KW - Visualization
KW - Computer vision
KW - Perturbation methods
KW - Computational modeling
KW - Frequency-domain analysis
KW - Data augmentation
KW - Training
UR - http://www.scopus.com/inward/record.url?scp=85202282371&partnerID=8YFLogxK
U2 - 10.1109/CVPR52733.2024.01682
DO - 10.1109/CVPR52733.2024.01682
M3 - Conference contribution
SN - 979-8-3503-5301-3
SP - 17763
EP - 17772
BT - 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
PB - IEEE
T2 - 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024
Y2 - 16 June 2024 through 22 June 2024
ER -