Robust nonparametric regression based on deep ReLU neural networks

Juntong Chen*

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

1 Citation (Scopus)
33 Downloads (Pure)

Abstract

In this paper, we consider robust nonparametric regression using deep neural networks with ReLU activation function. While several existing theoretically justified methods are geared towards robustness against identical heavy-tailed noise distributions, the rise of adversarial attacks has emphasized the importance of safeguarding estimation procedures against systematic contamination. We approach this statistical issue by shifting our focus towards estimating conditional distributions. To address it robustly, we introduce a novel estimation procedure based on ℓ-estimation. Under a mild model assumption, we establish general non-asymptotic risk bounds for the resulting estimators, showcasing their robustness against contamination, outliers, and model misspecification. We then delve into the application of our approach using deep ReLU neural networks. When the model is well-specified and the regression function belongs to an α-Hölder class, employing ℓ-type estimation on suitable networks enables the resulting estimators to achieve the minimax optimal rate of convergence. Additionally, we demonstrate that deep ℓ-type estimators can circumvent the curse of dimensionality by assuming the regression function closely resembles the composition of several Hölder functions. To attain this, new deep fully-connected ReLU neural networks have been designed to approximate this composition class. This approximation result can be of independent interest.

Original languageEnglish
Article number106182
JournalJournal of statistical planning and inference
Volume233
DOIs
Publication statusPublished - Dec 2024

Keywords

  • UT-Hybrid-D
  • Deep neural networks
  • Nonparametric regression
  • Robust estimation
  • Supremum of an empirical process
  • Circumventing the curse of dimensionality

Fingerprint

Dive into the research topics of 'Robust nonparametric regression based on deep ReLU neural networks'. Together they form a unique fingerprint.

Cite this