Skip to main navigation Skip to search Skip to main content

Statistical theory for image classification using deep convolutional neural network with cross-entropy loss under the hierarchical max-pooling model

  • Michael Kohler
  • , Sophie Langer*
  • *Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

206 Downloads (Pure)

Abstract

Convolutional neural networks (CNNs) trained with cross-entropy loss have proven to be extremely successful in classifying images. In recent years, much work has been done to also improve the theoretical understanding of neural networks. Nevertheless, it seems limited when these networks are trained with cross-entropy loss, mainly because of the unboundedness of the target function. In this paper, we aim to fill this gap by analysing the rate of the excess risk of a CNN classifier trained by cross-entropy loss. Under suitable assumptions on the smoothness and structure of the a posteriori probability, it is shown that these classifiers achieve a rate of convergence which is independent of the dimension of the image. These rates are in line with the practical observations about CNNs.

Original languageEnglish
Article number106188
JournalJournal of statistical planning and inference
Volume234
DOIs
Publication statusPublished - Jan 2025

Keywords

  • Convergence rates
  • Convolutional neural networks
  • Curse of dimensionality
  • Image classification
  • Statistical risk bounds

Fingerprint

Dive into the research topics of 'Statistical theory for image classification using deep convolutional neural network with cross-entropy loss under the hierarchical max-pooling model'. Together they form a unique fingerprint.

Cite this