Ensemble Clustering for Novelty Detection in Data Streams

Kemilly Dearo Garcia*, Elaine Ribeiro de Faria, Cláudio Rebelo de Sá, João Mendes-Moreira, Charu C. Aggarwal, André C.P.L.F. de Carvalho, Joost N. Kok

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

2 Downloads (Pure)

Abstract

In data streams new classes can appear over time due to changes in the data statistical distribution. Consequently, models can become outdated, which requires the use of incremental learning algorithms capable of detecting and learning the changes over time. However, when a single classification model is used for novelty detection, there is a risk that its bias may not be suitable for new data distributions. A solution could be the combination of several models into an ensemble. Besides, because models can only be updated when labeled data arrives, we propose two unsupervised ensemble approaches: one combining clustering partitions using the same clustering technique; and other using different clustering techniques. We compare the performance of the proposed methods with well known novelty detection algorithms. The methods were tested on datasets commonly used in the novelty detection literature. The experimental results show that proposed ensembles have competitive performance for novelty detection in data streams.

Original languageEnglish
Title of host publicationDiscovery Science
Subtitle of host publication22nd International Conference, DS 2019, Splitm, Croatia, October 28-30, 2019. Proceedings
EditorsPetra Kralj Novak, Sašo Džeroski, Tomislav Šmuc
Place of PublicationCham
PublisherSpringer
Pages460-470
Number of pages11
ISBN (Electronic)978-3-030-33778-0
ISBN (Print)978-3-030-33777-3
DOIs
Publication statusPublished - 1 Jan 2019
Event22nd International Conference on Discovery Science, DS 2019 - Radisson Blu Resort and Spa, Split, Croatia
Duration: 28 Oct 201930 Oct 2019
https://ds2019.irb.hr/

Publication series

NameLecture Notes in Artificial Intelligence; subseries of Lecture Notes in Computer Science
PublisherSpringer
Volume11828 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference22nd International Conference on Discovery Science, DS 2019
Abbreviated titleDS2019
CountryCroatia
CitySplit
Period28/10/1930/10/19
Internet address

Keywords

  • Clustering
  • Data streams
  • Ensembles
  • Novelty detection

Fingerprint

Dive into the research topics of 'Ensemble Clustering for Novelty Detection in Data Streams'. Together they form a unique fingerprint.

Cite this