confStream: Automated Algorithm Selection and Configuration of Stream Clustering Algorithms

Matthias Carnein, Heike Trautmann, Albert Bifet, Bernhard Pfahringer

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

1 Citation (Scopus)

Abstract

Machine learning has become one of the most important tools in data analysis. However, selecting the most appropriate machine learning algorithm and tuning its hyperparameters to their optimal values remains a difficult task. This is even more difficult for streaming applications where automated approaches are often not available to help during algorithm selection and configuration. This paper proposes the first approach for automated algorithm selection and configuration of stream clustering algorithms. We train an ensemble of different stream clustering algorithms and configurations in parallel and use the best performing configuration to obtain a clustering solution. By drawing new configurations from better performing ones, we are able to improve the ensemble performance over time. In large experiments on real and artificial data we show how our ensemble approach can improve upon default configurations and can also compete with a-posteriori algorithm configuration. Our approach is considerably faster than a-posteriori approaches and applicable in real-time. In addition, it is not limited to stream clustering and can be generalised to all streaming applications, including stream classification and regression.
Original languageEnglish
Title of host publicationLearning and Intelligent Optimization
Subtitle of host publication14th International Conference, LION 14, Athens, Greece, May 24–28, 2020, Revised Selected Papers
PublisherSpringer International Publishing AG
Pages80-95
Number of pages16
ISBN (Electronic)978-3-030-53552-0
ISBN (Print)978-3-030-53551-3
DOIs
Publication statusPublished - 2020
Externally publishedYes
Event14th International Conference on Learning and Intelligent Optimization, LION 2020 - Virtual Event
Duration: 24 May 202028 May 2020
Conference number: 14

Publication series

NameLecture Notes in Computer Science
Volume12096

Conference

Conference14th International Conference on Learning and Intelligent Optimization, LION 2020
Abbreviated titleLION 2020
CityVirtual Event
Period24/05/2028/05/20

Fingerprint

Dive into the research topics of 'confStream: Automated Algorithm Selection and Configuration of Stream Clustering Algorithms'. Together they form a unique fingerprint.

Cite this