Abstract
Maximizing modularity is currently the most widely-used community detection method in applications. Modularity comes with a parameter that indirectly controls the granularity of the resulting clustering. Moreover, one can choose this parameter in such a way that modularity maximization becomes equivalent to maximizing the likelihood of a stochastic block model. Thus, this method is statistically justified, while at the same time, it is known to have a bias towards fine-grained clusterings. In this work, we introduce a heuristic to correct for this bias. This heuristic is based on prior work where modularity is described in geometric terms. This has led to a broad generalization of modularity-based community detection methods, and the heuristic presented in this paper applies to each of them. We justify the heuristic by describing a relation between several distances that we observe to hold in many instances. We prove that, assuming the validity of this relation, our heuristic leads to a clustering of the same granularity as the ground-truth clustering. We compare our heuristic to likelihood-based community detection methods on several synthetic graphs and show that our method indeed results in clusterings with granularity closer to the granularity of the ground-truth clustering. Moreover, our heuristic often outperforms likelihood maximization in terms of similarity to the ground-truth clustering.
Original language | English |
---|---|
Title of host publication | Algorithms and Models for the Web Graph |
Subtitle of host publication | 18th International Workshop, WAW 2023, Toronto, ON, Canada, May 23–26, 2023, Proceedings |
Editors | Megan Dewar, François Théberge, Paweł Prałat, Przemysław Szufel, Małgorzata Wrzosek |
Place of Publication | Cham |
Publisher | Springer |
Pages | 1-18 |
Number of pages | 18 |
ISBN (Electronic) | 978-3-031-32296-9 |
ISBN (Print) | 978-3-031-32295-2 |
DOIs | |
Publication status | Published - 2023 |
Event | 18th International Workshop on Algorithms and Models for the Web Graph, WAW 2023 - Toronto, Canada Duration: 23 May 2023 → 26 May 2023 Conference number: 18 |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Publisher | Springer |
Volume | 13894 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 18th International Workshop on Algorithms and Models for the Web Graph, WAW 2023 |
---|---|
Abbreviated title | WAW 2023 |
Country/Territory | Canada |
City | Toronto |
Period | 23/05/23 → 26/05/23 |
Keywords
- Clustering
- Community detection
- Likelihood maximization
- Modularity
- 2024 OA procedure