TY - JOUR
T1 - CGC: a scalable Python package for co- and tri-clustering of geodata cubes
AU - Nattino, Francesco
AU - Ku, Ou
AU - Grootes, Meiert W.
AU - Izquierdo-Verdiguier, Emma
AU - Girgin, S.
AU - Zurita-Milla, R.
PY - 2022/4/10
Y1 - 2022/4/10
N2 - Multidimensional data cubes are increasingly ubiquitous, in particular in the geosciences. Clustering techniques encompassing their full dimensionality are necessary to identify patterns "hidden" within these cubes. Clustering Geodata Cubes (CGC) is a Python package designed for partitional clustering, which identifies groups of similar data across two (e.g., spatial and temporal) or three (e.g., spatial, temporal, and thematic) dimensions. CGC provides efficient and scalable co- and tri-clustering functionality appropriate to analyze both small and large datasetsas well as a cluster refinement functionality that supports users in their quest to make sense of complex datasets.
AB - Multidimensional data cubes are increasingly ubiquitous, in particular in the geosciences. Clustering techniques encompassing their full dimensionality are necessary to identify patterns "hidden" within these cubes. Clustering Geodata Cubes (CGC) is a Python package designed for partitional clustering, which identifies groups of similar data across two (e.g., spatial and temporal) or three (e.g., spatial, temporal, and thematic) dimensions. CGC provides efficient and scalable co- and tri-clustering functionality appropriate to analyze both small and large datasetsas well as a cluster refinement functionality that supports users in their quest to make sense of complex datasets.
KW - ITC-GOLD
U2 - 10.21105/joss.04032
DO - 10.21105/joss.04032
M3 - Article
SN - 2475-9066
VL - 7
JO - Journal of open source software
JF - Journal of open source software
IS - 72
M1 - 4032
ER -