[R-pkg-team] Bug#975886: ITP: r-cran-genieclust -- GNU R Genie++ Hierarchical Clustering Algorithm with Noise Points Detection
Andreas Tille
tille at debian.org
Thu Nov 26 10:03:57 GMT 2020
Package: wnpp
Severity: wishlist
Subject: ITP: r-cran-genieclust -- GNU R Genie++ Hierarchical Clustering Algorithm with Noise Points Detection
Package: wnpp
Owner: Andreas Tille <tille at debian.org>
Severity: wishlist
* Package name : r-cran-genieclust
Version : 0.9.4
Upstream Author : Marek Gagolewski,
* URL : https://cran.r-project.org/package=genieclust
* License : AGPL-3
Programming Lang: GNU R
Description : GNU R Genie++ Hierarchical Clustering Algorithm with Noise Points Detection
A retake on the Genie algorithm - a robust hierarchical clustering
method (Gagolewski, Bartoszuk, Cena, 2016
<DOI:10.1016/j.ins.2016.05.003>). Now faster and more memory efficient;
determining the whole hierarchy for datasets of 10M points in low
dimensional Euclidean spaces or 100K points in high-dimensional ones
takes only 1-2 minutes. Allows clustering with respect to mutual
reachability distances so that it can act as a noise point detector or a
robustified version of 'HDBSCAN*' (that is able to detect a predefined
number of clusters and hence it does not dependent on the somewhat
fragile 'eps' parameter).
.
The package also features an implementation of economic inequity indices
(the Gini, Bonferroni index) and external cluster validity measures
(partition similarity scores; e.g., the adjusted Rand, Fowlkes-Mallows,
adjusted mutual information, pair sets index).
.
See also the 'Python' version of 'genieclust' available on 'PyPI', which
supports sparse data, more metrics, and even larger datasets.
Remark: This package is maintained by Debian R Packages Maintainers at
https://salsa.debian.org/r-pkg-team/r-cran-genieclust
More information about the R-pkg-team
mailing list