Affinity Propagation cannot be constructed without having the package installed

mlr3cluster

Package website: release | dev

Cluster analysis for mlr3.

mlr3cluster is an extension package for cluster analysis within the mlr3 ecosystem. It is a successor of clustering capabilities of mlr2.

Installation

Install the last release from CRAN:

install.packages("mlr3cluster")

Install the development version from GitHub:

# install.packages("pak")
pak::pak("mlr-org/mlr3cluster")

Feature Overview

The current version of mlr3cluster contains:

A selection of 24 clustering learners that represent a wide variety of clusterers: partitional, hierarchical, fuzzy, etc.
A selection of 4 performance measures
Two built-in tasks to get started with clustering

Also, the package is integrated with mlr3viz which enables you to create great visualizations with just one line of code!

Cluster Analysis

Cluster Learners

Key	Label	Packages
clust.MBatchKMeans	Mini Batch K-Means	ClusterR
clust.SimpleKMeans	K-Means (Weka)	RWeka
clust.agnes	Agglomerative Hierarchical Clustering	cluster
clust.ap	Affinity Propagation Clustering	apcluster
clust.bico	BICO Clustering	stream
clust.birch	BIRCH Clustering	stream
clust.cmeans	Fuzzy C-Means Clustering Learner	e1071
clust.cobweb	Cobweb Clustering	RWeka
clust.dbscan	Density-Based Clustering	dbscan
clust.dbscan_fpc	Density-Based Clustering with fpc	fpc
clust.diana	Divisive Hierarchical Clustering	cluster
clust.em	Expectation-Maximization Clustering	RWeka
clust.fanny	Fuzzy Analysis Clustering	cluster
clust.featureless	Featureless Clustering
clust.ff	Farthest First Clustering	RWeka
clust.hclust	Agglomerative Hierarchical Clustering	stats
clust.hdbscan	HDBSCAN Clustering	dbscan
clust.kkmeans	Kernel K-Means	kernlab
clust.kmeans	K-Means	stats, clue
clust.mclust	Gaussian Mixture Models Clustering	mclust
clust.meanshift	Mean Shift Clustering	LPCM
clust.optics	OPTICS Clustering	dbscan
clust.pam	Partitioning Around Medoids	cluster
clust.xmeans	X-means	RWeka

Cluster Measures

Key	Label	Packages
clust.ch	Calinski Harabasz	fpc
clust.dunn	Dunn	fpc
clust.silhouette	Silhouette	cluster
clust.wss	Within Sum of Squares	fpc

Example

library(mlr3)
library(mlr3cluster)

task = tsk("usarrests")
learner = lrn("clust.kmeans")
learner$train(task)
prediction = learner$predict(task = task)

More Resources

Check out the blogpost for a more detailed introduction to the package. Also, mlr3book has a section on clustering.

Future Plans

Add more learners and measures
Integrate the package with mlr3pipelines (work in progress)

If you have any questions, feedback or ideas, feel free to open an issue here.

	s = p_uty(default = apcluster::negDistMat(r = 2L), tags = c("required", "train")),
	p = p_uty(custom_check = function(x) {
	if (test_numeric(x)) {
	return(TRUE)
	} else {
	stop("`p` needs to be a numeric vector")
	}
	}, default = NA, tags = "train"),
	q = p_dbl(lower = 0L, upper = 1L, tags = "train"),
	maxits = p_int(lower = 1L, default = 1000L, tags = "train"),
	convits = p_int(lower = 1L, default = 100L, tags = "train"),
	lam = p_dbl(lower = 0.5, upper = 1L, default = 0.9, tags = "train"),
	includeSim = p_lgl(default = FALSE, tags = "train"),
	details = p_lgl(default = FALSE, tags = "train"),
	nonoise = p_lgl(default = FALSE, tags = "train"),
	seed = p_int(tags = "train")
	)
	ps$values = list(s = apcluster::negDistMat(r = 2L))

	private = list(
	.score = function(prediction, task, ...) {
	X = as.matrix(task$data(rows = prediction$row_ids))
	if (!is.double(X)) { # clusterCrit does not convert lgls/ints
	storage.mode(X) = "double"
	}
	intCriteria(X, prediction$partition, self$crit)[[1L]]
	}
	)

mlr-org / mlr3cluster Goto Github PK

mlr3cluster's Introduction

mlr3cluster

Installation

Feature Overview

Cluster Analysis

Cluster Learners

Cluster Measures

Example

More Resources

Future Plans

mlr3cluster's People

Contributors

Stargazers

Watchers

Forkers

mlr3cluster's Issues

Description

Reproducible example

Recommend Projects

Recommend Topics

Recommend Org