Parameters of the algorithm. See org.alitouka.spark.dbscan.DbscanSettings for details
Partitioning settings. See org.alitouka.spark.dbscan.spatial.rdd.PartitioningSettings for details
Runs the clustering algorithm
Runs the clustering algorithm
A data set to be clustered. See org.alitouka.spark.dbscan.RawDataSet for details
A org.alitouka.spark.dbscan.DbscanModel object which represents clustering results
Implementation of the DBSCAN algorithm which is capable of parallel processing of the input data.
Parallel processing consists of 4 high-level steps: