Reads a dataset from a CSV file. The file should contain double-precision values separated by commas.
A SparkContext into which the data should be loaded
A path to the CSV file
An org.alitouka.spark.dbscan.RawDataSet populated with points
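To illustrate the expected input format, here is a minimal sketch in plain Scala of how one comma-separated record can be turned into an array of doubles. It does not use Spark and is not the library's implementation; the object name `CsvPointSketch` and the helper `parseLine` are hypothetical and exist only for this example.

```scala
// Hypothetical sketch, not part of spark_dbscan: shows the input format only.
object CsvPointSketch {

  // Splits one CSV record on commas and converts each field to a Double.
  // Throws NumberFormatException if a field is not a valid number.
  def parseLine(line: String): Array[Double] =
    line.split(",").map(_.trim.toDouble)

  def main(args: Array[String]): Unit = {
    // Two sample records, each holding the coordinates of one point
    val lines = Seq("1.0,2.5", "3.0, 4.25")
    val points = lines.map(parseLine)
    points.foreach(p => println(p.mkString("(", ", ", ")")))
  }
}
```

A file that contains a header row or non-numeric columns would fail under this sketch, which is consistent with the requirement that the file hold only double values.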
Saves the clustering result to a CSV file. The resulting file contains the same data as the input file, with a cluster ID appended to each record. The order of records is not guaranteed to match the order in the input file.
An org.alitouka.spark.dbscan.DbscanModel obtained from the Dbscan.train method
Path to a folder where results should be saved. The folder will contain multiple partXXXX files
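The output format described above (input values followed by a cluster ID) can be sketched in plain Scala as follows. This is an illustration under assumptions, not the library's code: the case class `ClusteredPoint`, the helper `toCsvLine`, and the use of a `Long` for the cluster ID are all hypothetical.

```scala
// Hypothetical sketch, not part of spark_dbscan: shows the output record layout.
object ClusteredRecordSketch {

  // Assumed shape of one result: the original coordinates plus a cluster ID
  final case class ClusteredPoint(coordinates: Seq[Double], clusterId: Long)

  // Formats a result as the original comma-separated values
  // with the cluster ID appended as the last field.
  def toCsvLine(p: ClusteredPoint): String =
    (p.coordinates.map(_.toString) :+ p.clusterId.toString).mkString(",")

  def main(args: Array[String]): Unit = {
    val results = Seq(
      ClusteredPoint(Seq(1.0, 2.5), 1L),
      ClusteredPoint(Seq(3.0, 4.25), 2L)
    )
    results.map(toCsvLine).foreach(println)
  }
}
```

Because the results are spread over several part files and record order is not preserved, a consumer should read every file in the output folder and match records by their coordinate values rather than by line position.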
Contains functions for reading and writing data