GaussianMixtureSummary
- 
class pyspark.ml.clustering.GaussianMixtureSummary(java_obj: Optional[JavaObject] = None)
- Gaussian mixture clustering results for a given model.
- New in version 2.1.0.
- Attributes
- cluster: DataFrame of the predicted cluster (cluster index) for each training data point.
- clusterSizes: Size of (number of data points in) each cluster.
- featuresCol: Name for column of features in predictions.
- k: The number of clusters the model was trained with.
- logLikelihood: Total log-likelihood for this model on the given data.
- numIter: Number of iterations.
- predictionCol: Name for column of predicted clusters in predictions.
- predictions: DataFrame produced by the model’s transform method.
- probability: DataFrame of probabilities of each cluster for each training data point.
- probabilityCol: Name for column of predicted probability of each cluster in predictions.
- Attributes Documentation
 - 
cluster
- DataFrame of the predicted cluster (cluster index) for each training data point.
- New in version 2.1.0.
 - 
clusterSizes
- Size of (number of data points in) each cluster.
- New in version 2.1.0.
 - 
featuresCol
- Name for column of features in predictions.
- New in version 2.1.0.
 - 
k
- The number of clusters the model was trained with.
- New in version 2.1.0.
 - 
logLikelihood
- Total log-likelihood for this model on the given data.
- New in version 2.2.0.
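To make "total log-likelihood" concrete: for a Gaussian mixture it is the sum, over all data points, of the log of the weighted sum of component densities. The sketch below computes that quantity for a one-dimensional mixture using only the standard library. It is an illustration of the definition, not Spark's implementation (Spark fits multivariate Gaussians); the helper names `gaussian_pdf` and `total_log_likelihood` and the sample values are hypothetical.

```python
import math


def gaussian_pdf(x, mean, std):
    """Density of a 1-D normal distribution at x."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def total_log_likelihood(points, weights, means, stds):
    """Sum over points of log(sum_k weight_k * N(x | mean_k, std_k))."""
    total = 0.0
    for x in points:
        density = sum(
            w * gaussian_pdf(x, m, s) for w, m, s in zip(weights, means, stds)
        )
        total += math.log(density)
    return total


# Hypothetical data: two tight groups around -0.9 and 1.0.
points = [-1.0, -0.8, 1.1, 0.9]

# A mixture whose components sit on the groups ...
ll_good = total_log_likelihood(points, [0.5, 0.5], [-0.9, 1.0], [0.2, 0.2])
# ... and one whose components are far from the data.
ll_poor = total_log_likelihood(points, [0.5, 0.5], [-3.0, 3.0], [0.2, 0.2])
```

A mixture that fits the data better yields a higher (less negative) total, which is why the value is useful for comparing fits on the same data.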
 - 
numIter
- Number of iterations.
- New in version 2.4.0.
 - 
predictionCol
- Name for column of predicted clusters in predictions.
- New in version 2.1.0.
 - 
predictions
- DataFrame produced by the model’s transform method.
- New in version 2.1.0.
 - 
probability
- DataFrame of probabilities of each cluster for each training data point.
- New in version 2.1.0.
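The per-point probabilities relate to the predicted cluster and the cluster sizes in a simple way: the predicted cluster is the index of the largest probability, and each size is a count of points assigned to that index. The plain-Python sketch below shows that relationship on made-up probability rows; the values and the helper name `assign_cluster` are hypothetical, and no Spark objects are involved.

```python
from collections import Counter

# Hypothetical per-point probability rows for a model with k = 2.
probability_rows = [
    [0.92, 0.08],
    [0.85, 0.15],
    [0.10, 0.90],
    [0.40, 0.60],
    [0.97, 0.03],
]


def assign_cluster(row):
    """Return the index of the most probable cluster for one point."""
    return max(range(len(row)), key=lambda i: row[i])


# One predicted cluster index per data point.
cluster = [assign_cluster(row) for row in probability_rows]

# Count the points per cluster, keeping a zero for any empty cluster.
counts = Counter(cluster)
cluster_sizes = [counts.get(i, 0) for i in range(2)]
```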
 - 
probabilityCol
- Name for column of predicted probability of each cluster in predictions.
- New in version 2.1.0.
 