Service Analytics

8.Anomaly Detection

Clustering Based Techniques

Key assumption: normal data records belong to large and dense clusters, while anomalies do not belong to any of the clusters or form very small clusters
 
Anomalies detected using clustering based methods can be:
  • Data records that do not fit into
    any cluster (residuals from clustering)
  • Small clusters
  • Low density clusters or local anomalies (far from other points within the same cluster)
 
Problem -->
the specific data partitioning (and corresponding outlier scores) may vary significantly with the choice of clustering methodology

Diskussion