Clustering

This presentation provides an overview of clustering, including its definition and various types such as partitioning, hierarchical, density-based, and model-based clustering. It details specific algorithms like K-means, Fuzzy C-Means, Agglomerative, and DBSCAN, explaining their processes, advantages, and disadvantages. The document emphasizes the importance of clustering in detecting patterns and organizing data points into similar groups.

Uploaded by

Sumita Gupta


THIS PRESENTATION IS ABOUT

• Introduction to Clustering
• Types of Clustering
• Partitioning based Clustering
    • K-means Algorithm
• Fuzzy Clustering
    • Fuzzy C-Means Algorithm
• Hierarchical based Clustering
    • Agglomerative Algorithm
• Density based Clustering
    • DBSCAN Algorithm
• Model based Clustering

CLUSTERING
CLUSTERING: INTRODUCTION
Clustering is the task of dividing a population of data points into groups such that points in the same group are more similar to one another than to points in other groups.
The aim is to separate out groups with similar traits and assign them to clusters.
• Unsupervised learning: requires data, but no labels.
• Detect patterns:
    • Group emails or search results
    • Customer shopping patterns
    • Regions of images
TYPES OF CLUSTERING
CLUSTERING: TYPES
• Partitioning methods:
A partitioning method simply divides the set of data objects into non-overlapping clusters such that each object is in exactly one subset. Example: k-means
• Hierarchical clustering:
Also known as 'nested clustering', as it allows clusters to exist within bigger clusters, forming a tree. Example: agglomerative clustering
• Density-based clustering:
This approach searches the data space for areas of varying density of data points; dense areas form clusters, separated by sparser areas. Example: DBSCAN
• Model-based clustering:
It provides a framework for incorporating our knowledge about a domain.
PARTITIONING CLUSTERING
PARTITION BASED CLUSTERING EXAMPLE: K-MEANS
K-MEANS CLUSTERING
An Iterative Clustering Algorithm
• Partition-based clustering
• Each cluster is associated with a centroid
• Each point is assigned to the cluster with the closest centroid
• The number of clusters, K, must be specified
K-MEANS CLUSTERING
1. Initial centroids are often chosen randomly.
    • Clusters produced vary from one run to another.
2. The centroid is (typically) the mean of the points in the cluster.
3. “Closeness” is measured by Euclidean distance, cosine similarity, correlation, etc.
4. K-means will converge for the common similarity measures mentioned above.
5. Most of the convergence happens in the first few iterations.
    • Often the stopping condition is changed to “Until relatively few points change clusters”.
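The five points above can be sketched in a few lines of NumPy. This is a minimal illustration, not the presenter's implementation: it assumes Euclidean distance, picks the initial centroids at random from the data, and uses hypothetical names (`kmeans`, `min_changed`, `max_iter`) that do not appear in the slides.

```python
import numpy as np

def kmeans(points, k, max_iter=100, min_changed=0, seed=0):
    """Basic K-means sketch: returns (centroids, labels)."""
    rng = np.random.default_rng(seed)
    # Point 1: choose k distinct data points at random as initial centroids.
    start = rng.choice(len(points), size=k, replace=False)
    centroids = points[start].astype(float)
    labels = np.full(len(points), -1)
    for _ in range(max_iter):
        # Point 3: "closeness" is Euclidean distance here (an assumption).
        dists = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
        new_labels = dists.argmin(axis=1)
        changed = int((new_labels != labels).sum())
        labels = new_labels
        # Point 2: each centroid becomes the mean of the points assigned to it.
        for j in range(k):
            members = points[labels == j]
            if len(members) > 0:  # keep the old centroid if a cluster is empty
                centroids[j] = members.mean(axis=0)
        # Point 5: stop once relatively few points change clusters.
        if changed <= min_changed:
            break
    return centroids, labels

# Usage: two well-separated synthetic blobs of 50 points each.
rng = np.random.default_rng(42)
data = np.vstack([rng.normal(0.0, 0.5, (50, 2)), rng.normal(5.0, 0.5, (50, 2))])
centroids, labels = kmeans(data, k=2)
print(centroids)
```

Raising `min_changed` above zero gives the relaxed stopping condition from point 5; changing `seed` shows how the result can vary from one run to another on less clearly separated data.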
K-MEANS CLUSTERING: EXAMPLE
K-MEANS ADVANTAGES
• Relatively simple to implement.
• Scales to large data sets.
• Guarantees convergence (to a local optimum, which depends on the initial centroids).
• Can warm-start the positions of centroids.
• Easily adapts to new examples.
• Can be generalized to clusters of different shapes and sizes, such as elliptical clusters.
K-MEANS DISADVANTAGES
• Choosing k manually.
• Being dependent on initial values.
For a low k, you can mitigate this dependence by running k-means several times with different initial values and picking the best result.
• Clustering data of varying sizes and density.
k-means has trouble clustering data where clusters are of varying sizes and density.
• Clustering outliers.
Centroids can be dragged by outliers, or outliers might get their own cluster instead of being ignored. Consider removing or clipping outliers before clustering.
• Scaling with number of dimensions.
As the number of dimensions increases, a distance-based similarity measure converges to a constant value between any given examples, so distance becomes less useful for telling examples apart.
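The last point, about dimensions, can be checked numerically. The sketch below is an illustration under assumptions not stated in the slides: points are drawn uniformly from the unit hypercube, distance is Euclidean, and `relative_contrast` is a hypothetical helper that compares the farthest and nearest neighbour of one query point.

```python
import numpy as np

def relative_contrast(n_points, n_dims, seed=0):
    """(max - min) / min of the distances from one query point to all others."""
    rng = np.random.default_rng(seed)
    points = rng.random((n_points, n_dims))  # uniform in the unit hypercube
    query = points[0]
    dists = np.linalg.norm(points[1:] - query, axis=1)
    return float((dists.max() - dists.min()) / dists.min())

# The contrast shrinks as dimensions grow: all points look about equally far away.
contrasts = {d: relative_contrast(500, d) for d in (2, 10, 100, 1000)}
for d, c in contrasts.items():
    print(f"dims={d:5d}  relative contrast={c:.3f}")
```

A small contrast means the nearest and farthest points are at almost the same distance, which is why "closest centroid" carries less information in high-dimensional data.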
HIERARCHICAL CLUSTERING
EXAMPLE: AGGLOMERATIVE CLUSTERING
AGGLOMERATIVE CLUSTERING
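As a rough sketch of how agglomerative (bottom-up) clustering builds nested clusters: every point starts as its own cluster, and the two closest clusters are merged repeatedly. The slides do not state a linkage rule or distance measure, so single linkage and Euclidean distance are assumptions here, and the names `agglomerative` and `n_clusters` are illustrative only.

```python
import numpy as np

def agglomerative(points, n_clusters):
    """Naive single-linkage agglomerative clustering.

    Returns the final clusters (lists of point indices) and the merge history.
    """
    clusters = [[i] for i in range(len(points))]  # each point is its own cluster
    merges = []
    dist = np.linalg.norm(points[:, None, :] - points[None, :, :], axis=2)
    while len(clusters) > n_clusters:
        best = (np.inf, -1, -1)
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                # Single linkage: distance between the closest pair of members.
                d = dist[np.ix_(clusters[a], clusters[b])].min()
                if d < best[0]:
                    best = (d, a, b)
        d, a, b = best
        merges.append((sorted(clusters[a]), sorted(clusters[b]), float(d)))
        clusters[a] = clusters[a] + clusters[b]  # merge b into a
        del clusters[b]                          # b > a, so index a stays valid
    return clusters, merges

# Usage: two tight pairs and one isolated point.
points = np.array([[0, 0], [0, 1], [5, 5], [5, 6], [10, 0]], dtype=float)
clusters, merges = agglomerative(points, n_clusters=3)
print(clusters)
for left, right, d in merges:
    print(f"merged {left} + {right} at distance {d:.2f}")
```

The merge history is what a dendrogram draws: reading it from first merge to last gives the tree of clusters within bigger clusters.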
DENSITY BASED CLUSTERING
K-MEANS VS DENSITY BASED CLUSTERING
DENSITY BASED CLUSTERING EXAMPLE: DBSCAN
DBSCAN
DBSCAN Algorithm Steps
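A minimal sketch of the usual DBSCAN procedure, under assumptions: the parameter names `eps` (neighbourhood radius) and `min_pts` (minimum neighbours for a core point, counting the point itself) follow common usage rather than these slides, and distance is Euclidean. Points that no cluster reaches keep the label -1 (noise).

```python
import numpy as np

NOISE = -1

def dbscan(points, eps, min_pts):
    """Naive DBSCAN: returns one cluster label per point, -1 for noise."""
    n = len(points)
    dist = np.linalg.norm(points[:, None, :] - points[None, :, :], axis=2)
    # Neighbourhood of each point, including the point itself.
    neighbors = [np.flatnonzero(dist[i] <= eps) for i in range(n)]
    labels = np.full(n, NOISE)
    visited = np.zeros(n, dtype=bool)
    cluster_id = 0
    for i in range(n):
        if visited[i]:
            continue
        visited[i] = True
        if len(neighbors[i]) < min_pts:
            continue  # not a core point; stays noise unless reached later
        labels[i] = cluster_id  # i is a core point: start a new cluster
        queue = list(neighbors[i])
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster_id  # core or border point joins the cluster
            if visited[j]:
                continue
            visited[j] = True
            if len(neighbors[j]) >= min_pts:
                queue.extend(neighbors[j])  # j is also core: keep expanding
        cluster_id += 1
    return labels

# Usage: two dense squares of four points each, plus one far-away outlier.
points = np.array([
    [0.0, 0.0], [0.0, 0.5], [0.5, 0.0], [0.5, 0.5],
    [5.0, 5.0], [5.0, 5.5], [5.5, 5.0], [5.5, 5.5],
    [10.0, 10.0],
])
labels = dbscan(points, eps=1.0, min_pts=3)
print(labels.tolist())
```

Unlike K-means, the number of clusters is not given in advance; it falls out of the density parameters, and the isolated point is left unassigned rather than forced into a cluster.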
DBSCAN Example
DBSCAN: ADVANTAGES & DISADVANTAGES
