
Probabilistic Model-Based Clustering

Rubén Sánchez Corcuera


[email protected]

Gaussian Mixture Clustering

■ In all the cluster analysis methods we have discussed so far, each data object can be assigned to only one of a number of clusters.
■ This cluster assignment rule is required in some applications, such as assigning customers to marketing managers.
■ However, in other applications, this rigid requirement may not be desirable.

Probabilistic Model-Based Clustering

■ The goal of cluster analysis is to find hidden categories.
■ We conduct cluster analysis on a dataset because we assume that the objects in the dataset in fact belong to different inherent categories.
■ Clustering tendency analysis can be used to examine whether a dataset contains objects that may lead to meaningful clusters.
■ Here, the inherent categories hidden in the data are latent, which means they cannot be directly observed.
  ○ Instead, we have to infer them from the observed data.
■ For example, the topics hidden in a set of reviews in an online store are latent because one cannot read the topics directly.
■ However, the topics can be inferred from the reviews, because each review is about one or multiple topics.

Probabilistic Model-Based Clustering

■ A data set that is the subject of cluster analysis can be regarded as a sample of the possible instances of the hidden categories, but without any category labels.
■ The clusters derived from cluster analysis are inferred using the data set and are designed to approach the hidden categories.
■ Statistically, we can assume that a hidden category is a distribution over the data space, which can be mathematically represented using a probability density function (or distribution function).
  ○ We call such a hidden category a probabilistic cluster.
■ For a probabilistic cluster C with probability density function f, and a point o in the data space, f(o) is the relative likelihood that an instance of C appears at o.
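As a minimal sketch of evaluating f(o), modelling one probabilistic cluster as a 2-D Gaussian; the mean and covariance values below are made-up assumptions for illustration:

# Evaluate the density f(o) of one probabilistic cluster C at a point o.
# The mean and covariance here are illustrative assumptions, not from the slides.
import numpy as np
from scipy.stats import multivariate_normal

mu = np.array([0.0, 0.0])          # cluster mean
sigma = np.array([[1.0, 0.3],
                  [0.3, 2.0]])     # cluster covariance
cluster = multivariate_normal(mean=mu, cov=sigma)

o = np.array([0.5, -1.0])          # a point in the data space
print(cluster.pdf(o))              # relative likelihood that an instance of C appears at o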
[Figure-only slide: Probabilistic Model-Based Clustering, example with a tech product]

Expectation Maximization

■ It can be shown that k-means clustering is a special case of fuzzy clustering. The k-means algorithm iterates until the clustering cannot be improved.
■ Each iteration consists of two steps (sketched in code below):
  1. The expectation step (E-step): Given the current cluster centers, each object is assigned to the cluster whose center is closest to the object. Here, an object is expected to belong to the closest cluster.
  2. The maximization step (M-step): Given the cluster assignment, for each cluster the algorithm adjusts the center so that the sum of the distances between the objects assigned to this cluster and the new center is minimized. That is, the similarity of objects assigned to a cluster is maximized.
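A minimal NumPy sketch of this E-step / M-step loop; the data X and the initial centers are assumed inputs, and this is an illustration of the idea rather than a production k-means:

# EM-style view of k-means: alternate assignment (E) and center update (M)
# until the clustering cannot be improved. Assumes no cluster becomes empty.
import numpy as np

def kmeans_em(X, centers, n_iter=100):
    for _ in range(n_iter):
        # E-step: assign each object to the cluster with the closest center
        dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # M-step: move each center to the mean of its assigned objects,
        # which minimizes the sum of squared distances within the cluster
        new_centers = np.array([X[labels == k].mean(axis=0)
                                for k in range(len(centers))])
        if np.allclose(new_centers, centers):  # clustering cannot be improved
            break
        centers = new_centers
    return centers, labels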

Expectation Maximization

■ We can generalize this two-step method to tackle fuzzy clustering and probabilistic model-based clustering.
■ In general, an expectation-maximization (EM) algorithm is a framework that approaches maximum likelihood or maximum a posteriori estimates of parameters in statistical models.
■ In the context of fuzzy or probabilistic model-based clustering, an EM algorithm starts with an initial set of parameters and iterates until the clustering cannot be improved, that is, until the clustering converges or the change is sufficiently small (less than a preset threshold).
■ Each iteration also consists of two steps:
  1. The expectation step assigns objects to clusters according to the current fuzzy clustering or parameters of probabilistic clusters.
  2. The maximization step finds the new clustering or parameters that maximize the expected likelihood in probabilistic model-based clustering.

Expectation Maximization Characteristics

■ In many applications, probabilistic model-based clustering has been shown to be effective because it is more general than partitioning methods and fuzzy clustering methods.
■ A distinct advantage is that appropriate statistical models can be used to capture latent clusters.
■ The EM algorithm is commonly used to handle many learning problems in data mining and statistics due to its simplicity.
■ Note that, in general, the EM algorithm may not converge to the optimal solution. It may instead converge to a local maximum: a good solution, but not necessarily the best.
  ○ Many heuristics have been explored to avoid this. For example, we could run the EM process multiple times using different random initial values (see the sketch below).
■ Furthermore, the EM algorithm can be very costly if the number of distributions is large or the data set contains very few observed data points.
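As an illustration of the restart heuristic, scikit-learn's GaussianMixture exposes an n_init parameter that runs EM from several random initializations and keeps the best result; the data below is a random placeholder assumption:

# Run EM 10 times from different random starts and keep the best fit.
# X is placeholder data; real data would replace it.
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)
X = rng.normal(size=(300, 2))

gm = GaussianMixture(n_components=3, n_init=10, random_state=0).fit(X)
print(gm.lower_bound_)   # log-likelihood lower bound of the best EM run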
Gaussian Mixture Models

■ Scikit-learn implements Expectation Maximization in its GaussianMixture estimator.
■ A Gaussian mixture model is a probabilistic model that assumes all the data points are generated from a mixture of a finite number of Gaussian distributions with unknown parameters.
■ One can think of mixture models as generalizing k-means clustering to incorporate information about the covariance structure of the data as well as the centers of the latent Gaussians.
■ It can also draw confidence ellipsoids for multivariate models, and compute the Bayesian Information Criterion to assess the number of clusters in the data.

Gaussian Mixture Models

■ A Gaussian Mixture Model represents the probability distribution of the data as a combination of multiple Gaussian distributions. Each Gaussian component has its own mean (μ), covariance (Σ), and weight (π):

  p(x) = Σ_{k=1..K} π_k N(x | μ_k, Σ_k),  where π_k ≥ 0 and Σ_{k=1..K} π_k = 1
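A minimal usage sketch with scikit-learn, reading back the fitted weights (π), means (μ), and covariances (Σ); the two-blob synthetic data is an assumption for illustration:

# Fit a 2-component GMM and inspect the learned π, μ, Σ.
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(42)
X = np.vstack([rng.normal(loc=-3, size=(200, 2)),   # synthetic blob 1
               rng.normal(loc=+3, size=(200, 2))])  # synthetic blob 2

gm = GaussianMixture(n_components=2, random_state=0).fit(X)
print(gm.weights_)             # mixing weights π_k (sum to 1)
print(gm.means_)               # component means μ_k
print(gm.covariances_)         # component covariances Σ_k
print(gm.predict_proba(X[:3])) # soft (fuzzy) cluster memberships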

Gaussian Mixture Models

■ The GaussianMixture estimator comes with different options to constrain the covariance of the different classes estimated: spherical, diagonal, tied, or full covariance (compared in the sketch below).

Gaussian Mixture Models: Advantages

■ Speed: It is the fastest algorithm for learning mixture models.
■ Agnostic: As this algorithm maximizes only the likelihood, it will not bias the means towards zero, or bias the cluster sizes to have specific structures that might or might not apply.
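To make the covariance constraints above concrete, here is a short sketch comparing the four covariance_type options via the shape of the fitted covariances; the data is a random placeholder, and note that scikit-learn spells the diagonal option "diag":

# Compare the four covariance constraints on placeholder 2-D data.
import numpy as np
from sklearn.mixture import GaussianMixture

X = np.random.default_rng(1).normal(size=(300, 2))

for cov_type in ["spherical", "diag", "tied", "full"]:
    gm = GaussianMixture(n_components=3, covariance_type=cov_type,
                         random_state=0).fit(X)
    print(cov_type, gm.covariances_.shape)
# spherical -> (3,)       one variance per component
# diag      -> (3, 2)     one variance per component and feature
# tied      -> (2, 2)     a single covariance matrix shared by all components
# full      -> (3, 2, 2)  one full covariance matrix per component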
Gaussian Mixture Models: Disadvantages

■ Singularities: When one has insufficiently many points per mixture component, estimating the covariance matrices becomes difficult, and the algorithm is known to diverge and find solutions with infinite likelihood unless one regularizes the covariances artificially.
■ Number of components: This algorithm will always use all the components it has access to, needing held-out data or information-theoretic criteria to decide how many components to use in the absence of external cues.

Selecting the number of components in a classical Gaussian Mixture Model

■ The BIC criterion can be used to select the number of components in a Gaussian Mixture in an efficient way (see the sketch below). In theory, it recovers the true number of components only in the asymptotic regime (i.e., if much data is available and assuming that the data was actually generated i.i.d. from a mixture of Gaussian distributions).
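A minimal sketch of BIC-based selection, fitting models with 1 to 6 components and keeping the one with the lowest BIC; the three-blob synthetic data is an assumption:

# Select the number of components by minimizing BIC over candidate models.
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(7)
X = np.vstack([rng.normal(loc=-4, size=(150, 2)),
               rng.normal(loc=0,  size=(150, 2)),
               rng.normal(loc=4,  size=(150, 2))])  # three synthetic blobs

bics = []
for k in range(1, 7):
    gm = GaussianMixture(n_components=k, random_state=0).fit(X)
    bics.append(gm.bic(X))            # lower BIC is better
best_k = int(np.argmin(bics)) + 1
print(best_k, bics)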

[Figure-only slide: Selecting the number of components in a classical Gaussian Mixture Model]

Further reading

■ Section 11.1 in [Han & Kamber, 2016]
