Illustrating The Central Limit Theorem: Lesson
Illustrating The Central Limit Theorem: Lesson
1. population mean
2. population variance
3. population standard deviation
4. illustrate the probability histogram of the sampling distribution of the
means
Activity 2
Consider all the possible samples of size 2 that can be drawn with
replacement from the population 1, 2, 3, 4, 5, and 6. Create a sampling
distribution of the sample mean.
ΣX
In Activity 1, we get the population mean using the formula μ=
N
where:
μ – population mean
ΣX −¿ sum of all data in terms of X
N - population size
Therefore, we have:
ΣX
μ= Substitute
N
1+ 2+ 3+4 +5+6
= Add each data divided by the population size
6
21
= Simplify
6
μ = 3.5 (population mean)
Σ( X−μ)2
To solve for the population variance, we use the formula σ2 =
N
Probability Data – Population Mean Square of the Data – Population
X
P (X) (X- μ) Mean (X- μ)2
1
1 1-3.5 = -2.5 (2.5)2 = (2.5)(2.5) = 6.25
6
1
2 2-3.5 = -1.5 (-1.5)2 = (-1.5)(-1.5) = 2.25
6
1
3 3-3.5 = -0.5 (-0.5)2 = (-0.5)(-0.5)= 0.25
6
1
4 4-3.5 = 0.5 (0.5)2 = (0.5)(0.5) = 0.25
6
1
5 5-3.5 = 1.5 (1.5)2 = (1.5)(1.5) = 2.25
6
1
6 6-3.5 = 2.5 (2.5)2 = (2.5)(2.5) = 6.25
6
N=6 Σ P ( X )=1 Σ (x - μ ¿2 = 17.5
Σ( X−μ)2
σ2 =
N
17.5
=
6
σ2 ≈ 2.92
2
3. To solve for the population standard deviation, use the formula,σ =
17.5
√ ∑ ( X −μ)
N
thus, we have σ =
6
≈ √ 2.92
√
σ ≈ 1.71
4. For the probability histogram of the population wherein X = x and n = 1, we have
0 1 2 3 4 5 6
For Activity 2, the table shows and summarizes the information needed to compute
the mean, variance, and standard deviation of the sampling distribution of the
sample mean.
Sample Sample
Observation Sample Mean ( X́ −μ X́ ) ¿ Observation Sample Mean ( X́ −μ X́ ) ¿
( X́ ) ( X́ )
1 (1,1) 1 -2.5 6.25 19 (4,1) 2.5 -1 1
2 (1,2) 1.5 -2 4 20 (4,2) 3 -0.5 0.25
3 (1,3) 2 -1.5 2.25 21 (4,3) 3.5 0 0
4 (1,4) 2.5 -1 1 22 (4,4) 4 0.5 0.25
5 (1,5) 3 -0.5 0.25 23 (4,5) 4.5 1 1
6 (1,6) 3.5 0 0 24 (4,6) 5 1.5 2.25
7 (2,1) 1.5 -2 4 25 (5,1) 3 -0.5 0.25
8 (2,2) 2 -1.5 2.25 26 (5,2) 3.5 0 0
9 (2,3) 2.5 -1 1 27 (5,3) 4 0.5 0.25
10 (2,4) 3 -0.5 0.25 28 (5,4) 4.5 1 1
11 (2,5) 3.5 0 0 29 (5,5) 5 1.5 2.25
12 (2,6) 4 0.5 0.25 30 (5,6) 5.5 2 4
13 (3,1) 2 -1.5 2.25 31 (6,1) 3.5 0 0
14 (3,2) 2.5 -1 1 32 (6,2) 4 0.5 0.25
15 (3,3) 3 -0.5 0.25 33 (6,3) 4.5 1 1
16 (3,4) 3.5 0 0 34 (6,4) 5 1.5 2.25
17 (3,5) 4 0.5 0.25 35 (6,5) 5.5 2 4
18 (3,6) 4.5 1 1 36 (6,6) 6 2.5 6.25
∑ X́=¿ 126 ∑ ¿¿ 52.5
Let us now solve for the mean, variance, and standard deviation of the sampling
distribution with sample size 2 (with replacement).
126
¿
36
¿3.5
1. The population mean μ=3.5 while the mean of the sampling distribution
μ X́ =¿3.5. Notice that the population mean is equal to the mean of the sampling
distribution. That is μ=μ X́ .
2. The population variance σ2 ≈ 2.92 while the variance of the sampling distribution
σ 2X́ ≈ 1.46. Notice that if we divide the population variance by the sample size of
2, we have 1.46 which equal to the variance of the sampling distribution of the
2
2.92
sample mean. That is, =1.46therefore σ 2X́ = σ
2 n
3. The population standard deviation σ ≈ 1.71 while the standard deviation of the
sampling distribution σ X́ ≈ 1.21. Notice that if we divide the population
standard deviation by the square root of the sample size of 2, we have 1.21
which is equal to the standard deviation of the sampling distribution. That is,
1.71 σ
=1.21 .Therefore, σ X́ = .
√2 √n
4. The probability histogram of the population wherein n=1 looks like this
P(x)
0 1 2 3 4 5 6 X
While the probability histogram of the sampling distribution of the sample mean
with size n = 2 looks like this.
The illustration above is more likely to have a curve of a normal distribution while
the probability histogram of the population is not normally distributed.
Based on our activity, it was evident in the Central Limit Theorem that the
sampling distribution will be approximately normally distributed for large sample
sizes regardless of the distribution from which we are sampling. Also, it is
important to recall that the mean of the sampling distribution of the sample mean
is equal to the population mean: μ X́ =μ and the standard deviation of the sampling
σ
distribution of the sample mean (sampling with replacement) is equal to: σ X́ = .
√n
If the population is normally distributed, the sampling distribution of the
sample mean is also normally distributed. But what if the population is not
normal?
That is where the Central Limit Theorem addressed this question. The
distribution of the sample mean tends toward the normal distribution as the
sample size increases, regardless of the distribution from which we are sampling.
As a simple guideline, the sample mean can be considered approximately normally
distributed if the sample size is at least 30 (n ≥ 30 ¿.
If the sample size is sufficiently large, the Central Limit Theorem can be used
to answer the sample mean in the same manner that a normal distribution can be
used to answer questions about individual samples. This also means that even if
the population is not normally distributed, or if we don’t know of its distribution,
the Central Limit Theorem allows us to conclude that the distribution of the sample
mean will be normal if the sample size is sufficiently large. It is generally accepted
that a sample size of at least 30 is large enough to conclude that the Central Limit
Theorem will ensure a normal distribution in the sampling process regardless of
the distribution of the original population.
Why it is important to know the Central Limit theorem?
Many statistics have distributions that are approximately normal for large
sample sizes, even when we are sampling from a distribution that is not normal.
This means that we can develop statistical inference procedures that are based on
the normal distribution, even if we are sampling from a population that is not
normal, provided we have a large sample size.
Further, we can continue to use the z conversion formula in our
calculations. This time we will use the formula,
X́−μ
z= σ
√n
Solution:
It is not given that the population is normally distributed but since n ¿ 30, then
you can assume that the sampling distribution of the mean ages of 100 barangay
residents is normal according to the Central Limit Theorem.
The Central Limit Theorem describes the normality of the distribution of the sample
mean taken from a population that is not normally distributed.
Step 1: Identify the given information:
μ=34 , σ=4 , X́ =35 , n=100
Step 2: Identify what is asked.
P( X́ <35)
X́−μ
Z=
σ
√n
35−34
10
Z= 4 = =2.5
4
√ 100
P ( Z <2.5 ) =0 .9938
Therefore, the probability that the random sample of 100 persons has an average of
fewer than 35 years is 0.9938 or 99.38%
Suppose that the average age of the people living in a Barangay is 34 with a
standard deviation of 4. One hundred (100) residents of a certain Barangay decided
to take summer outing after COVID-19 pandemic and Enhanced Community
Quarantine has been lifted for bonding and relaxation.
Probable Time
The mean time it takes a group of senior high students to complete a
certain examination is 50.6 minutes. The standard deviation is 6 minutes.
Assume that the variable is normally distributed.
Find P(X¿ 48) by getting the area under the normal curve.
P(X˂ 48) = P(z ˂ -0.43)
= 0.3336
Probable Time
The mean time it takes a group of senior high students to complete a
certain examination is 50.6 minutes. The standard deviation is 6 minutes.
Assume that the variable is normally distributed.
The probability that 49 randomly selected senior high school students will
complete the test in less than 48 minutes is 0.0012 or 0.12%
Probable Time
The mean time it takes a group of senior high students to complete a
certain examination is 50.6 minutes. The standard deviation is 6 minutes.
Assume that the variable is normally distributed.
3. If 49 randomly selected senior high school students take the
examination, what is the probability that the mean time it takes
the group to complete the test will be more than 51 minutes?
Step 1: Identify the given information:
μ= 50.6
σ =6
X́ = 51
n = 49
Step 2: Identify what is asked.
P( X́ > ¿ 51)
The probability that 49 randomly selected senior high students will complete
the test in more than 51 minutes is 0.3192 or 31.92%
Probable Time
The mean time it takes a group of senior high students to complete a
certain examination is 50.6 minutes. The standard deviation is 6 minutes.
Assume that the variable is normally distributed.
4. If 49 randomly selected senior high students take the examination,
what is the probability that the mean time it takes the group to
complete the test is between 47.8 and 53 minutes?
Step 1: Identify the given information:
μ= 50.6
σ =6
X́ = 47.8 and 53, n = 49
Step 2: Identify what is asked.
To find the probability that 49 randomly selected senior high school students
will complete the test between 47.8 and 53 minutes, subtract the smaller
area from the bigger area under the normal curve.
My Favorite Breed!
A veterinarian studies a large population of one breed of dogs which is shih
tzu. The shih tzu’s mean length is 50 inches with standard deviation 26
inches. If a sample of 169 shih tzu are randomly selected, what is the chance
that the sample mean is:
b. at least 47 inches?
Activity 1
a. less than 49?
Data: n = 169, σ X́ = standard error
because of x is not single, given by
26
= 2, μ = 50
√169
49−50
P( X́ < ¿ 49) = P(z ¿ )
2
= P(z ¿ - 0.5)
= 0.3085
Activity 2:
Is It Worth It?
The mean annual salary of all the frontlines (nurses, medical technologists,
radiologic technologists, phlebotomists) in the Philippines is Php 42,500.
Assume that this is normally distributed with standard deviation Php 5,600.
A random sample of 25 health workers is drawn from this population, find
the probability that the mean salary of the sample is:
a. less than Php 40,500?