100% found this document useful (2 votes)
3K views

Illustrating The Central Limit Theorem: Lesson

1. The document describes two activities to illustrate the central limit theorem. Activity 1 looks at the population and finds its mean, variance, and standard deviation. Activity 2 considers all possible samples from the population and finds the mean, variance, and standard deviation of the sampling distribution. 2. The activities show that as sample size increases, the sampling distribution of the mean approximates the normal distribution, even if the population is not normally distributed. This demonstrates the central limit theorem. 3. The document compares the measures (mean, variance, standard deviation) between the population and sampling distribution with different sample sizes. This shows how the sampling distribution converges to the normal distribution as sample size increases.

Uploaded by

aiza
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
100% found this document useful (2 votes)
3K views

Illustrating The Central Limit Theorem: Lesson

1. The document describes two activities to illustrate the central limit theorem. Activity 1 looks at the population and finds its mean, variance, and standard deviation. Activity 2 considers all possible samples from the population and finds the mean, variance, and standard deviation of the sampling distribution. 2. The activities show that as sample size increases, the sampling distribution of the mean approximates the normal distribution, even if the population is not normally distributed. This demonstrates the central limit theorem. 3. The document compares the measures (mean, variance, standard deviation) between the population and sampling distribution with different sample sizes. This shows how the sampling distribution converges to the normal distribution as sample size increases.

Uploaded by

aiza
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 16

Lesson

Illustrating the Central


1 Limit Theorem
Activity 1

Consider a population of Senior High School consisting of the values 1, 2, 3,


4, 5, and 6.

Compute the following:

1. population mean
2. population variance
3. population standard deviation
4. illustrate the probability histogram of the sampling distribution of the
means

Activity 2

Consider all the possible samples of size 2 that can be drawn with
replacement from the population 1, 2, 3, 4, 5, and 6. Create a sampling
distribution of the sample mean.

Compute the following:

1. Mean of the sampling distribution of the sample mean


2. Variance of the sampling distribution of the sample mean
3. Standard deviation of the sampling distribution of the sample mean
4. Illustrate the probability histogram of the sampling distribution of the
mean.

From Activities 1 and 2, compare the following:


1. population mean and the mean of the sampling distribution of the
sample mean
2. population variance and the variance of the sampling distribution of
the sample mean
3. population standard deviation and the standard deviation of the
sampling distribution of the sample mean
4. probability histogram of the population and the sampling distribution
of the mean with size 2

ΣX
In Activity 1, we get the population mean using the formula μ=
N

where:
μ – population mean
ΣX −¿ sum of all data in terms of X
N - population size

Therefore, we have:

ΣX
μ= Substitute
N
1+ 2+ 3+4 +5+6
= Add each data divided by the population size
6
21
= Simplify
6
μ = 3.5 (population mean)

Σ( X−μ)2
To solve for the population variance, we use the formula σ2 =
N
Probability Data – Population Mean Square of the Data – Population
X
P (X) (X- μ) Mean (X- μ)2
1
1 1-3.5 = -2.5 (2.5)2 = (2.5)(2.5) = 6.25
6
1
2 2-3.5 = -1.5 (-1.5)2 = (-1.5)(-1.5) = 2.25
6
1
3 3-3.5 = -0.5 (-0.5)2 = (-0.5)(-0.5)= 0.25
6
1
4 4-3.5 = 0.5 (0.5)2 = (0.5)(0.5) = 0.25
6
1
5 5-3.5 = 1.5 (1.5)2 = (1.5)(1.5) = 2.25
6
1
6 6-3.5 = 2.5 (2.5)2 = (2.5)(2.5) = 6.25
6
N=6 Σ P ( X )=1 Σ (x - μ ¿2 = 17.5

2. Solve for the population variance using the formula

Σ( X−μ)2
σ2 =
N
17.5
=
6
σ2 ≈ 2.92
2
3. To solve for the population standard deviation, use the formula,σ =

17.5
√ ∑ ( X −μ)
N
thus, we have σ =
6
≈ √ 2.92

σ ≈ 1.71
4. For the probability histogram of the population wherein X = x and n = 1, we have

0 1 2 3 4 5 6

For Activity 2, the table shows and summarizes the information needed to compute
the mean, variance, and standard deviation of the sampling distribution of the
sample mean.

Sample Sample
Observation Sample Mean ( X́ −μ X́ ) ¿ Observation Sample Mean ( X́ −μ X́ ) ¿
( X́ ) ( X́ )
1 (1,1) 1 -2.5 6.25 19 (4,1) 2.5 -1 1
2 (1,2) 1.5 -2 4 20 (4,2) 3 -0.5 0.25
3 (1,3) 2 -1.5 2.25 21 (4,3) 3.5 0 0
4 (1,4) 2.5 -1 1 22 (4,4) 4 0.5 0.25
5 (1,5) 3 -0.5 0.25 23 (4,5) 4.5 1 1
6 (1,6) 3.5 0 0 24 (4,6) 5 1.5 2.25
7 (2,1) 1.5 -2 4 25 (5,1) 3 -0.5 0.25
8 (2,2) 2 -1.5 2.25 26 (5,2) 3.5 0 0
9 (2,3) 2.5 -1 1 27 (5,3) 4 0.5 0.25
10 (2,4) 3 -0.5 0.25 28 (5,4) 4.5 1 1
11 (2,5) 3.5 0 0 29 (5,5) 5 1.5 2.25
12 (2,6) 4 0.5 0.25 30 (5,6) 5.5 2 4
13 (3,1) 2 -1.5 2.25 31 (6,1) 3.5 0 0
14 (3,2) 2.5 -1 1 32 (6,2) 4 0.5 0.25
15 (3,3) 3 -0.5 0.25 33 (6,3) 4.5 1 1
16 (3,4) 3.5 0 0 34 (6,4) 5 1.5 2.25
17 (3,5) 4 0.5 0.25 35 (6,5) 5.5 2 4
18 (3,6) 4.5 1 1 36 (6,6) 6 2.5 6.25
∑ X́=¿ 126 ∑ ¿¿ 52.5

Sampling Distribution of the sample mean with replacement


X́ f Probability P( X́ )
1
1 1
36
2
1.5 2
36
3
2 3
36
4
2.5 4
36
5
3 5
36
6
3.5 6
36
5
4 5
36
4
4.5 4
36
3
5 3
36
2
5.5 2
36
1
6 1
36

Let us now solve for the mean, variance, and standard deviation of the sampling
distribution with sample size 2 (with replacement).

1. Mean of the sampling distribution of the sample mean


Σ X́
μ X́ = where n is the number of all possible samples of size 2
n

126
¿
36
¿3.5

2. Variance of the sampling distribution of the sample mean


σ 2X́ = Σ ¿ ¿ where n is the number of all possible samples of size 2
52.5
¿
36
≈ 1.46

3. Standard Deviation of the sampling distribution of the sample mean


σ X́ = √ Σ ¿ ¿¿ where n is the number of all possible samples of size 2
52.6
¿

≈ 1.21
36

5. To illustrate the probability histogram of the sampling distribution of


the sample, the following table is needed:
Comparison
we compared the values solved through the data in the population and the
sampling distribution.

1. The population mean μ=3.5 while the mean of the sampling distribution
μ X́ =¿3.5. Notice that the population mean is equal to the mean of the sampling
distribution. That is μ=μ X́ .

2. The population variance σ2 ≈ 2.92 while the variance of the sampling distribution
σ 2X́ ≈ 1.46. Notice that if we divide the population variance by the sample size of
2, we have 1.46 which equal to the variance of the sampling distribution of the
2
2.92
sample mean. That is, =1.46therefore σ 2X́ = σ
2 n

3. The population standard deviation σ ≈ 1.71 while the standard deviation of the
sampling distribution σ X́ ≈ 1.21. Notice that if we divide the population
standard deviation by the square root of the sample size of 2, we have 1.21
which is equal to the standard deviation of the sampling distribution. That is,
1.71 σ
=1.21 .Therefore, σ X́ = .
√2 √n
4. The probability histogram of the population wherein n=1 looks like this

P(x)

0 1 2 3 4 5 6 X

While the probability histogram of the sampling distribution of the sample mean
with size n = 2 looks like this.
The illustration above is more likely to have a curve of a normal distribution while
the probability histogram of the population is not normally distributed.

Based on our activity, it was evident in the Central Limit Theorem that the
sampling distribution will be approximately normally distributed for large sample
sizes regardless of the distribution from which we are sampling. Also, it is
important to recall that the mean of the sampling distribution of the sample mean
is equal to the population mean: μ X́ =μ and the standard deviation of the sampling
σ
distribution of the sample mean (sampling with replacement) is equal to: σ X́ = .
√n
If the population is normally distributed, the sampling distribution of the
sample mean is also normally distributed. But what if the population is not
normal?
That is where the Central Limit Theorem addressed this question. The
distribution of the sample mean tends toward the normal distribution as the
sample size increases, regardless of the distribution from which we are sampling.
As a simple guideline, the sample mean can be considered approximately normally
distributed if the sample size is at least 30 (n ≥ 30 ¿.

If the sample size is sufficiently large, the Central Limit Theorem can be used
to answer the sample mean in the same manner that a normal distribution can be
used to answer questions about individual samples. This also means that even if
the population is not normally distributed, or if we don’t know of its distribution,
the Central Limit Theorem allows us to conclude that the distribution of the sample
mean will be normal if the sample size is sufficiently large. It is generally accepted
that a sample size of at least 30 is large enough to conclude that the Central Limit
Theorem will ensure a normal distribution in the sampling process regardless of
the distribution of the original population.
Why it is important to know the Central Limit theorem?
Many statistics have distributions that are approximately normal for large
sample sizes, even when we are sampling from a distribution that is not normal.
This means that we can develop statistical inference procedures that are based on
the normal distribution, even if we are sampling from a population that is not
normal, provided we have a large sample size.
Further, we can continue to use the z conversion formula in our
calculations. This time we will use the formula,

X́−μ
z= σ
√n

Let us illustrate this through this example:


Suppose that the average age of the people living in a Barangay is 34 with a
standard deviation of 4. If 100 residents of a certain Barangay decided to take
summer outing after COVID-19 pandemic and Enhanced Community Quarantine has
been lifted for bonding and relaxation, what is the probability that the average age of
these residents is less than 35?

Solution:

It is not given that the population is normally distributed but since n ¿ 30, then
you can assume that the sampling distribution of the mean ages of 100 barangay
residents is normal according to the Central Limit Theorem.
The Central Limit Theorem describes the normality of the distribution of the sample
mean taken from a population that is not normally distributed.
Step 1: Identify the given information:
μ=34 , σ=4 , X́ =35 , n=100
Step 2: Identify what is asked.

P( X́ <35)

Step 3: Identify the formula to be used.


The problem is dealing with data about the sample mean or n
X́−μ
observations, so the formula to be used to standardize 48 is z = σ
√n
Step 4: Compute for the Probability:

X́−μ
Z=
σ
√n
35−34
10
Z= 4 = =2.5
4
√ 100

Find P( X́ ˂ 35) by getting the area under the normal curve.


P( X́ ˂ 35)= P ( Z <2.5 )

P ( Z <2.5 ) =0 .9938

Therefore, the probability that the random sample of 100 persons has an average of
fewer than 35 years is 0.9938 or 99.38%

Consider the illustrations below for a better understanding of the Central


Limit Theorem. Still refer to our previous example:

Suppose that the average age of the people living in a Barangay is 34 with a
standard deviation of 4. One hundred (100) residents of a certain Barangay decided
to take summer outing after COVID-19 pandemic and Enhanced Community
Quarantine has been lifted for bonding and relaxation.

If we make a relative histogram of samples with various sample sizes, it


would look like the histograms below.

POPULATION (individuals) Distribution of ages of


individuals living in a Barangay. This is an
example of uniform distribution whenever the
sample size n =1. Each individual has an equal
probability of selecting as a sample. The
histogram will look like on the left illustration
not so interesting!

Sampling Distribution of the SAMPLE MEAN


(n=40). Different samples of sizes 40 were
selected then calculate the mean of each sample.
The histogram for those mean will look like this
illustration which becomes a little bit interesting!
The curve starts to show.
Sampling Distribution of the SAMPLE MEAN
(n=80). Different samples of sizes 80 were
selected, then calculate the mean of each
sample. The histogram for those mean will look
like this illustration that is almost close to a
normal curve.

As we continuously increase the samples and calculate the sample


mean, the histogram of the illustration comes to be normally distributed.
And that is where the Central Limit Theorem is used to make better
inferences.

Lesson Solving Problems Involving

1 Sampling Distribution of the


Sample Mean

Probable Time
The mean time it takes a group of senior high students to complete a
certain examination is 50.6 minutes. The standard deviation is 6 minutes.
Assume that the variable is normally distributed.

1. What is the probability that a randomly selected senior high school


student will complete the examination in less than 48 minutes?

To answer the problem, follow the steps:

Step 1. Identify the given information:


μ = 50.6
σ =6
X = 48

Step 2. Identify what is asked for:


P(X ˂ 48)

Step 3. Identify the formula to be used:


The problem is dealing with an individual data obtained from the
X−μ
population so the formula to be used is z = to convert 48 to
σ
standard score.

Step 4. Compute for the Probability:


X−μ
z=
σ
48−50.6
z =
6
z = - 0.43

Find P(X¿ 48) by getting the area under the normal curve.
P(X˂ 48) = P(z ˂ -0.43)
= 0.3336

Therefore, the probability that a randomly selected college student will


complete the examination in less than 48 minutes is 0.3336 or 33.36%

Probable Time
The mean time it takes a group of senior high students to complete a
certain examination is 50.6 minutes. The standard deviation is 6 minutes.
Assume that the variable is normally distributed.

2. If 49 randomly selected senior high school students take the


examination, what is the probability that the mean time it takes the
group to complete the test will be less than 48 minutes?
Step 1: Identify the given information:
μ= 50.6
σ =6
X́ = 48
n = 49

Step 2: Identify what is asked.


P( X́ ˂ 48)

Step 3: Identify the formula to be used.

The problem is dealing with data about the sample mean or n


observations, so the formula to be used to standardize 48 is z =
X́−μ
σ
√n
Step 4: Compute for the Probability:
X́−μ
z= σ
√n
48−50.6
= 6 = - 3.03
√ 49
Find P( X́ ˂ 48) by getting the area under the normal curve.
P( X́ ˂ 48) = P(z ˂ -3.03)
= 0.0012

The probability that 49 randomly selected senior high school students will
complete the test in less than 48 minutes is 0.0012 or 0.12%

Probable Time
The mean time it takes a group of senior high students to complete a
certain examination is 50.6 minutes. The standard deviation is 6 minutes.
Assume that the variable is normally distributed.
3. If 49 randomly selected senior high school students take the
examination, what is the probability that the mean time it takes
the group to complete the test will be more than 51 minutes?
Step 1: Identify the given information:
μ= 50.6
σ =6
X́ = 51
n = 49
Step 2: Identify what is asked.
P( X́ > ¿ 51)

Step 3: Identify the formula to be used.

The problem is dealing with data about the sample mean or n


observations, so the formula to be used to standardize 51 is z =
X́−μ
σ
√n
Step 4: Compute for the Probability:
X́−μ
z= σ
√n
51−50.6
= 6
√ 49
= 0.47
Find P( X́ > ¿ 51) by getting the area under the normal curve.

P( X́ > ¿ 51) = P(z ¿ 0.47)


= 1 −¿P( z <0.47)
= 1 – 0.6808
= 0.3192

The probability that 49 randomly selected senior high students will complete
the test in more than 51 minutes is 0.3192 or 31.92%

Probable Time
The mean time it takes a group of senior high students to complete a
certain examination is 50.6 minutes. The standard deviation is 6 minutes.
Assume that the variable is normally distributed.
4. If 49 randomly selected senior high students take the examination,
what is the probability that the mean time it takes the group to
complete the test is between 47.8 and 53 minutes?
Step 1: Identify the given information:
μ= 50.6
σ =6
X́ = 47.8 and 53, n = 49
Step 2: Identify what is asked.

P(47.8 ¿ X́ <¿ 53)

Step 3: Identify the formula to be used.

The problem is dealing with data about the sample mean or n


observations, so the formula to be used to standardize 47.8 and 53
X́−μ
is z = σ
√n
Step 4: Compute for the Probability:

(4.a) P( X́ < ¿47.8) (4.b) P( X́ < ¿ 53)


X́−μ X́−μ
z1 = σ z 2= σ
√n √n
47.8−50.6 53−50.6
= 6 = 6
√ 49 √ 49
= - 3.27 = 2.8

Find P( X́ > ¿ 47.8) by getting the Find P( x́ <¿ 53) by getting


the
area under the normal curve. area under the normal curve
P( x́ >¿ 47.8) = P(z ¿ - 3.27) = 0.0005 P( x́ <¿ 53) = P(z ¿ 2.8) = 0.9974

To find the probability that 49 randomly selected senior high school students
will complete the test between 47.8 and 53 minutes, subtract the smaller
area from the bigger area under the normal curve.

That is 0.9974– 0.0005 =0.9969 or 99.69%


Activity 1:
Below are problems you need to ponder. Read each statement carefully and
answer the questions. Use separate sheet/s of paper for your answers

My Favorite Breed!
A veterinarian studies a large population of one breed of dogs which is shih
tzu. The shih tzu’s mean length is 50 inches with standard deviation 26
inches. If a sample of 169 shih tzu are randomly selected, what is the chance
that the sample mean is:

a. less than 49 inches?

b. at least 47 inches?

c. between 46 and 48 inches?

Activity 1
a. less than 49?
Data: n = 169, σ X́ = standard error
because of x is not single, given by
26
= 2, μ = 50
√169
49−50
P( X́ < ¿ 49) = P(z ¿ )
2
= P(z ¿ - 0.5)
= 0.3085
Activity 2:

Is It Worth It?
The mean annual salary of all the frontlines (nurses, medical technologists,
radiologic technologists, phlebotomists) in the Philippines is Php 42,500.
Assume that this is normally distributed with standard deviation Php 5,600.
A random sample of 25 health workers is drawn from this population, find
the probability that the mean salary of the sample is:
a. less than Php 40,500?

b. greater than Php 41,000?

c. between Php 40,400 and Php 45,000?

You might also like