UPDATED complete notes of BCS301 (2)
UPDATED complete notes of BCS301 (2)
PROBABILITY DISTRIBUTIONS
Introduction
The three main components of statistics are collection of data, analysis of data and
inference. The scope of statistics is very vast and finds its applications in almost all spheres.
Statistical methods guides us in any scientific enquiry and leans heavily on probability
theory.
Basic Definitions and concepts
Experiment is a process by which measurement or observation is obtained.
Random experiment : An experiment which does not result in the same out come when
performed under the same conditions.
Eg : 1. Tossing of a fair coin.
2. Throwing of a fair die.
Sample space : The set of all possible outcomes of a random experiment is called a sample
space usually denoted by S.
Eg 1. S = H,T is the sample space in tossing of a single coin.
2. S = HH,TH,HT,TT is the sample space obtained in tossing of two coins.
3. S = 1,2,3,4,5,6 in throwing of a die.
Exhaustive event : An event consisting of all the various possibilities is called an exhaustive
event.
Mutually exclusive events : Events are said to be mutually exclusive if and only if one of
them can take place at a time, in other words happening of one event prevent the
simultaneous happening of the others.
Eg: In the tossing of a coin, either heads or tails may turn up but not both.
Independent events: Two or more events are said to be independent if the happening or non
happening of one event does not prevent the happening or non happening of the others.
Eg: 1.When two coins are tossed the event of getting head is an independent event as both
the coins can turn out heads.
Probability is the measure of the chance with which we can expect the event to occur. If an
event A can occur in m different ways out of number of n possible ways, all of which are
m
equally likely then the probability of the event A denoted by P(A) is given by P(A) = n .
1
2) If P(A) = 0 then A is called a null event.
3) If P(A) = 1 then A is called a sure event.
̅̅̅ = 1 , the set of all unfavourable events is denoted by A
4) P(A) + P(A) ̅.
Introduction
When we consider an experiment of tossing two coins. The sample points or out comes are
qualitative and are described by their attributes though the probabilities attached to each of
these outcomes are quantitative. We now associate to each sample point a real number called
the random variable.
Random variable
Definition: Let S be a sample space then X : S R defined over the elements of S is
called a random variable. A random variable is a function that assigns a real number to
every sample point in the sample space of a random experiment. Random variables are
usually denoted by X, Y, Z … and it may be noted that different random variables may be
associated with the sample space S. The set of all real numbers of a random variable X is
called the range of X.
Eg:
1. While tossing a coin, suppose that the value 1 is associated for the outcome ‘head’ and 0
for the outcome tail. We have the sample space S = {H, T} and if X is the random
variable , then X(H) = 1 & X(T) = 0.
Range of X = {0,1}
2. Suppose a coin is tossed twice, we shall associate two different random variables X, Y as
follows, where we have the sample space
S = {HH, HT, TH, TT}
X = Number of ‘heads’ in the outcome.
The association of the elements in S to X is as follows.
2
Out come HH HT TH TT
Random variable 2 1 1 0
X
Range of X = {0,1,2}
Suppose Y = Number of ‘tails’ in the outcome, we have
Out come HH HT TH TT
Random variable Y 0 1 1 2
Range of Y = {0,1,2}
3
Mean μ = ∑i xi . p(xi )
Variance V = ∑i(xi − μ)2 . p(xi )
V = ∑i xi2 . p(xi ) − [∑i xi . p(xi )]2
V = ∑ xi2 . p(xi ) − μ2
Standard deviation σ = √V
Problems
1. A coin is tossed twice. A random variable X represent the number of heads turning up.
Find the discrete probability distribution for X. Also find its mean and variance.
Sol: Here S = {HH, HT, TH, TT}
X = {0,1,2}
1 1 1 1
Now, P(HH) = 4 P(HT) = 4 P(TH) = 4 P(TT) = 4
1
P(X = 0, i. e. , no head turning up) = P(TT) =
4
1 1 1
P(X = 1, i. e. , one head turning up) = P(HT ∪ TH) = P(HT) + P(TH) = + =
4 4 2
1
P(X = 2, i. e. , two heads turning up) = P(HH) =
4
The discrete probability distribution for X is as follows
X=𝐱 𝐢 0 1 2
p(xi ) 1 1 1
4 2 4
1 1 1 1 1
Mean , μ = ∑i xi . p(xi ) = (0). (4) + (1). (2) + (2). (4) = 2 + 2
𝛍=𝟏
1 1 1
V = (0 − 1)2 . 4 + (1 − 1)2 . 2 + (2 − 1)2 . 4
1 1
V= 4+0+4
𝟏
𝐕=𝟐
4
2. A random experiment of tossing a ‘die’ twice is performed. Random variable X on this
sample space is defined to be the sum of the two numbers turning up on the toss. Find the
discrete probability distribution for the random variable X and compute the corresponding mean
and standard deviation.
Sol: Here S = {(x, y) where x = 1,2, … ,6 ; y = 1,2, … ,6}
i.e., S = {(1,1), (1,2), (1,3), (1,4), (1,5), (1,6), (2,1), (2,2), (2,3), … , (6,5), (6,6)}
Number of elements in S=36.
The set of values of the random variable X defined as sum of two numbers on the face of the
‘die’ are {2,3,4,5, … ,10,11,12}
n(E) 1 2
Now, p(x1 ) = = 36 ; p(x2 ) = 36 ; …
n(S)
𝐗 2 3 4 5 6 7 8 9 10 11 12
= 𝐱𝐢
p(xi ) 1 2 3 4 5 6 5 4 3 2 1
36 36 36 36 36 36 36 36 36 36 36
1 2 1
V = (2 − 7)2 . 36 + (3 − 7)2 . 36 + ⋯ + (12 − 7)2 . 36
𝟐𝟏𝟎 𝟑𝟓
𝐕= =
𝟑𝟔 𝟔
Std. deviation , σ = √V
𝟑𝟓
𝛔 = √ 𝟔 = 𝟐. 𝟒𝟐
5
3. Show that the following distribution represents a discrete probability distribution. Find
the mean and variance
𝐱 10 20 30 40
p(x) 1 3 3 1
8 8 8 8
Mean μ = ∑ x . p(x)
1 3 3 1
μ = (10). + (20). + (30). + (40).
8 8 8 8
10+60+90+40 200
μ= =
8 8
𝛍 = 𝟐𝟓
Variance, V = ∑(x − μ)2 . p(x)
1 3 3 1
V = (10 − 25)2 . 8 + (20 − 25)2 . 8 + (30 − 25)2 . 8 + (40 − 25)2 . 8
𝟔𝟎𝟎
𝐕= = 𝟕𝟓
𝟖
4. Find the value of k such that the following distribution represents a finite probability
distribution. Hence find its mean and standard deviation. Also find p(x ≤ 1) , p(x > 1) and
p(−1 < x ≤ 2)
𝐱 -3 -2 -1 0 1 2 3
p(x) k 2k 3k 4k 3k 2k k
Sol: We must have p(x) ≥ 0 for all x and ∑ p(x) = 1. The first condition is satisfied if
k ≥ 0 . Since ∑ p(x) = 1
k + 2k + 3k + 4k + 3k + 2k + k = 1
16k = 1
𝟏
∴ 𝐤 = 𝟏𝟔
6
The discrete probability distribution is as follows
𝐱 -3 -2 -1 0 1 2 3
p(x) 1 2 3 4 3 2 1
16 16 16 16 16 16 16
Mean μ = ∑ x . p(x)
1 2 3 4 3 2 1
μ = (−3). 16 + (−2). 16 + (−1). 16 + (0). 16 + (1). 16 + (2). 16 + (3). 16
𝛍=𝟎
Variance, V = ∑(x − μ)2 . p(x)
1 2 3 4 3
V = (−3 − 0)2 . 16 + (−2 − 0)2 . 16 + (−1 − 0)2 . 16 + (0 − 0)2 . 16 + (1 − 0)2 . 16 +
2 1
(2 − 0)2 . 16 + (3 − 0)2 . 16
𝟏 𝟒𝟎 𝟓
𝐕 = 𝟏𝟔 (𝟗 + 𝟖 + 𝟑 + 𝟎 + 𝟑 + 𝟖 + 𝟗) = 𝟏𝟔 = 𝟐
Standard deviation σ = √V
𝟓
𝛔 = √𝟐 = 𝟏. 𝟓𝟖
Now, p(x ≤ 1) = p(−3) + p(−2) + p(−1) + p(0) + p(1)
1 2 3 4 3
= 16 + 16 + 16 + 16 + 16
𝟏𝟑
𝐩(𝐱 ≤ 𝟏) = 𝟏𝟔
5. A random variable X has the following probability function for various values of x.
𝐱 0 1 2 3 4 5 6 7
P(x) 0 k 2k 2k 3k k2 2k 2 7k 2 + k
i) Find k ii) Evaluate P(x < 6), P(x ≥ 6) and P(3 < x ≤ 6). Also find the probability
distribution and the distribution function of X.
7
Sol: We must have P(x) ≥ 0 and ∑ P(x) = 1.
The first condition is satisfied for k ≥ 0 and we have to find k such that
∑ P(x) = 1
i.e, 0 + k + 2k + 2k + 3k + k 2 + 2k 2 +(7k 2 + k) = 1
10k 2 + 9k − 1 = 0
(10k − 1)(k + 1) = 0
𝟏
𝐤 = 𝟏𝟎 and 𝐤 = −𝟏 ( Neglect)
𝐱 0 1 2 3 4 5 6 7
P(x) 0 1 1 1 3 1 1 17
10 5 5 10 100 50 100
1 1 1 3 1 81
= 0 + 10 + 5 + 5 + 10 + 100 = 100
𝐏(𝐱 < 𝟔) = 𝟎. 𝟖𝟏
1 17 19
P(x ≥ 6) = 50 + 100 = 100
𝐏(𝐱 ≥ 𝟔) = 𝟎. 𝟏𝟗
P(3 < x ≤ 6) = P(4) + P(5) + P(6)
3 1 1 33
= 10 + 100 + 50 = 100
𝐏(𝟑 < 𝐱 ≤ 𝟔) = 𝟎. 𝟑𝟑
The probability distribution is as follows
𝐱 0 1 2 3 4 5 6 7
8
𝐱 0 1 2 3 4 5 6 7
6. A random variable X take the values −3, −2, −1,0,1,2,3 such hat P(X = 0) = P(X < 0)
and P(X = −3) = P(X = −2) = P(X = −1) = P(X = 1) = P(X = 2) = P(X = 3) .Find the
probability distribution.
Sol: Let the distribution be as follows
𝐗 -3 -2 -1 0 1 2 3
P(X) p1 p2 p3 p4 p5 p6 p7
P(X) 1 1 1 3 1 1 1
9 9 9 9 9 9 9
9
7. The p.d.f. of variate X is given by the following table.
𝐱 0 1 2 3 4 5 6
For what value of K, this represents a valid probability distribution? Also find P(x ≥ 5) and
P(3 < x ≤ 6).
Sol: The probability distribution is valid if P(x) ≥ 0 and ∑ P(x) = 1
Hence we must have K ≥ 0 and K + 3K + 5K + 7K + 9K + 11K + 13K = 1
𝟏
49K = 1𝐊 =
𝟒𝟗
Binomial Distribution
If 𝐩 is the probability of success and 𝐪 is the probability of failure, the probability of 𝐱
success out of 𝐧 trials is given by 𝐏(𝐱) = 𝐧𝐂𝐱 𝐩𝐱 𝐪𝐧−𝐱
Mean and Standard deviation of the Binomial Distribution
Mean, 𝛍 = 𝐧𝐩
Variance, 𝐕 = 𝐧𝐩𝐪
Problems
4
1. Find the binomial probability which has mean 2 and variance 3.
4
Sol: Given Mean, μ = 2 and Variance, V = 3
10
np = 2
npq = V
4
2q =
3
𝟐
𝐪=
𝟑
2
Since p = 1 − q = 1 − 3
𝟏
𝐩=
𝟑
Since np = 2
n
=2
3
𝐧=𝟔
The binomial probability function is 𝐏(𝐱) = 𝐧𝐂𝐱 𝐩𝐱 𝐪𝐧−𝐱 becomes
𝟏 𝐱 𝟐 𝟔−𝐱
𝐏(𝐱) = 𝟔𝐂𝐱 ( ) ( )
𝟑 𝟑
The distribution of probability is as follows
𝐱 0 1 2 3 4 5 6
P(x) 𝟐 𝟔 𝟔 𝟏 𝟏 𝟐 𝟓 𝟔 𝟏 𝟐 𝟐 𝟒 𝟔 𝟏 𝟑 𝟐 𝟑 𝟔 𝟏 𝟒 𝟐 𝟐 𝟔 𝟏 𝟓 𝟐 𝟏 𝟏 𝟔
( ) 𝐂𝟏 ( ) ( ) 𝐂𝟐 ( ) ( ) 𝐂𝟑 ( ) ( ) 𝐂𝟒 ( ) ( ) 𝐂𝟓 ( ) ( ) ( )
𝟑 𝟑 𝟑 𝟑 𝟑 𝟑 𝟑 𝟑 𝟑 𝟑 𝟑 𝟑
2.When a coin is tossed 4 times, find the probability of getting i) exactly one head
ii) atmost 3 heads iii) atleast two heads.
𝟏 𝟏
Sol: Given 𝐧 = 𝟒 , 𝐩 = 𝐏(𝐇) = 𝟐 , 𝐪 = 𝐏(𝐓) = 𝟐 (since p + q = 1)
1 4
P(x) = 4Cx (2)
𝟏
𝐏(𝐱) = 𝟏𝟔 . 𝟒𝐂𝐱
11
1 1
i) P(1 head) = P(x = 1) = . 4C1 = (4)
16 16
𝟏
𝐏(𝐱 = 𝟏) = 𝟒
𝐏(𝐱 = 𝟏) = 𝟎. 𝟐𝟓
ii) P(atmost 3 heads) = P(x ≤ 3)
P(x ≤ 3) = P(x = 0) + P(x = 1) + P(x = 2) + P(x = 3)
1 1 1 1
P(x ≤ 3) = 16 . 4C0 + 16 . 4C1 + 16 . 4C2 + 16 . 4C3
1
P(x ≤ 3) = 16 [1 + 4 + 6 + 4]
15
P(x ≤ 3) = 16
𝐏(𝐱 ≤ 𝟑) = 𝟎. 𝟗𝟑𝟕𝟓
iii) P(atleast 2 heads) = P(x ≥ 2)
P(x ≥ 2) = 1 − P(x < 3)
P(x ≥ 2) = 1 − [P(x = 0) + P(x = 1)]
1 4 1
P(x ≥ 2) = 1 − [ . C0 + . 4C1 ]
16 16
1
P(x ≥ 2) = 1 − 16 [1 + 4]
5
P(x ≥ 2) = 1 − 16
11
P(x ≥ 2) = 16
𝐏(𝐱 ≥ 𝟐) = 𝟎. 𝟔𝟖𝟕𝟓
3. In a consignment of electric lamps 5% are defective. If a random sample of 8 lamps are
inspected. What is the probability that one or more lamps are defective?
𝟓
Sol: Given n = 8, Probability of defective lamp =𝐩 = 𝟓% = 𝟏𝟎𝟎 = 𝟎. 𝟎𝟓
∴ q = 1 − p = 1 − 0.05
𝐪 = 𝟎. 𝟗𝟓
From binomial distribution,
P(x) = ncx px qn−x
P(x) = 8cx (0.05)x (0.95)8−x
Probability that one or more lamps are defective
P(x ≥ 1) = 1 − P(x < 1)
P(x ≥ 1) = 1 − P(x = 0)
12
P(x ≥ 1) = 1 − 8c0 (0.05)0 (0.95)8−0
𝐏(𝐱 ≥ 𝟏) = 𝟎. 𝟑𝟑𝟔𝟔.
Thus the required probability is 0.3366
4. The probability that a person aged 60 years will live upto 70 is 0.65. What is the
probability that out of 10 person aged 60 atleast 7 of them will live upto 70.
Sol: Given n = 10, 𝐩 = 𝟎. 𝟔𝟓
∴ q = 1 − p = 1 − 0.65
𝐪 = 𝟎. 𝟑𝟓
Let x denote the number of person aged 60 years living upto 70 years.
From binomial distribution,
P(x) = ncx px qn−x
P(x) = 10cx (0.65)x (0.35)10−x
Probability that out of 10 person aged 60 atleast 7 of them will live upto 70
P(x ≥ 7) = P(7) + P(8) + P(9) + P(10)
P(x ≥ 7) = 10c7 (0.65)7 (0.35)3 + 10c8 (0.65)8 (0.35)2 + 10c9 (0.65)9 (0.35)1 +
10c10 (0.65)10 (0.35)0
P(𝐱 ≥ 𝟕) = 𝟎. 𝟓𝟏𝟑𝟖
5.The number of telephone lines busy at an instant of time is a binomial variate with
probability 0.1 that a line is busy. If 10 such lines are chosen at random, what is the
probability that i) no line is busy ii) all lines are busy iii) atleast one line is busy
iv) atmost 2 lines are busy.
Sol: Given n = 10, 𝐩 = 𝟎. 𝟏
∴ q = 1 − p = 1 − 0.1
𝐪 = 𝟎. 𝟗
Let x denote the number of telephone lines busy.
From binomial distribution,
P(x) = ncx px qn−x
𝐏(𝐱) = 𝟏𝟎𝐜𝐱 (𝟎. 𝟏)𝐱 (𝟎. 𝟗)𝟏𝟎−𝐱
i) Probability that no line is busy
P(x = 0) = 10c0 (0.1)0 (0.9)10−0 = (0.9)10 = 𝟎. 𝟑𝟒𝟖𝟕
ii) Probability that all lines are busy
P(x = 10) = 10c10 (0.1)10 (0.9)0 = (𝟎. 𝟏)𝟏𝟎
13
iii) Probability that atleast one line is busy
P(x ≥ 1) = 1 − P(x < 1)
P(x ≥ 1) = 1 − P(x = 0)
P(x ≥ 1) = 1 − 10c0 (0.1)0 (0.9)10
P(x ≥ 1) = 1 − 0.3487
P(𝐱 ≥ 𝟏) = 𝟎. 𝟔𝟓𝟏𝟑
iv) Probability that atmost 2 lines are busy
P(x ≤ 2) = P(x = 0) + P(x = 1) + P(x = 2)
P(x ≤ 2) = (0.9)10 + 10c1 (0.1)1 (0.9)2 + 10c2 (0.1)2 (0.9)8
P(𝐱 ≤ 𝟐) = 𝟎. 𝟗𝟐𝟗𝟖
6. In a quiz contest of answering ‘Yes’ or ‘No’ what is the probability of guessing atleast 6
answers correctly out of 10 questions asked ? Also find the probability of the same if
there are 4 options for a correct answer.
Sol: Given n = 10
Let x denote the correct answer .
𝟏
i) 𝐩 = 𝟐 = 𝟎. 𝟓
∴ q = 1 − p = 1 − 0.5
𝐪 = 𝟎. 𝟓
From binomial distribution,
P(x) = ncx px qn−x
P(x) = 10cx (0.5)x (0.5)10−x
𝐏(𝐱) = 𝟏𝟎𝐜𝐱 (𝟎. 𝟓)𝟏𝟎
Probability of guessing atleast 6 answers correctly out of 10 questions
P(x ≥ 6) = P(x = 6)+P(x = 7) + P(x = 8) + P(x = 9) + P(x = 10)
P(x ≥ 6) = 10c6 (0.5)10 + 10c7 (0.5)10 + 10c8 (0.5)10 + 10c9 (0.5)10 + 10c10 (0.5)10
P(x ≥ 6) = (0.5)10 [10c6 + 10c7 + 10c8 + 10c9 + 10c10 ]
P(x ≥ 6) = (0.5)10 [210 + 120 + 45 + 10 + 1]
P(𝐱 ≥ 𝟔) = 𝟎. 𝟑𝟕𝟕𝟎
𝟏
ii) 𝐩 = 𝟒
1
∴q =1−p= 1−4
𝟑
𝐪=𝟒
14
From binomial distribution,
P(x) = ncx px qn−x
1 x 3 10−x
P(x) = 10cx ( ) ( )
4 4
310−x
P(x) = 10cx x+10−x
4
310−x
P(x) = 10cx
410
𝟏
𝐏(𝐱) = [𝟑𝟏𝟎−𝐱 . 𝟏𝟎𝐜𝐱 ]
𝟒𝟏𝟎
Probability of guessing atleast 6 answers correctly out of 10 questions
P(x ≥ 6) = P(x = 6)+P(x = 7) + P(x = 8) + P(x = 9) + P(x = 10)
1
P(x ≥ 6) = 410 [34 . 10c6 + 33 . 10c7 + 32 . 10c8 + 3.10c9 + 10c10 ]
1
P(x ≥ 6) = 410 [81.210 + 27.120 + 9.45 + 3.10 + 1]
P(𝐱 ≥ 𝟔) = 𝟎. 𝟎𝟏𝟗𝟕
7. In sampling a large number of parts manufactured by a company, the mean number of
defectives in samples of 20 is 2. Out of 1000 such samples how many would be expected
to contain atleast 3 defective parts.
Sol: Given Mean, μ = 2 and n = 20
np = 2
20p = 2
𝟏
𝐩 = 𝟏𝟎 = 𝟎. 𝟏
q= 1−p
1 𝟗
q = 1 − 10 𝐪 = 𝟏𝟎 = 𝟎. 𝟗
15
P(x ≥ 3) = 1 − [P(0) + P(1) + P(2)]
P(x ≥ 3) = 1 − [20c0 (0.1)0 (0.9)20 + 20c1 (0.1)1 (0.9)19 + 20c2 (0.1)2 (0.9)18 ]
P(x ≥ 3) = 1 − 0.6769
P(𝐱 ≥ 𝟑) = 𝟎. 𝟑𝟐𝟑𝟏
Thus, the number of defectives in 1000 such samples= 1000*0.3231=323.1≅323
8. If the mean and standard deviation of the number of correctly answered questions in a
test given to 4096 students are 2.5 and √1.875 . Find an estimate the number of
candidates answering correctly i) 5 questions ii) 2 or less iii) 8 or more.
16
Estimation for 4096 students is =4096*0.0005 ≅2
Number of students correctly answering 8 or more than 8 questions is 2
9. In 800 families with 5 children each how many families would be expected to have
i) 3 boys ii)5 girls iii) either 2 or 3 boys iv) atmost 2 girls by assuming probabilities for
boys and girls to be equal.
10. An airline knows that 5% of the people making reservations on a certain flight will not
turn up. Consequently, their policy is to sell 52 tickets for a flight that can only hold 50
people. What is the probability that there will be a seat for every passenger who turns up?
11. 4 coins are tossed 100 times and the following results were obtained. Fit a binomial
distribution for the data and calculate the theoretical frequencies.
Number of 0 1 2 3 4
heads
Frequency 5 29 36 25 5
Sol: Let x denote the number of heads and f the corresponding frequency. Since the data is in
the form of a frequency distribution we shall first calculate the mean.
∑ fx (0)(5)+(1)(29)+(2)(36)+(3)(25)+(4)(5) 196
Mean, μ = ∑f
= = 100 = 1.96
5+29+36+25+5
17
Poisson Distribution
Poisson distribution is regarded as the limiting form of the binomial distribution when 𝐧 is
very large (n → ∞) and 𝐩 the probability of success is very small (p → 0) so that 𝐧𝐩 tends
to a fixed finite constant say 𝐦.
Let m be a positive constant. Consider the infinite discrete probability distribution of a
𝐦𝐱
random variable x is given by P(𝐱) = 𝐞−𝐦
𝐱!
The above probability density function is called the poisson probability function.
Mean and Standard deviation of the Poisson Distribution
Mean, 𝛍 = 𝐦
Variance, 𝐕 = 𝐦
Std. Deviation, 𝛔 = √𝐦
Problems
1. The probabilities of a poisson variate taking the values 3 and 4 are equal. Find the PDF.
Also, calculate the probabilities of the variate taking the values 0 and 1.
Sol: Given P(x = 3) = P(x = 4)
m3 −m m4 −m
e = e
3! 4!
4! m4
=
3! m3
∴ 𝐦 = 𝟒.
From poisson distribution,
mx −m 4x −4
P(x) = e = e
x! x!
40 −4 41 −4
P(x = 0) = e = 𝟎. 𝟎𝟏𝟖𝟑 P(x = 1) = e = 𝟎. 𝟎𝟕𝟑𝟑
0! 1!
1
2. In a certain factory turning out razor blades there is a small probability of for any
500
blade to be defective. The blades are supplied in packets of 10. Use poisson distribution
to calculate the approximate number of packets containing i) no defective ii) one
defective iii) two defective blades in a consignment of 10,000 packets.
1
Sol: Probability of a defective blade= p = 500 = 0.002
18
𝐦 = 𝟎. 𝟎𝟐
From poisson distribution,
mx −m
P(x) = e
x!
(0.02)x −(0.02)
P(x) = e
x!
(0.02)x
P(x) = (0.9802) ( Since e−0.02 = 0.9802)
x!
i) Probability of no defective
(0.02)0
P(x = 0) = (0.9802) = 0.9802
0!
19
ii) Probability of more than 3 accidents in a year
P(x > 3) = 1 − P(x ≤ 3)
P(x > 3) = 1 − [P(0) + P(1) + P(2) + P(3)]
(3)0 (3)1 (3)2 (3)3
P(x > 3) = 1 − [(0.05) + (0.05) + (0.05) + (0.05) ]
0! 1! 2! 3!
4x −4
P(x) = e
x!
Let x denotes the number of defective fuses in a box containing 200 fuses
i) Probability that the box contains no defective fuse
40 −4
P(x = 0) = e = 𝟎. 𝟎𝟏𝟖𝟑
0!
ii) Probability that the box contains at least one defective fuse is
P(x ≥ 1) = 1 − P(x < 1)
= 1 − P(x = 0)
= 1 − 0.013
𝐏(𝐱 ≥ 𝟏) = 𝟎. 𝟗𝟖𝟏𝟕
iii) Probability that the box contains exactly 3 defective fuse
43 −4
P(x = 3) = e
3!
𝐏(𝐱 = 𝟑) = 𝟎. 𝟏𝟗𝟓𝟐
iv) Probability that the box contains 3 or more defective fuses
P(x ≥ 3) = 1 − P(x < 3)
40 −4 41 −4 42 −4
P(x ≥ 3) = 1 − [ e + e + e ]
0! 1! 2!
20
𝐏(𝐱 ≥ 𝟑) = 𝟎. 𝟕𝟔𝟐𝟏
5. A certain screw making machine has a chance of producing 2 defectives out of 1000. The
screws are packed in boxes of 100. Using poisson distribution , find the approximate
number of boxes containing i) no defective screw ii)one defective screw iii) two defective
screw, in a consignment of 5000 boxes.
Sol: The probability that a screw is defective is given as p=2/1000 = 0.002.
Here n=100
Mean = np=100X0.002=0.2 ; 𝐦 = 𝟎. 𝟐
mx −m
From poisson probability function P(x) = e
x!
(0.2)x −0.2
P(x) = e
x!
Let x denotes the number of defective fuses in a box containing 100 screws
i) Probability that the box contains no defective screw
𝟎.𝟐𝟎
𝐏(𝐱 = 𝟎) = 𝐞−𝟎.𝟐 = 𝟎. 𝟖𝟏𝟖𝟕
𝟎!
= 0.2X0.8187
𝐏(𝐱 = 𝟏) = 𝟎. 𝟏𝟔𝟑𝟕
Number of boxes containing one defective screw in 5000 boxes =5000*0.1637=819
iii) Probability that the box contains two defective screw is
(0.2)2 −0.2
P(x = 2) = e
2!
= 0.02X0.8187
𝐏(𝐱 = 𝟐) = 𝟎. 𝟎𝟏𝟔𝟑
Number of boxes containing two defective screw in 5000 boxes =5000*0.0163=82
6. A communication channel receives independent pulses at the rate of 12 pulses per micro
second. The probability of transmission error is 0.001 for each micro second. Compute
the probabilities of i) no error during a micro second ii) one error per micro second
iii) atleast one per micro second iv) two errors v) atmost two errors.
Sol: Given n = 12 and p = 0.001
Mean number of errors in one micro second
21
μ = np
m = 12 ∗ 0.001 (Since μ = m, from poisson distribution)
𝐦 = 𝟎. 𝟎𝟏𝟐
From Poisson distribution
mx −m
P(x) = .e
x!
(0.012)x −(0.012)
P(x) = .e
x!
(0.012)x
P(x) = (0.9880) (Sincee−(0.012) = 0.9880)
x!
𝐏(𝐱 = 𝟎) = 𝟎. 𝟗𝟖𝟖𝟎
ii) Probability of one error during a micro second
(0.012)1
P(x = 1) = (0.9880) 1!
𝐏(𝐱 = 𝟏) = 𝟎. 𝟎𝟏𝟏𝟗
iii) Probability of atleast one error during a micro second
P(x ≥ 1) = 1 − P(x < 1)
P(x ≥ 1) = 1 − P(x = 0)
P(x ≥ 1) = 1 − P(x = 0)
P(x ≥ 1) = 1 − 0.9880
𝐏(𝐱 ≥ 𝟏) = 𝟎. 𝟎𝟏𝟐
iv) Probability of two error during a micro second
(0.012)2
P(x = 2) = (0.9880) 2!
𝐏(𝐱 = 𝟐) = 𝟎. 𝟎0007
v) Probability of atmost two error during a micro second
𝐏(x ≤ 2) = P(x = 0) + P(x = 1) + P(x = 2)
P(x ≤ 2) = 0.9880 + 0.0119 + 0.00007
𝐏(𝐱 ≤ 𝟐) = 𝟎.99997
22
Continuous Probability Distribution
If for every x belonging to the range of a continuous random variable X, we assign a real
number f(x) satisfying the conditions,
∞
i) f(x) ≥ 0 ii) ∫−∞ f(x)dx = 1 , then f(x) is called a Continuous probability function or
probability density function ( p.d.f )
Problems
1. Find which of the following is a probability density function
2x, 0 < x < 1 |x|, 0 < |x| ≤ 1
a) f1 (x) = { b) f2 (x) = {
0, otherwise 0, otherwise
2x , 0 < x ≤ 1
c) f3 (x) = { 4 − 4x , 1 < x < 2
0 , otherwise
∞
Sol: Conditions for p.d.f are f(x) ≥ 0 and ∫−∞ f(x)dx = 1
a) Clearly f1 (x) ≥ 0
1
∞ 1 1 x2
∫−∞ f1 (x)dx = ∫0 f1 (x) dx = ∫0 2x dx = 2. [ 2 ] = [1 − 0] = 1
0
∴ f1 (x) is a p.d.f
b) Evidently f2 (x) = |x| ≥ 0
23
∞ 1 1
∫−∞ f2 (x)dx = ∫−1 f2 (x) dx = ∫−1|x| dx
−x, − 1 < x < 0
But,|x| = {
+x, 0 < x < 1
∞ 0 1
∴ ∫−∞ f2 (x)dx = ∫−1(−x) dx + ∫0 x dx
∞ 0 1
x2 x2
∫ f2 (x)dx = − [ ] + [ ]
2 −1 2 0
−∞
∞
1 1 1 1
∫ f2 (x)dx = − [0 − ] + [ − 0] = + = 1
2 2 2 2
−∞
∴ f2 (x) is a p.d.f
c) Given f3 (x) = 2x > 0 in 0 < x ≤ 1
But f3 (x) = 4 − 4x is negative in 1 < x < 2
The first condition is not satisfied.
∴ f3 (x) is not a p.d.f
x
+ c, 0 ≤ x ≤ 3
2. Find the value of c such that f(x) = { 6 is a p.d.f . Also find
0, elsewhere
P(1 ≤ x ≤ 2).
∞
Sol: Here f(x) ≥ 0 if c ≥ 0 ; Also we must have ∫−∞ f(x)dx = 1
3 x
i.e., ∫0 (6 + c) dx = 1
3
x2
i.e., [12 + cx] = 1
0
1
i.e., 12 (9 − 0) + c(3 − 0) = 1
9
i.e., 12 + 3c = 1
9
i.e., 3c = 1 − 12
3
3c =
12
𝟏
∴ 𝐜 = 𝟏𝟐
2
Now, P(1 ≤ x ≤ 2) = ∫1 f(x)dx
2 x 1
= ∫1 (6 + 12) dx
24
2
x2 x
= [12 + 12]
1
1 1
= 12 (4 − 1) + 12 (2 − 1)
3 1 4
= 12 + 12 = 12
𝟏
𝐏(𝟏 ≤ 𝐱 ≤ 𝟐) = 𝟑
kx 2 , 0 < x < 3
3. Find the constant k such that f(x) = { is a p.d.f . Also compute
0, otherwise
i) P(1 < x < 2) ii) P(x ≤ 1) iii) P(x > 1) iv) Mean v) Variance
∞
Sol: : Here f(x) ≥ 0 if k ≥ 0 ; Also we must have ∫−∞ f(x)dx = 1
3
i.e., ∫0 kx 2 dx = 1
3
kx3
i.e., [ ] =1
3 0
k
i.e., 3 (27 − 0) = 1
i.e., 9k = 1
𝟏
i.e., 𝐤 = 𝟗
x2
, 0<x<3
∴ f(x) = { 9
0, otherwise
2
i) P(1 < x < 2) = ∫1 f(x)dx
2 x2
= ∫1 dx
9
2
x3
= [27]
1
1
= 27 (8 − 1)
𝟕
𝐏(𝟏 < 𝐱 < 𝟐) = 𝟐𝟕
1
ii) P(x ≤ 1) = ∫0 f(x)dx
1
x2
P(x ≤ 1) = ∫ dx
9
0
1
x3
= [27]
0
1
= 27 (1 − 0)
25
𝟏
𝐏(𝐱 ≤ 𝟐) =
𝟐𝟕
3
iii) P(x > 1) = ∫1 f(x)dx
3 x2
= ∫1 dx
9
3
x3
P(x > 1) = [27]
1
1
= 27 (27 − 1)
𝟐𝟔
𝐏(𝐱 > 𝟏) = 𝟐𝟕
∞
iv) Mean, μ = ∫−∞ x. f(x)dx
3 x2
= ∫0 x. dx
9
3 x3
= ∫0 dx
9
3
x4
μ = [36]
0
1
μ = 36 (81 − 0)
81
μ = 36
𝟗
𝛍=𝟒
∞
v)Variance, V = ∫−∞ x 2 . f(x) dx − μ2
3 x2 9 2
= ∫0 x 2 . dx − (4)
9
3 x4 81
= ∫0 dx − 16
9
3
x5 81
= [45] − 16
0
1 81
V = 45 [35 − 0] − 16
243 81
V= − 16
45
81
V = 240
𝟐𝟕
𝐕 = 𝟖𝟎
26
Exponential Distribution
The continuous probability distribution having the probability density function f(x) given by
αe−αx , for x > 0
f(x) = { , where α > 0 is known as the exponential distribution.
0, otherwise
∞
Evidently f(x) ≥ 0 and we have ∫−∞ f(x)dx = 1.
f(x) satisfy both the conditions required for a continuous probability function.
Mean and standard deviation of the Exponential Distribution
𝟏
Mean , 𝛍 = 𝛂
𝟏
Std. deviation , 𝛔 =
𝛂
Problems
1. If x is an exponential variate with mean 3 find i) P(x > 1) ii) P(x < 3)
Sol: The p.d.f of the exponential distribution is given by
αe−αx , for 0 < x < ∞
f(x) = {
0, otherwise
1
The mean of the distribution is given by α.
1
By data, μ = α = 3
𝟏
∴ 𝛂=𝟑
1 −x
e 3, for 0 < x < ∞
Hence , f(x) = {3
0, otherwise
i) P(x > 1) = 1 − P(x ≤ 1)
P(x > 1) = 1 − P(x ≤ 1)
1
=1− ∫0 f(x)dx
x
11
= 1 − ∫0 3 e−3 dx
x
1 1
= 1 − 3 ∫0 e−3 dx
x 1
−
1 e 3
= 1 − 3[ 1 ]
−
3 0
1
= 1 + [e−3 − e−0 ]
1
= 1 + e− 3 − 1
27
1
= e− 3
𝐏(𝐱 > 𝟏) = 0.7165
3
= −[e−1 − e−0 ]
1
= −e−1 + 1 = 1 − e
1 −x
f(x) = e 5, 0 < x < ∞
5
5
51 x
= ∫0 5 e−5 dx
x
1 5
= 5 ∫0 e−5 dx
x 5
−
1 e 5
P(x < 5) = [ ]
5 −1
5 0
= −[e−1 − e−0 ]
1
= −e−1 + 1 = 1 − e
28
𝐏(𝐱 < 𝟓) = 𝟎. 𝟔𝟑𝟐𝟏
10
10 1 −x
= ∫5 e 5 dx
5
10 x
1
= 5 ∫5 e−5 dx
x 10
−
1 e 5
P(5 < x < 10) = 5 [ 1 ]
−
5 5
= −[e−2 − e−1 ]
1 1
= −e−2 + e−1 = e − e2
1 x
f(x) = e−5 , 0 < x < ∞
5
∞
= −[e−∞ − e−2 ]
= −e−∞ + e−2 = −0 + e−2
𝐏(𝐱 ≥ 𝟏𝟎) = 𝟎. 𝟏𝟑𝟓𝟑
29
ii) P(x < 10) = 1 − P(x ≥ 10)
P(x < 10) = 1 − 0.1353
𝐏(𝐱 < 𝟏𝟎) = 𝟎. 𝟖𝟔𝟒𝟕
12
= −[e−2.4 − e−2 ]
P(10 < x < 12) = e−2 − e−2.4
𝐏(𝟏𝟎 < 𝐱 < 𝟏𝟐) = 𝟎. 𝟎𝟒𝟒𝟔
4.The life of a compressor manufactured by a company is known to be 200 months on an
average following an exponential distribution. Find the probability that the life of a
compressor of that company is i) less than 200 months ii) between 100 months and 25
years.
Sol: We have, f(x) = αe−αx , 0 < x < ∞
1
By data, μ = α = 200
𝟏
∴ 𝛂 = 𝟐𝟎𝟎
1 − x
f(x) = e 200 , 0 < x < ∞
200
200
1 200 − x
= 200 ∫0 e 200 dx
x 200
−
1 e 200
= 200 [ 1 ]
−
200 0
= −[e−1 − e−0 ]
30
1
= −e−1 + 1 = 1 −
e
300 1 x
= ∫100 e−200 dx
200
x
1 300
= 200 ∫100 e−200 dx
x 300
−
1 e 200
= 200 [ 1 ]
−
200 100
= −[e−1.5 − e−2 ]
𝐏(𝟏𝟎𝟎 < 𝐱 < 𝟑𝟎𝟎) = 𝟎. 𝟑𝟖𝟑𝟒
k
5. Determine k if random variable x has the density function f(x) = 1+x2 , − ∞ < x < ∞
hence find i)P(x ≥ 0) ii) P(0 < x < 1) iii) c. d. f
Sol: Given f(x) is a probability density function.
∞
∴ f(x) ≥ 0 and ∫−∞ f(x)dx = 1
∞ k
i.e., ∫−∞ 1+x2 dx = 1
∞
1
2k ∫ dx = 1
1 + x2
0
2k[tan−1 x]∞
0 =1 ; 2k[tan−1 (∞) − tan−1 (0)] = 1
π
2k [ − 0] = 1
2
𝟏
k. π = 1𝐤 =
𝛑
1
f(x) = π 2
1+x
∞
i)P(x ≥ 0) = ∫ f(x)dx
0
1
∞ π
= ∫0 dx
1+x2
31
1 ∞ 1
= ∫0 dx
π 1+x2
1
= π [tan−1 x]∞
0
1
P(x ≥ 0) = [tan−1 (∞) − tan−1 (0)]
π
1 π
P(x ≥ 0) = [ − 0]
π 2
𝟏
𝐏(𝐱 ≥ 𝟎) =
𝟐
1
1
P(0 < x < 1) = [tan−1 x]10
π
1
= [tan−1 (1) − tan−1 (0)]
π
1 π
= π [ 4 − 0]
𝟏
𝐏(𝟎 < 𝐱 < 𝟏) =
𝟒
x
iii)c. d. f = ∫ f(x)dx
−∞
1
x π
= ∫−∞ 1+x 2
dx
1 x 1
= π ∫−∞ 1+x2 dx
1
c. d. f = [tan−1 x]x−∞
π
1
= [tan−1 (x) − tan−1 (−∞)]
π
1 π
= π [tan−1 x − (− 2 )]
𝟏 𝛑
𝐜. 𝐝. 𝐟 = 𝛑 [𝐭𝐚𝐧−𝟏 𝐱 + 𝟐] if −∞ < 𝐱 < ∞.
32
6. The daily turn over in a medical shop is exponentially distributed with Rs.6000 as the
average with a net profit of 8%. Find the probability that the net profit exceeds Rs.500
on a randomly chosen day.
Sol: Let ‘x’ denote the turn over per day.
∴ By exponential distribution
f(x) = αe−αx ; x > 0.
1
Given, By data, μ = α = 6000
𝟏
∴ 𝛂 = 𝟔𝟎𝟎𝟎
1 − x
f(x) = e 6000 , 0 < x < ∞
6000
Let ‘A’ be the turn over for which the net profit is Rs.500.
Since the net profit is 8% of the turn over,
8
A∗ = 500
100
⇒ 𝐀 = 𝟔𝟐𝟓𝟎
∴ The probability of profit exceeding Rs.500 is
∞
33
Normal Distribution
The continuous probability distribution having the probability density function f(x) given
(x−μ)2
1 −
by f(x) = σ√2π e 2σ2 where −∞ < x < ∞ , − ∞ < μ < ∞ and σ > 0 is known as the
normal distribution.
∞
Evidently f(x) ≥ 0 and we have ∫−∞ f(x)dx = 1.
f(x) satisfy both the conditions required for a continuous probability function.
Mean and standard deviation of the Normal Distribution
𝐌𝐞𝐚𝐧 = 𝛍
𝐕𝐚𝐫𝐢𝐚𝐧𝐜𝐞 = 𝛔𝟐
Geometrically we can write
∞ 𝟎 ∞ 𝟏
𝐢) ∫−∞ 𝛟(𝐳)𝐝𝐳 = 𝟏 𝐢𝐢) ∫−∞ 𝛟(𝐳)𝐝𝐳 = ∫𝟎 𝛟(𝐳)𝐝𝐳 = 𝟐
Note
1. P(−∞ ≤ z ≤ ∞) = 1
1
2. P(−∞ ≤ z ≤ 0) = 2
1
3. P(0 ≤ z ≤ ∞) 𝐨𝐫 P(z ≥ 0) = 2
Problems
1. Evaluate the following probabilities with the help of normal probability tables
i) P(z ≥ 0.85) ii) P(−1.64 ≤ z ≤ −0.88) iii) P(z ≤ −2.43) iv) P(|z| ≤ 1.94)
Sol: i) P(z ≥ 0.85) = P(z ≥ 0) − P(0 ≤ z ≤ 0.85)
= 0.5 − ϕ(0.85)
= 0.5 − 0.3023
𝐏(𝐳 ≥ 𝟎. 𝟖𝟓) = 𝟎. 𝟏𝟗𝟕𝟕
34
ii) P(−1.64 ≤ z ≤ −0.88)
By Symmetry, P(−1.64 ≤ z ≤ −0.88) = P(0.88 ≤ z ≤ 1.64)
= P(0 ≤ z ≤ 1.64) − P(0 ≤ z ≤ 0.88)
= ϕ(1.64) − ϕ(0.88)
= 0.4495 − 0.3106
𝐏(−𝟏. 𝟔𝟒 ≤ 𝐳 ≤ −𝟎. 𝟖𝟖) = 𝟎. 𝟏𝟑𝟖𝟗
iii) P(z ≤ −2.43) = P(z ≥ 2.43)
P(z ≤ −2.43) = P(z ≥ 0) − P(0 ≤ z ≤ 2.43)
= 0.5 − ϕ(2.43)
= 0.5 − 0.4925
𝐏(𝐳 ≥ −𝟐. 𝟒𝟑) = 𝟎. 𝟎𝟎𝟕𝟓
iv) P(|z| ≤ 1.94) = P(−1.94 ≤ z ≤ 1.94)
= 2 P(0 ≤ z ≤ 1.94)
= 2. ϕ(1.94)
= 2. (0.4738)
𝐏(|𝐳| ≤ 𝟏. 𝟗𝟒) = 𝟎. 𝟗𝟒𝟕𝟔
2. If x is a normal variate with mean 30 and standard deviation 5 find the probability that
i) 26 ≤ x ≤ 40 ii) x ≥ 45
x−μ x−30
Sol: We have standard normal variate, z = = [By data μ = 30, σ = 5 ]
σ 5
35
P(z ≥ 3) = P(z ≥ 0) − P(0 ≤ z ≤ 3)
= 0.5 − ϕ(3)
= 0.5 − 0.4987
𝐏(𝐳 ≥ 𝟑) = 𝟎. 𝟎𝟎𝟏𝟑
3. The marks of 1000 students in an examination follows a normal distribution with mean
70 and standard deviation 5. Find the number of students whose marks will be
i) less than 65 ii) more than 75 iii) 65 to 75
Sol: Let x represent the marks of students
x−μ x−70
We have standard normal variate, z = = [By data μ = 70, σ = 5 ]
σ 5
36
iii) To find 𝐏(𝟔𝟓 < 𝐱 < 𝟕𝟓)
Hence we need to find , 𝐏(−𝟏 < 𝐳 < 𝟏)
𝐏(−𝟏 < 𝐳 < 𝟏) = 𝟐𝐏(𝟎 < 𝐳 < 𝟏)
= 2ϕ(1)
= 2(0.3413)
𝐏(−𝟏 < 𝐳 < 𝟏) = 𝟎. 𝟔𝟖𝟐𝟔
Number of students scoring marks between 65 to 75 marks=1000*0.6826= 𝟔𝟖𝟐. 𝟔 ≅
𝟔𝟖𝟑
4. In a test on electric bulbs, it was found that the life time of a particular brand was
distributed normally with an average life of 2000 hours and S.D of 60 hours. If a firm
purchases 2500 bulbs, find the number of bulbs that are likely to last for i) more than 2100
hours ii) less than 1950 hours iii) between 1900 to 2100 hours.
Sol: Given μ = 2000, σ = 60
𝐱−𝛍 𝐱−𝟐𝟎𝟎𝟎
We have standard normal variate, 𝐳 = =
𝛔 𝟔𝟎
1950−2000
If x = 1950 , z = = −0.83
60
37
= 0.5 − 0.2967
𝐏(𝐳 < −𝟎. 𝟖𝟑) = 𝟎. 𝟐𝟎𝟑𝟑
∴ Number of bulbs that are likely to last for more than 1950 hours = 𝟐𝟓𝟎𝟎 ∗ 𝟎. 𝟐𝟎𝟑𝟑
= 𝟓𝟎𝟖. 𝟐𝟓 ≅ 𝟓𝟎𝟖
iii) To find 𝐏(𝟏𝟗𝟎𝟎 < 𝐱 < 𝟐𝟏𝟎𝟎)
1900−2000 2100−2000
If x = 1900 , z = = −1.67 ; If x = 2100 , z = = 1.67
60 60
So we have ,
P(z < z1 ) = 0.31 and P(z > z2 ) = 0.08
i.e., 0.5 + φ(z1 ) = 0.31 and 0.5 − φ(z2 ) = 0.08
φ(z1 ) = 0.31 − 0.5 and φ(z2 ) = 0.5 − 0.08
38
⇒ 𝐳𝟏 = −𝟎. 𝟓 and 𝐳𝟐 = 𝟏. 𝟒
45−μ 64−μ
i.e., = −0.5 and = 1.4
σ σ
40−50 50−50
If x = 40 , z = = −1.67 ; If x = 50 , z = =0
6 6
39
7. In an examination 7% of students score less than 35% marks and 89% of students score
less than 60% of marks. Find the mean and S.D of the distribution if the marks are
normally distributed. It is given that P(1.2263)=0.39 and P(1.4757)=0.43.
Sol: Let μ and σ be the mean and S.D of the normal distribution.
By data, P(x < 35) = 0.07 and P(x < 60) = 0.89
𝐱−𝛍
We have standard normal variate, 𝐳 = 𝛔
35−μ
When x = 35 , z = = z1 (Say)
σ
60−μ
x = 60 , z = = z2 (Say)
σ
So we have ,
P(z < z1 ) = 0.07 and P(z < z2 ) = 0.89
i.e., 0.5 + φ(z1 ) = 0.07 and 0.5 + φ(z2 ) = 0.89
40
MODULE-2
JOINT PROBABILITY DISTRIBUTIONS AND MARKOV CHAIN
Joint Probability Distribution
If 𝑋 and 𝑌 are two discrete random variables, we define the joint probability
function of 𝑋 and 𝑌 by P(𝑋 = 𝑥, 𝑌 = 𝑦) = 𝑓(𝑥, 𝑦) where 𝑓(𝑥, 𝑦) satisfy the conditions
𝑓(𝑥, 𝑦) ≥ 0 and ∑𝑥 ∑𝑦 𝑓(𝑥, 𝑦) = 1.
The Variance of 𝑋 is denoted by 𝑉(𝑋) = ∑𝑛𝑖=1 𝑥𝑖2 𝑓(𝑥𝑖 ) − 𝜇 2 , where 𝜇 is the mean of 𝑋.
𝑉 = 𝐸(𝑋 2 ) − 𝜇𝑋2
If X and Y are two discrete random variables having the joint probability distribution
f(x, y) then the expectation of X and Y are defined as
𝜇𝑋 = 𝐸(𝑋) = ∑ 𝑥𝑖 𝑓(𝑥𝑖 )
𝑖
𝜇𝑌 = 𝐸(𝑌) = ∑ 𝑦𝑗 𝑔(𝑦𝑗 )
𝑗
If X and Y are random variables having mean μX [orE(X)] and μY [orE(Y)] respectively,
then the covariance of X and Y are defined as
Cov(X, Y) = E(XY) − E(X)E(Y)
𝐶𝑜𝑣(𝑋,𝑌)
The correlation of 𝑋 and 𝑌 denoted by 𝜌(𝑋, 𝑌) is given by 𝜌(𝑋, 𝑌) = where
𝜎𝑋 𝜎𝑌
41
Problems
1. The joint distribution of two random variables 𝑋 and 𝑌 is as follows
𝑌→ -4 2 7
𝑋↓
1 1 1 1
8 4 8
5 1 1 1
4 8 8
𝑌→ -4 2 7 𝑓(𝑥𝑖 )
𝑋↓
1 1 1 1 1 1 1 1
+ + =
8 4 8 8 4 8 2
5 1 1 1 1 1 1 1
+ + =
4 8 8 4 8 8 2
𝑔(𝑦𝑗 ) 1 1 3 1 1 3 1 1 1 1
+ = + = + =
8 4 8 4 8 8 8 8 4
Distribution of 𝑋
𝑥𝑖 1 5
𝑓(𝑥𝑖 ) 1 1
2 2
Distribution of 𝑌
𝑦𝑗 -4 2 7
𝑔(𝑦𝑗 ) 3 3 1
8 8 4
42
i) 𝐸(𝑋) = ∑𝑖 𝑥𝑖 𝑓(𝑥𝑖 ) 𝐸(𝑌) = ∑𝑗 𝑦𝑗 𝑔(𝑦𝑗 )
1 1 3 3 1
𝐸(𝑋) = (1) (2) + (5) (2) 𝐸(𝑌) = (−4) (8) + (2) (8) + (7) (4)
1 5 3 3 7
𝐸(𝑋) = + 𝐸(𝑌) = − + +
2 2 2 4 4
Thus, 𝐸(𝑋) = 𝜇𝑋 = 3 𝐸(𝑌) = 𝜇𝑌 = 1
ii) 𝐸(𝑋𝑌) = ∑𝑖,𝑗 𝑥𝑖 𝑦𝑗 𝐽𝑖𝑗
1 1 1 1 1 1
𝐸(𝑋𝑌) = (1)(−4) ( ) + (1)(2) ( ) + (1)(7) ( ) + (5)(−4) ( ) + (5)(2) ( ) + (5)(7) ( )
8 4 8 4 8 8
1 1 7 5 35
𝐸 (𝑋𝑌) = − + + − 5 + +
4 2 8 4 8
3
Thus, 𝐸(𝑋𝑌) = 2
𝜎𝑋 = 2 ; 𝜎𝑌 = 4.33
iv) 𝐶𝑜𝑣(𝑋, 𝑌) = 𝐸(𝑋𝑌) − 𝐸(𝑋)𝐸(𝑌)
3
𝐶𝑜𝑣(𝑋, 𝑌) = 2 − (3)(1)
3
𝐶𝑜𝑣(𝑋, 𝑌) = −
2
𝐶𝑜𝑣(𝑋,𝑌)
v) 𝜌(𝑋, 𝑌) = 𝜎𝑋 𝜎𝑌
3
−
2
𝜌(𝑋, 𝑌) = (2)(4.33)
𝜌(𝑋, 𝑌) = −0.1732
43
2. The joint probability distribution table for two random variables 𝑋 and 𝑌 is as follows
𝑌→ -2 -1 4 5
𝑋↓
𝑥𝑖 1 2
Marginal distribution of 𝑌
𝑦𝑗 -2 -1 4 5
𝐸(𝑋𝑌) = ∑ 𝑥𝑖 𝑦𝑗 𝐽𝑖𝑗
𝑖,𝑗
𝐸(𝑋 2 ) = (12 )(0.6) + (22 )(0.4) 𝐸(𝑌 2 ) = ((−2)2 )(0.3) + ((−1)2 )(0.3) + (42 )(0.1) + (52 )(0.3)
44
𝐸(𝑋 2 ) = 2.2 𝐸(𝑌 2 ) = 10.6
∴ 𝜎𝑋2 = 2.2 − (1.4)2 ; 𝜎𝑌2 = 10.6 − 12
𝜎𝑋2 = 0.24 ; 𝜎𝑌2 = 9.6
𝜎𝑋 = 0.49 ; 𝜎𝑌 = 3.1
iii) 𝐶𝑜𝑣(𝑋, 𝑌) = 𝐸(𝑋𝑌) − 𝐸(𝑋)𝐸(𝑌)
𝐶𝑜𝑣(𝑋, 𝑌) = 0.9 − (1.4)(1)
𝐶𝑜𝑣(𝑋, 𝑌) = −0.5
𝐶𝑜𝑣(𝑋,𝑌)
iv) 𝜌(𝑋, 𝑌) = 𝜎𝑋 𝜎𝑌
−0.5
𝜌(𝑋, 𝑌) = (0.49)(3.1)
𝜌(𝑋, 𝑌) = −0.3
If 𝑋 and 𝑌 are independent random variables we must have 𝑓(𝑥𝑖 ). 𝑔(𝑦𝑗 ) = 𝐽𝑖𝑗
It can be seen that 𝑓(𝑥1 ). 𝑔(𝑦1 ) = (0.6)(0.3) = 0.18 and 𝐽11 = 0.1 i.e., 𝑓(𝑥𝑖 ). 𝑔(𝑦𝑗 ) ≠ 𝐽𝑖𝑗
45
3. Suppose 𝑋 and 𝑌 are independent random variables with the following respective
distribution, find the joint distribution of 𝑋 and 𝑌. Also verify 𝐶𝑜𝑣(𝑋, 𝑌) = 0.
𝑥𝑖 1 2
𝑦𝑗 -2 5 8
Sol: Since 𝑋 and 𝑌 are independent, the joint distribution J(𝑥, 𝑦) is obtained by using the
definition 𝑓(𝑥𝑖 ). 𝑔(𝑦𝑗 ) = 𝐽𝑖𝑗 .
𝑌→ -2 5 8 𝑓(𝑥𝑖 )
𝑋↓
46
Thus, 𝐸(𝑋) = 𝜇𝑋 = 1.3 𝐸(𝑌) = 𝜇𝑌 = 3.5
𝐸(𝑋𝑌) = ∑ 𝑥𝑖 𝑦𝑗 𝐽𝑖𝑗
𝑖,𝑗
𝑥𝑖 2 5 7
𝑓(𝑥𝑖 ) 1⁄ 1⁄ 1⁄
2 4 4
𝑦𝑗 3 4 5
𝑔(𝑦𝑗 ) 1⁄ 1⁄ 1⁄
3 3 3
𝐽11 = 𝑓(𝑥1 ). 𝑔(𝑦1 ) = 1⁄2 . 1⁄3 = 1⁄6 𝐽12 = 𝑓(𝑥1 ). 𝑔(𝑦2 ) = 1⁄2 . 1⁄3 = 1⁄6
𝐽13 = 𝑓(𝑥1 ). 𝑔(𝑦3 ) = 1⁄2 . 1⁄3 = 1⁄6 𝐽21 = 𝑓(𝑥2 ). 𝑔(𝑦1 ) = 1⁄4 . 1⁄3 = 1⁄12
𝐽22 = 𝑓(𝑥2 ). 𝑔(𝑦2 ) = 1⁄4 . 1⁄3 = 1⁄12 𝐽23 = 𝑓(𝑥2 ). 𝑔(𝑦3 ) = 1⁄4 . 1⁄3 = 1⁄12
𝐽31 = 𝑓(𝑥3 ). 𝑔(𝑦1 ) = 1⁄4 . 1⁄3 = 1⁄12 𝐽32 = 𝑓(𝑥3 ). 𝑔(𝑦2 ) = 1⁄4 . 1⁄3 = 1⁄12
47
The joint distribution table is as follows
𝑌→ 3 4 5 𝑓(𝑥𝑖 )
𝑋↓
2 1⁄ 1⁄ 1⁄ 1⁄
6 6 6 2
5 1⁄ 1⁄ 1⁄ 1⁄
12 12 12 4
7 1⁄ 1⁄ 1⁄ 1⁄
12 12 12 4
𝑔(𝑦𝑗 ) 1⁄ 1⁄ 1⁄ 1
3 3 3
𝐸(𝑋) = 4 𝐸(𝑌) = 4
Thus, 𝐸(𝑋) = 𝜇𝑋 = 4 𝐸(𝑌) = 𝜇𝑌 = 4
𝐸(𝑋𝑌) = ∑ 𝑥𝑖 𝑦𝑗 𝐽𝑖𝑗
𝑖,𝑗
𝐸(𝑋𝑌) = (2)(3)(1⁄6) + (2)(4)(1⁄6) + (2)(5)(1⁄6) + (5)(3)(1⁄12) + (5)(4)(1⁄12) + (5)(5)(1⁄12) + (7)(3)(1⁄12) + (7)(4)(1⁄12) + (7)(5)(1⁄12)
𝐸(𝑋𝑌) = 16
Thus, 𝐸(𝑋𝑌) = 16
Thus, 𝐶𝑜𝑣(𝑋, 𝑌) = 16 − (4)(4) = 0
iii) Z = 𝑋 + 𝑌
Let 𝑧𝑖 = 𝑥𝑖 + 𝑦𝑖 and hence {𝑧𝑖 } = {5,6,7,8,9,10,11,12}
The corresponding probabilities are ,
1⁄ , 1⁄ , 1⁄ , 1⁄ , 1⁄ , 1⁄ + 1⁄ , 1⁄ , 1⁄
6 6 6 12 12 12 12 12 12
The probability distribution of Z = 𝑋 + 𝑌 is as follows
Z 5 6 7 8 9 10 11 12
P(Z) 1⁄ 1⁄ 1⁄ 1⁄ 1⁄ 1⁄ + 1⁄ = 1⁄ 1⁄ 1⁄
6 6 6 12 12 12 12 6 12 12
48
5. The joint probability distribution of two discrete random variables 𝑋 and 𝑌 is given by
𝑓(𝑥, 𝑦) = 𝑘(2𝑥 + 𝑦) where 𝑥 and 𝑦 are integers such that 0 ≤ 𝑥 ≤ 2 0 ≤ 𝑦 ≤ 3.
i) Find the value of the constant k.
ii) Find the marginal probability distributions of 𝑋 and 𝑌.
iii) Show that the random variables 𝑋 and 𝑌 are dependent.
iv) Compute E(𝑋)𝐸(𝑌), 𝐸(𝑋𝑌)
v) Compute E(𝑋 2 ), 𝐸(𝑌 2 )
vi) Compute 𝜎𝑋 , 𝜎𝑌
vii) Find P(𝑋 = 1, 𝑌 = 2) , P(𝑋 = 2, 𝑌 = 1)
viii) Find P(𝑋 ≥ 1, 𝑌 ≤ 2) , P(𝑋 + 𝑌 > 2)
𝑓(𝑥, 𝑦) = 𝑘(2𝑥 + 𝑦) and the joint probability distribution table is formed as follows
𝑌→ 0 1 2 3 Sum
𝑋↓
0 0 𝑘 2𝑘 3𝑘 6𝑘
1 2𝑘 3𝑘 4𝑘 5𝑘 14𝑘
2 4𝑘 5𝑘 6𝑘 7𝑘 22𝑘
𝒙𝒊 0 1 2
𝒚𝒋 0 1 2 3
49
iv) 𝐸(𝑋) = ∑𝑖 𝑥𝑖 𝑓(𝑥𝑖 ) 𝐸(𝑌) = ∑𝑗 𝑦𝑗 𝑔(𝑦𝑗 )
𝐸(𝑋) = (0)(6⁄42) + (1)(14⁄42) + (2)(22⁄42) 𝐸(𝑌) = (0)(6⁄42) + (1)(9⁄42) + (2)(12⁄42) + (3)(15⁄42)
𝐸(𝑋𝑌) = ∑ 𝑥𝑖 𝑦𝑗 𝐽𝑖𝑗
𝑖,𝑗
102 192
𝐸(𝑋 2 ) = 𝐸(𝑌 2 ) =
42 42
𝟏𝟕 𝟑𝟐
Thus, 𝑬(𝑿𝟐 ) = 𝑬(𝒀𝟐 ) =
𝟕 𝟕
17 29 2 32 13 2
𝜎𝑋2 = −( ) 𝜎𝑌2 = −( )
7 21 7 7
230 495
𝜎𝑋2 = 441 𝜎𝑌2 = 441
√230 √495
𝜎𝑋 = 𝜎𝑌 =
21 21
Thus, 𝝈𝑿 = 𝟎. 𝟕𝟐 and 𝝈𝒀 = 𝟏. 𝟎𝟔
vii) Let 𝑋 = {𝑥𝑖 } = {𝑥1 , 𝑥2 , 𝑥3 } = {0,1,2} respectively.
50
P(𝑋 = 2, 𝑌 = 1) = 𝑓(𝑥3 , 𝑦2 ) = 𝐽32 = 5⁄42
viii) (𝑋, 𝑌) = {(1,0), (1,1), (1,2), (2,0), (2,1), (2,2)} = {(𝑥2 , 𝑦1 ), (𝑥2 , 𝑦2 ), (𝑥2 , 𝑦3 ), (𝑥3 , 𝑦1 ), (𝑥3 , 𝑦2 ), (𝑥3 , 𝑦3 )}
P(𝑋 ≥ 1, 𝑌 ≤ 2) = 𝐽21 +𝐽22 +𝐽23 +𝐽31 +𝐽32 +𝐽33
2 3 4 4 5 6 24
P(𝑋 ≥ 1, 𝑌 ≤ 2) = 42 + 42 + 42 + 42 + 42 + 42 = 42
𝟒
𝑷(𝑿 ≥ 𝟏, 𝒀 ≤ 𝟐) = 𝟕
6. A fair coin is tossed thrice. The random variables 𝑋 𝑎𝑛𝑑 𝑌 are defined as follows.
𝑋 = 0 𝑜𝑟 1 according as head or tail occurs on the first toss. 𝑌 = 𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 ℎ𝑒𝑎𝑑𝑠
i) Determine the distributions of 𝑋 𝑎𝑛𝑑 𝑌.
ii) Determine the joint distribution of 𝑋 𝑎𝑛𝑑 𝑌.
iii) Obtain the expectations of 𝑋, 𝑌 𝑎𝑛𝑑 𝑋𝑌. Also find S.D’s of 𝑋 𝑎𝑛𝑑 𝑌.
iv) Compute the covariance and correlation of 𝑋 𝑎𝑛𝑑 𝑌.
Sol: The sample space 𝑆 and the association of random variables 𝑋 𝑎𝑛𝑑 𝑌 is given by the
following table
𝑋 0 0 0 0 1 1 1 1
𝑌 3 2 2 1 2 1 1 0
1 3 3 1
𝑃(𝑌 = 0) = 8 ; 𝑃(𝑌 = 1) = 8 ; 𝑃(𝑌 = 2) = 8 ; 𝑃(𝑌 = 3) = 8
51
𝑥𝑖 0 2
𝑓(𝑥𝑖 ) 1 1
2 2
𝑦𝑗 0 1 2 3
𝑔(𝑦𝑗 ) 1 3 3 1
8 8 8 8
ii) The joint distribution of 𝑋 𝑎𝑛𝑑 𝑌 is found by computing 𝐽𝑖𝑗 = 𝑃(𝑋 = 𝑥𝑖 , 𝑌 = 𝑦𝑗 ) where
we have 𝑥1 = 0, 𝑥2 = 1 and 𝑦1 = 0, 𝑦2 = 1, 𝑦3 = 2, 𝑦4 = 3
𝐽11 = 𝑃(𝑋 = 0, 𝑌 = 0) = 0 (𝑋 = 0 implies that there is a head turn out and 𝑌 the total
number of heads is 0, outcome is impossible)
1 2 1 1
𝐽12 = 𝑃(𝑋 = 0, 𝑌 = 1) = 𝐽13 = 𝑃(𝑋 = 0, 𝑌 = 2) = = 𝐽14 = 𝑃(𝑋 = 0, 𝑌 = 3) =
8 8 4 8
(Corresponding to the out come HTT) (out comes are HHT and HTH) (out come is HHH)
1
𝐽21 = 𝑃(𝑋 = 1, 𝑌 = 0) = 8 ; out come is TTT
2 1
𝐽22 = 𝑃(𝑋 = 1, 𝑌 = 1) = 8 = 4 ; out come is THT, TTH
1
𝐽23 = 𝑃(𝑋 = 1, 𝑌 = 2) = 8 ; out come is THH
𝒀→ 0 1 2 3 Sum
𝑿↓
0 0 1 1 1 1
8 4 8 2
1 1 1 1 0 1
8 4 8 2
Sum 1 3 3 1 1
8 8 8 8
52
𝐸(𝑋) = 1⁄2 𝐸(𝑌) = 12⁄8
𝟏
𝑬(𝑿𝟐 ) = 𝟐 𝑬(𝒀𝟐 ) = 𝟑
1 1 2 3 2
𝜎𝑋2 = 2 − (2) 𝜎𝑌2 = 3 − (2)
1 3
𝜎𝑋2 = 4 𝜎𝑌2 = 4
𝟏 √𝟑
𝝈𝑿 = 𝟐 𝝈𝒀 = 𝟐
𝟏
−
𝟒
𝜌(𝑋, 𝑌) = 𝟏 √𝟑
( )( )
𝟐 𝟐
𝟏
𝝆(𝑿, 𝒀) = −
√𝟑
𝑥𝑖 -1 -2 1 2
𝑓(𝑥𝑖 ) 1 1 1 1
4 4 4 4
53
Determine i) the distribution g of Y ii) the joint distribution of 𝑋 𝑎𝑛𝑑 𝑌
iii) E(𝑋), 𝐸(𝑌), 𝐸(𝑋𝑌) iv) Cov(𝑋, 𝑌) v) 𝜌(𝑋, 𝑌)
Sol: Here 𝑌 is defined to be 𝑋 2 .
The distribution g of Y is given by
𝑦𝑗 1 4
𝑔(𝑦𝑗 ) 1 1
2 2
𝒀→ 1 4
𝑿↓
-2 0 1
4
-1 1 0
4
1 1 0
4
2 0 1
4
Thus, 𝑬(𝑿𝒀) = 𝟎
Also, 𝜎𝑋2 = 𝐸(𝑋 2 ) − 𝜇𝑋2 𝜎𝑌2 = 𝐸(𝑌 2 ) − 𝜇𝑌2
Now, 𝐸(𝑋 2 ) = ∑𝑖 𝑥𝑖2 𝑓(𝑥𝑖 ) 𝐸(𝑌 2 ) = ∑𝑗 𝑦𝑗2 𝑓(𝑦𝑗 )
54
1 1 1 1 1 1
𝐸(𝑋 2 ) = ((−1)2 ) ( ) + ((−2)2 ) ( ) + ((1)2 ) ( ) + ((2)2 ) ( ) 𝐸(𝑌 2 ) = (12 ) ( ) + (42 ) ( )
4 4 4 4 2 2
𝟓 17
𝑬(𝑿𝟐 ) = 𝟐 𝑬(𝒀𝟐 ) = 2
𝐶𝑜𝑣(𝑋, 𝑌) = 0 − 0(5⁄2)
𝑪𝒐𝒗(𝑿, 𝒀) = 𝟎
𝐶𝑜𝑣(𝑋,𝑌)
v) 𝜌(𝑋, 𝑌) = 𝜎𝑋 𝜎𝑌
0
𝜌(𝑋, 𝑌) = (1.581)(1.658)
𝝆(𝑿, 𝒀) = 𝟎
8. The joint probability function of two discrete random variables 𝑋 and 𝑌 is given by
𝑓(𝑥, 𝑦) = 𝑐𝑥𝑦 for 𝑥 = 1,2,3 and 𝑦 = 1,2,3. Find the following
i) Constant 𝑐 ii) 𝑃(𝑋 = 2, 𝑌 = 3) iii) 𝑃(1 ≤ 𝑋 ≤ 2, 𝑌 ≤ 2) iv) 𝑃(𝑋 ≥ 2) v) 𝑃(𝑌 < 2)
vi) 𝑃(𝑋 = 1) v) 𝑃(𝑌 = 3)
55
MODULE-3&4
STATISTICAL INFERENCE-1&2
Sampling Theory
Random Sampling
A large collection individuals or attributes or numerical data can be understood as a
population or universe.
A finite subset of the universe is called a sample. The number of individuals in a sample is
called a sample size(n).
If 𝑛 ≤ 30 then the sample is said to be small otherwise it is a large sample.
The selection of an individual or item from the population in such a way that each has the
same chance of being selected is called as random sampling.
Sampling where a member of the population may be selected more than once is called as
sampling with replacement.
If a member cannot be chosen more than once is called as sampling without replacement.
Parameter and Statistics
The statistical constants of population is mean(𝜇) , standard deviation (𝜎), correlation (𝜌)
are called the parameter.
The statistical constants drawn from the given population is mean(𝑥̅ ) , standard deviation
(𝑠), correlation (𝑟) are called statistic.
Testing of Hypothesis
To reach the decisions about the populations and the basis of sample information, we
make certain assumptions about the population involved. Such assumptions which may or
may not be true are called hypothesis. OR Quantitative statement about the population.
Explanation on the basis of limited evidences.
Null Hypothesis
The hypothesis formulated for the purpose of its rejection under the assumption that it is
true is called the Null Hypothesis denoted by 𝐻0 .
Any hypothesis which is complimentary to the null hypothesis is called Alternative
Hypothesis denoted by 𝐻1 .
Type-I and Type-II Errors
Type-I Error: Wrong decision to reject the null hypothesis when it is actually true.
Type-II Error: Wrong decision to accept the null hypothesis when it is actually false.
56
Accepting the Rejecting the
hypothesis hypothesis
Significance level
The probability level, below which leads to the rejection of the hypothesis is known as the
significance level. The probability levels are fixed at 0.05 or 0.01 being 5% or 1%. These are
called Significance level.
Test of Significance
The process which enables us to decide about the acceptance or rejection of the hypothesis
is called the test of significance.
Confidence Intervals
Let us suppose that we have a normal population with mean 𝜇 and S.D 𝜎 . If 𝑥̅ is the
sample
𝑥̅ −𝜇
mean of a random sample size 𝑛 the quantity 𝑧 defined by 𝑧 = 𝜎 is called the Standard
( )
√𝑛
normal variate.
From the normal distribution table we find the 95% of the area lies between 𝑧 = −1.96
and 𝑧 = +1.96 i.e., 95% confidence interval lies between −1.96 and +1.96 .
Further 5% level of significance is denoted by 𝑧0.05 .Mathematically we can write
𝑥̅ − 𝜇
−1.96 ≤ ≤ 1.96
𝜎
( )
√𝑛
𝜎 𝜎
− (1.96) ≤ 𝑥̅ − 𝜇 ≤ (1.96)
√𝑛 √𝑛
𝜎 𝜎
𝜇 ≤ 𝑥̅ + (1.96) and 𝑥̅ − (1.96) ≤ 𝜇
√𝑛 √𝑛
57
Similarly, from the table of normal areas 99% of the area lies between -2.58 and 2.58.
This is equivalent to the form,
𝜎 𝜎
𝑥̅ − 2.58 ( 𝑛) ≤ 𝜇 ≤ 𝑥̅ + 2.58 ( 𝑛) --- (2)
√ √
We can say that (1) is the 95% confidence interval and (2) is the 99% confidence interval.
Critical Value of 𝒛
Test
5% level 1% level
58
Two-tailed test -1.96 or 1.96 -2.58 or 2.58
2. If |𝑍| > 2.58, we conclude that the differences is highly significant and reject the
𝑝𝑞
hypothesis .Probable limits = 𝑝 ± 2.58√ 𝑛
3. If Z < 1.96, difference between the observed and expected number of successes is not
significant.
59
𝑧 =2.53<2.58
Thus, we can say that the coin is unbiased.
2. A sample of 900 days was taken in a coastal town and it was found that on 100 days the
weather was very hot. Obtain the probable limits of the percentage of very hot weather.
100 1
Sol: Probability of very hot weather , 𝑝 = 900 = 9
8
∴ 𝑞=9 , 𝑛 = 900
𝑝𝑞
Probable limits = 𝑝 ± (2.58)√ 𝑛
1 1 8 1
= 9 ± (2.58)√9 . 9 . 900
= 0.111 ± 0.027
= 0.084 𝑎𝑛𝑑 0.138
Probability limits of very hot weather is 8.4% to 13.8%
3. In a sample of 500 men it was found that 60% of them had over weight. What can we
infer about the proportion of people having over weight in the population?
60
Sol: Probability of persons having over weight is , 𝑝 = 100 = 0.6
∴ 𝑞 = 1 − 𝑝 = 0.4 , 𝑛 = 500
𝑝𝑞
Probable limits = 𝑝 ± (2.58)√ 𝑛
(0.6)(0.4)
= 0.6 ± (2.58)√ 500
= 0.6 ± 0.0565
Probable limits = 0. 5435 𝑎𝑛𝑑 0.6565
Thus, the probable limits of people having over weight is 54.35% and 65.65%
4. A survey was conducted in a slum locality of 2000 families by selecting a sample size
800. It was revealed that 180 families were illiterates. Find the probable limits of the
illiterate families in the population of 2000.
180
Sol: Probability of illiterate families is , 𝑝 = 800 = 0.225
∴ 𝑞 = 1 − 𝑝 = 0.775 , 𝑛 = 800
𝑝𝑞
Probable limits = 𝑝 ± (2.58)√ 𝑛
60
(0.225)(0.775)
= 0.225 ± (2.58)√ 800
= 0.225 ± 0.038
Probable limits = 0. 187 𝑎𝑛𝑑 0.263
The probable limits of the illiterate families in the population of 2000
= 0. 187 ∗ 2000 𝑎𝑛𝑑 0.263 ∗ 2000
= 374 𝑎𝑛𝑑 526
Thus, 374 to 526 are probably illiterate families.
5. In 324 throws of a six faced ‘die’, an odd number turned up 181 times. Is it reasonable to
think that the ‘die’ is an unbiased one?
Sol: Probability of the turn up of an odd number , p = 3/6= ½
Since p + q = 1, q= ½
Expected number of successes = np
= 324 × ½ = 162
Actual Number of heads = 181 = 𝑥
Now 𝑥 − np = 181 − 162 = 19
𝑥−𝜇 𝑥−𝑛𝑝 19 19
Consider z = = = =
√ 𝑛𝑝𝑞 √𝑛𝑝𝑞 √324𝑋0.5𝑋0.5 9
𝑧 =2.11<2.58
Thus, we can say that the die is unbiased.
6. A sample of 100 days is taken from meteorological records of a certain district and 10 of
them are found to be foggy. What are the probable limits of the percentage of foggy days
in the district.
10
Sol: Proportion of foggy days in a sample of 100 days is , 𝑝 = 100 = 0.1
∴ 𝑞 = 1 − 𝑝 = 0.9 , 𝑛 = 100
𝑝𝑞
Probable limits = 𝑝 ± (2.58)√ 𝑛
(0.1)(0.9)
= 0.1 ± (2.58)√ 100
= 0.1 ± 0.0774
Probable limits = 0. 0226 𝑎𝑛𝑑 0.1774
61
Thus, the percentage of foggy days lies between 2.26% and 17.74%
7. A random sample of 500 apples was taken from a large consignment and 65 were found
to be bad. Estimate the proportion of bad apples in the consignment as well as the
standard error of the estimate. Also find the percentage of bad apples in the consignment.
65
Sol: Proportion of bad apples in the sample is given by , 𝑝 = 500 = 0.13
∴ 𝑞 = 1 − 𝑝 = 0.87 , 𝑛 = 500
𝑝𝑞 (0.13)(0.87)
S.E proportion of bad apples = √ 𝑛 = √ = 0.015
500
𝑝𝑞
Probable limits = 𝑝 ± (2.58)√
𝑛
= 0.13 ± (2.58)(0.015)
= 0.13 ± 0.0387
Probable limits= 0.0913 𝑎𝑛𝑑 0.1687
= 9.13% 𝑎𝑛𝑑 16.87%
Thus, the required percentage of bad apples in the consignment lies between 9.13 and 16.87.
8. The mean and standard deviation of the maximum loads supported by 60 cables are 11.09
tonnes and 0.73 tonnes respectively. Find a) 95% b) 99% confidence limits for mean of
the maximum loads of all cables produced by the company.
Sol: By data 𝑥̅ = 11.09 , 𝜎 = 0.73 , n = 60
a) 95% confidence limits for mean of the maximum loads are given by
𝜎 0.73
𝑥̅ ± 1.96 ( 𝑛) = 11.09 ± (1.96) ( )
√ √60
= 11.09 ± 0.18
= 10.91 𝑎𝑛𝑑 11.27
Thus,10.91 tonnes to 11.27 tonnes are the 95% confidence limits for the mean of the
maximum loads.
b) 99% confidence limits for mean of the maximum loads are given by
𝜎 0.73
𝑥̅ ± 2.58 ( 𝑛) = 11.09 ± (2.58) ( )
√ √60
= 11.09 ± 0.24
= 10.85 𝑎𝑛𝑑 11.33
Thus, 10.85 tonnes to 11.33 tonnes are the 95% confidence limits for the mean of the
maximum loads.
62
9. An unbiased coin is thrown n times. It is desired that the relative frequency of the
appearance of heads should lie between 0.49 and 0.51. Find the smallest value of n that
will ensure this result with a) 95% confidence b) 99% confidence
1
Sol: Probability of getting a head, 𝑝 = 2
1
𝑞=2
𝑝𝑞
S.E proportion of heads = √ 𝑛
11
. 1
S.E proportion of heads = √2 2 = 𝑛 2√𝑛
𝑝𝑞
a) Probable limits for 95% confidence level is given by : 𝑝 ± (1.96)√ 𝑛 , which should lie
𝑝𝑞
𝑝 ± (1.96)√ 𝑛 = 0.51 𝑜𝑟 0.49
1
0.5 ± (1.96) 2 = 0.51 𝑜𝑟 0.49
√𝑛
1.96
0.5 ± 2 = 0.51 𝑜𝑟 0.49
√𝑛
1.96 1.96
0.5 + 2 = 0.51 0.5 − 2 = 0.49
√ 𝑛 √𝑛
1.96 1.96
= 0.51 − 0.5 = 0.5 − 0.49
2√𝑛 2√𝑛
1.96 1.96
= 0.01 = 0.01
2√𝑛 2 √𝑛
1.96
√𝑛 = 0.02
√𝑛 = 98
Thus, 𝐧 = 𝟗𝟔𝟎𝟒
b) Taking the confidence coefficient equal to 1.645 for 90% confidence level we have as
before
1.645
= 0.01
2√𝑛
1.645
√𝑛 =
0.02
√𝑛 = 82.25
Thus, 𝒏 = 𝟔𝟕𝟔𝟓
63
Test of significance of a sample mean
1. A manufacturer claimed that atleast 95% of the equipment which he supplied to a factory
conformed to specifications. An examination of a sample of 200 pieces of equipment
revealed that 18 of them were faulty. Test his claim at a significance level of 1% and 5%.
Sol: Let 𝑝 be the probability of success which being the probability of the equipment
supplied to the factory conformal to the specifications.
∴ By data 𝑝 = 0.95 and hence 𝑞 = 0.05
𝐻0 : 𝑝 = 0.95 and the claim is correct.
𝐻1 : 𝑝 < 0.95 and the claim is false.
We choose the one tailed test to determine whether the supply is conformal to the
specification.
𝜇 = 𝑛𝑝 = 200 ∗ 0.95 = 190
𝒁 = 𝟐. 𝟔
The value of 𝑍 is greater than the critical value 1.645 at 5% level and 2.33 at 1% level of
significance.
The claim of the manufacturer (null hypothesis that claim is correct) is rejected at 5% as
well as 1% level of significance in accordance with the one tailed test.
2. It has been found from experience that the mean breaking strength of a particular brand
of thread is 275.6gms with standard deviation of 39.7gms. Recently a sample of 36 pieces
of thread showed a mean breaking strength 0f 253.2gms. Can one conclude at a
significance level of a) 0.05 b) 0.01 that the thread has become inferior ?
Sol: We have to decide between the two hypothesis,
𝐻0 : 𝜇 = 275.6 𝑔𝑚𝑠 , mean breaking strength.
𝐻1 : 𝑝 < 275.6 𝑔𝑚𝑠 , inferior in breaking strength.
We choose the one tailed test.
Mean breaking strength of a sample of 36 pieces=253.2
64
∴ Difference =275.6 – 253.2=22.4 ; n = 36
𝐷𝑖𝑓𝑓𝑒𝑟𝑒𝑛𝑐𝑒
Now, 𝑍 =
(𝜎⁄ )
√𝑛
22.4
𝑍=
(39.7⁄ )
√36
𝑍 = 3.38
The value of Z is greater than the critical value of Z=1.645 at 5% level and 2.33 at 1% level
of significance.
Under the hypothesis 𝐻1 that the thread has become inferior is accepted at both 0.05 and 0.01
levels in accordance with one tailed test.
75−72
𝑍= 𝟑𝟔 𝟔𝟒
√ +
𝟑𝟔 𝟑𝟐
𝑍 = √3
𝑍 = 1.73
> 𝑍0.05 = 1.645 (𝑜𝑛𝑒 𝑡𝑎𝑖𝑙𝑒𝑑 𝑡𝑒𝑠𝑡)
𝑍 = 1.73 {
< 𝑍0.01 = 2.33 (𝑜𝑛𝑒 𝑡𝑎𝑖𝑙𝑒𝑑 𝑡𝑒𝑠𝑡)
The difference in the performance of girls and boys in the examination is significant at 5%
level but not at 1% level.
2. A sample of 100 bulbs produced by a company A showed a mean life of 1190 hours and a
standard deviation of 90 hours. Also a sample of 75 bulbs produced by a company B
showed a mean life of 1230 hours and a standard deviation of 120 hours. Is there a
difference between the mean life time of the bulbs produced by the two companies at
a) 5% level of significance b) 1% level of significance.
Sol: By data 𝑥̅𝐴 = 1190 , 𝜎𝐴 = 90, 𝑛𝐴 = 100 [Company A]
65
𝑥̅𝐵 = 1230 , 𝜎𝐵 = 120, 𝑛𝐵 = 75 [Company B]
𝑥̅ 𝐵 −𝑥̅ 𝐴
Consider, 𝑍 =
𝝈 𝟐
𝝈 𝟐
√ 𝑩+ 𝑨
𝒏𝑩 𝒏𝑨
1230 − 1190
𝑍=
𝟐 𝟐
√(𝟏𝟐𝟎) + (𝟗𝟎)
𝟕𝟓 𝟏𝟎𝟎
𝑍 = 2.42
> 𝑍0.05 = 1.96 (𝑇𝑤𝑜 𝑡𝑎𝑖𝑙𝑒𝑑 𝑡𝑒𝑠𝑡)
𝑍 = 2.42 {
< 𝑍0.01 = 2.58 (𝑇𝑤𝑜 𝑡𝑎𝑖𝑙𝑒𝑑 𝑡𝑒𝑠𝑡)
The null hypothesis that there is no difference between the mean life time of bulbs is rejected
at 5% level but not at 1% level of significance.
The null hypothesis that there is no difference between the mean life time of bulbs is rejected
at both levels of significance in a one tailed test as the respective critical values are 1.645 and
2.33.
3. The mean life of two large samples of 1000 and 2000 members are 168.75 cms and 170
cms respectively. Can this be regarded as drawn from the sample population of standard
deviation 6.25 cms ?
Sol: By data 𝑥̅1 = 168.75 , 𝑛1 = 1000
𝑥̅2 = 170 , 𝑛2 = 2000 , 𝜎 = 6.25
𝑥̅ 2 −𝑥̅ 1
Consider, 𝑍 = 𝟏 𝟏
𝜎√ +
𝒏𝟏 𝒏𝟐
170−168.75
𝑍= 𝟏 𝟏
6.25√ +
𝟏𝟎𝟎𝟎 𝟐𝟎𝟎𝟎
𝑍 = 5.16
𝑍 = 5.16 is very much greater than 𝑍0.05 = 1.96 and 𝑍0.01 = 2.58. Thus we say that the
difference between the sample means is significant and we conclude that the samples cannot
be regarded as drawn from the same population.
4. A random sample for 1000 workers in company has mean wage of Rs.50 per day and S.D
of Rs.15.Another sample of 1500 workers from another company has mean wage of Rs.45
per day and S.D of Rs.20.Does the mean rate of wages varies between the two companies?
Find the 95% confidence limits for the difference of the wages of the population of the two
companies?
Sol: By data 𝑥̅1 = 50 , 𝜎1 = 15, 𝑛1 = 1000 [Company 1]
𝑥̅2 = 45 , 𝜎2 = 20, 𝑛2 = 1500 [Company 2]
66
𝑥̅ 1 −𝑥̅ 2
Consider, 𝑍 =
𝝈 𝟐𝝈 𝟐
√ 𝟏+ 𝟐
𝒏𝟏 𝒏𝟐
50−45
𝑍= 𝟐 𝟐
√ 𝟏𝟓 + 𝟐𝟎
𝟏𝟎𝟎𝟎 𝟏𝟓𝟎𝟎
𝑍 = 7.1307
The value of 𝑍 = 7.1307 is greater than 𝑍0.05 = 1.96 and 𝑍0.01 = 2.58. Hence we can say
that the difference between the mean wages is significant both at 5% and 1% level of
significance.
Also 95% confidence limits for the difference of mean wages is given by
𝜎2 𝜎2
(𝑥̅1 − 𝑥̅2 ) ± 1.96√𝑛1 + 𝑛2 = 5 ± 1.96(0.7012)
1 2
= 5 ± 1.374
= 3.626 𝑎𝑛𝑑 6.374
Thus we can say with 95% confidence that the difference of population mean of wages
between the two companies lies between Rs.3.63 and Rs.6.37.
Student’s t distribution/test
We need to test the hypothesis, whether the sample mean x̅ differs significantly from the
population mean or hypothetical value µ.
x̅−μ
We compute, t = s
√n and consider |t|.
∑x 1
Here , x̅ = and s2 = (n−1) ∑ni=1(xi − x̅)2
n
67
𝒔
̅±
Similarly 99% confidence limits for 𝜇 is given by 𝒙 𝒕
√𝒏 𝟎.𝟎𝟏
1 𝑛1 2
Where, 𝑠 2 = 𝑛 {∑𝑖=1(𝑥𝑖 − 𝑥̅ )2 + ∑𝑛𝑗=1
2
(𝑦𝑗 − 𝑦̅) }
1 +𝑛2 −2
Problems
1. Find the student’s t for the following variable values in a sample of eight:
-4, -2, -2, 0, 2, 2, 3, 3 taking the mean of the universe to be zero.
x̅−μ
Sol: Wkt t = s
√n
1
𝑠2 = {(−4.25)2 + (−2.25)2 + (−2.25)2 + (−0.25)2 + (1.75)2 + (1.75)2 + (2.75)2 + (2.75)2 }
(8 − 1)
1
𝑠2 = (49.5)
(7)
𝑠 2 = 7.07
∴ 𝒔 = 𝟐. 𝟔𝟔
0.25−0
Thus, 𝑡 = √8
2.66
𝒕 = 𝟎. 𝟐𝟔𝟔
2. A machine is expected to produce nails of length 3 inches. A random sample of 25 nails
gave an average length of 3.1 inch with standard deviation 0.3. Can it be said that the
machine is producing nails as per specification ? (𝑡0.05 𝑓𝑜𝑟 24 𝑑. 𝑓 𝑖𝑠 2.064)
Sol: By data , we have 𝜇 = 3 , 𝑥̅ = 3.1, 𝑛 = 25, 𝑠 = 0.3
68
𝑥̅ −𝜇
𝑡= √𝑛
𝑠
3.1−3
𝑡= √25
0.3
𝒕 = 𝟏. 𝟔𝟕 < 𝟐. 𝟎𝟔𝟒
Thus, the hypothesis that the machine is producing nails as per specification is accepted at
5% level of significance.
3. Ten individuals are chosen at random from a population and their heights in inches are
found to be 63, 63, 66, 67, 68, 69, 70, 70, 71, 71. Test the hypothesis that the mean height
of the universe is 66 inches.(𝑡0.05 = 2.262 𝑓𝑜𝑟 9 𝑑. 𝑓).
Sol: We have 𝜇 = 66, 𝑛 = 10
∑𝑥
𝑥̅ =
𝑛
678
𝑥̅ =
10
̅ = 𝟔𝟕. 𝟖
𝒙
𝑛
2
1
𝑠 = ∑(𝑥𝑖 − 𝑥̅ )2
(𝑛 − 1)
𝑖=1
1
𝑠 2 = [(63 − 67.8)2 + (63 − 67.8)2 + ⋯ + (71 − 67.8)2 ]
9
𝑠 2 = 9.067
𝒔 = 𝟑. 𝟎𝟏𝟏
𝑥̅ −𝜇
Wkt 𝑡 = √𝑛
𝑠
(67.8−66)
𝑡= √10
3.011
𝒕 = 𝟏. 𝟖𝟗 < 𝟐. 𝟐𝟔𝟐
Thus the hypothesis is accepted at 5% level of significance.
4. A sample of 10 measurements of the diameter of a sphere gave a mean of 12 cm and a
standard deviation 0.15 cm. Find the 95% confidence limits for the actual diameter.
Sol: By data, 𝑛 = 10 , 𝑥̅ = 12, 𝑠 = 0.15
Also 𝑡0.05 = 2.262 𝑓𝑜𝑟 9 𝑑. 𝑓
𝒔
̅ ± [ ] 𝒕𝟎.𝟎𝟓
Confidence limits for the actual diameter is given by 𝒙 𝒏 √
69
𝑠 0.15
𝑥̅ ± [ ] 𝑡0.05 = 12 + [ ] (2.262) = 12 ± 0.1073
√𝑛 √10
Thus 11.893 cm to 12.107 cm is the confidence limits for the actual diameter.
5. A certain stimulus administered to each of the 12 patients resulted in the following
change in blood pressure 5, 2, 8, -1,3,0, 6, -2, 1, 5, 0, 4. Can it be concluded that the
stimulus will increase the blood pressure ? (𝑡0.05 𝑓𝑜𝑟 11 𝑑. 𝑓 𝑖𝑠 2.201)
∑𝑥
Sol: 𝑥̅ = 𝑛
31
𝑥̅ = 12
̅ = 𝟐. 𝟓𝟖𝟑𝟑
𝒙
𝑛
1
𝑠2 = ∑(𝑥𝑖 − 𝑥̅ )2
(𝑛 − 1)
𝑖=1
1
𝑠2 = [(5 − 2.58)2 + (2 − 2.58)2 + ⋯ + (4 − 2.58)2 ]
11
𝑠 2 = 9.538
𝒔 = 𝟑. 𝟎𝟖𝟖
𝑥̅ −𝜇
Wkt 𝑡 = √𝑛
𝑠
Let us suppose that the stimulus administration is not accompanied with increase in blood
pressure, we can take 𝜇 = 0.
(2.5833−0)
𝑡= √12
3.088
Boys Girls
Mean 74 70
SD 8 10
n 12 10
70
Is the difference between the means of the two groups significant at 5% level of significance
(𝑡0.05 𝑓𝑜𝑟 20 𝑑. 𝑓 𝑖𝑠 2.086).
Sol: By data , 𝑥̅ = 74, 𝑠1 = 8, 𝑛1 = 12 [Boys]
𝑦̅ = 70, 𝑠2 = 10, 𝑛2 = 10 [Girls]
𝑥̅ −𝑦̅
We have 𝑡 = 1 1
𝑠.√ +
𝑛1 𝑛2
1 𝑛1 2
Where, 𝑠 2 = 𝑛 {∑𝑖=1(𝑥𝑖 − 𝑥̅ )2 + ∑𝑛𝑗=1
2
(𝑦𝑗 − 𝑦̅) }
1 +𝑛2 −2
12(64)+10(100)
Now 𝑠 2 = 12+10−2
1768
𝑠2 = 20
𝑠 2 = 88.4
𝑠 = 9.402
74−70
Hence 𝑡 = 1 1
(9.402)√ +
12 10
𝑠 0.61
𝑥̅ ± [ 𝑛] 𝑡0.05 = 3.92 ± [ ] (2.228)
√ √11
= 3.92 ± 0.41
= 3.51 𝑎𝑛𝑑 4.33
Thus 95% confidence limits for 𝜇 are 3.51 𝑎𝑛𝑑 4.33.
8. Two types of batteries are tested for their length of life and the following results were
obtained.
71
Battery A : 𝑛1 = 10, 𝑥̅1 = 500 ℎ𝑟𝑠, 𝑠12 = 100
Battery B : 𝑛2 = 10, 𝑥̅2 = 500 ℎ𝑟𝑠, 𝑠12 = 121
Compute student’s t and test whether there is a significant difference in the two means.
Sol: By data Battery A : 𝑛1 = 10, 𝑥̅1 = 500 ℎ𝑟𝑠, 𝑠12 = 100
Battery B : 𝑛2 = 10, 𝑥̅2 = 560 ℎ𝑟𝑠, 𝑠12 = 121
𝑛1 𝑠12 +𝑛2 𝑠12
Now, 𝑠 2 = 𝑛1 +𝑛2 −2
(10∗100) +(10∗121)
𝑠2 = 10+10−2
𝑠 2 = 122.78
𝒔 = 𝟏𝟏. 𝟎𝟖𝟎𝟔
𝑥̅ 2 −𝑥̅ 1
We have 𝑡 = 1 1
𝑠.√ +
𝑛1 𝑛2
60
𝑡= 1 1
(11.0806)√ +
10 10
9. A group of 10 boys fed on a diet A and another group of 8 boys fed on a different diet B
for a period of 6 months recorded the following increase in weight (lbs.)
Diet A : 5 6 8 1 12 4 3 9 6 10
Diet B : 2 3 6 8 10 1 2 8
Test whether diets A and B differ significantly regarding their effect on increase in weight.
Sol: Let the variable 𝑥 correspond to the diet A and 𝑦 to the diet B.
∑𝑥 ∑𝑦
𝑥̅ = 𝑦̅ =
𝑛1 𝑛2
64 40
𝑥̅ = 10 𝑦̅ = 8
̅ = 𝟔. 𝟒
𝒙 ̅=𝟓
𝒚
2
∑𝑛𝑖=1
1
(𝑥𝑖 − 𝑥̅ )2 = 102.4 ; ∑𝑛𝑗=1
2
(𝑦𝑗 − 𝑦̅) = 82
72
𝑛1 𝑛2
1 2
𝑠2 = {∑(𝑥𝑖 − 𝑥̅ )2 + ∑(𝑦𝑗 − 𝑦̅) }
𝑛1 + 𝑛2 − 2
𝑖=1 𝑗=1
1
𝑠2 = {102.4 + 82}
16
184.4
𝑠2 = = 11.525
16
𝒔 = 𝟑. 𝟑𝟗𝟓
𝑥̅ −𝑦̅
We have 𝑡 = 1 1
𝑠.√ +
𝑛1 𝑛2
1.4
𝑡=
1 1
(3.395)√10 + 8
2
∑𝑛𝑖=1
1
(𝑥𝑖 − 𝑥̅ )2 = 31.43 ∑𝑛𝑗=1
2
(𝑦𝑗 − 𝑦̅) = 26.84
1 𝑛1 2
𝑠2 = 𝑛 {∑𝑖=1(𝑥𝑖 − 𝑥̅ )2 + ∑𝑛𝑗=1
2
(𝑦𝑗 − 𝑦̅) }
1 +𝑛2 −2
1
𝑠2 = (31.43 + 26.84)
11
𝑠 2 = 5.2973
∴ 𝒔 = 𝟐. 𝟑𝟎𝟏𝟔
𝑥̅ − 𝑦̅
We have 𝑡 = 1 1
𝑠.√ +
𝑛1 𝑛2
31.3−28.2
𝑡= 1 1
(2.3016)√ +
7 6
73
> 𝑡0.05 = 2.2
𝑡 = 2.42 = {
< 𝑡0.02 = 2.72
The discrimination between the horses is significant at 5% level but not at 2% level of
significance.
Chi-Square distribution
Chi-square distribution provides a measure of correspondence between the theoretical
frequencies and observed frequencies.
If 𝑂𝑖 (𝑖 = 1,2, … , 𝑛) and 𝐸𝑖 (𝑖 = 1,2, … , 𝑛) respectively denotes a set of observed and
estimated frequencies, the quantity chi-square denoted by 𝜒 2 is defined as follows.
(𝑶𝒊 −𝑬𝒊 )𝟐
𝝌𝟐 = ∑𝒏𝒊=𝟏 ; degrees of freedom = 𝑛 − 1
𝑬𝒊
Note: If the expected frequencies are less than 10, we group them suitably for computing
the value of chi. Square.
Chi-square test as a test of goodness of fit
It is possible to test the hypothesis about the association of two attributes. We already
discussed the fitting of Binomial distribution, Normal distribution, Poisson distribution to a
given data. It is easily possible to find the theoretical frequencies from the distribution of fit.
Chi-square test helps us to test the goodness of fit of these distributions.
If the calculated value of 𝝌𝟐 is less than the value of 𝝌𝟐 at a specified level of significance,
the hypothesis is accepted. Otherwise the hypothesis is rejected.
Problems
1.A dice is thrown 264 times and the number appearing on the face (𝒙) follows the following
frequency distribution.
𝒙 1 2 3 4 5 6
𝑓 40 32 28 58 54 60
74
Now the data is as follows:
(𝑶𝒊 − 𝑬𝒊 )𝟐
𝒏
𝝌𝟐 = ∑ 𝑬𝒊
𝒊=𝟏
2
(40 − 44)2 (32 − 44)2 (60 − 44)2
𝜒 = + + ⋯+
44 44 44
1
𝜒2 = [16 + 144 + 256 + 196 + 100 + 256]
44
968
𝜒2 =
44
Thus, 𝝌𝟐 = 𝟐𝟐
2. Five dice were thrown 96 times and the numbers 1, 2 or 3 appearing on the face of the
dice follows the frequency distribution as below.
Frequency 7 19 35 24 8 3
Test the hypothesis that the data follows a binomial distribution. (𝝌20.05 = 11.07 𝑓𝑜𝑟 5 𝑑. 𝑓)
Sol: The data gives the observed frequencies and we need to calculate the expected
frequencies.
3 1
Probability of a single dice throwing 1, 2 or 3 is 𝑝 = 6 = 2
1
∴𝑞 =1−𝑝 =2
75
We have the table of observed and expected frequencies.
𝑂𝑖 7 19 35 24 8 3
𝐸𝑖 3 15 30 30 15 3
(𝑶𝒊 − 𝑬𝒊 )𝟐
𝒏
𝝌𝟐 = ∑ 𝑬𝒊
𝒊=𝟏
16 16 25 36 49 0
𝜒2 = + + + + +
3 15 30 30 15 3
𝜒 2 = 11.7 > 𝝌20.05 = 11.07
Thus the hypothesis that the data follows a binomial distribution is rejected.
3. A sample analysis of examination results of 500 students was made. It was found that 220
students had failed, 170 had secured third class 90 had secured second class and 20 had
secured first class. Do these figures support the general examination result which is in the
ratio 4:3:2:1 for the respective categories (𝝌20.05 = 7.81 𝑓𝑜𝑟 3 𝑑. 𝑓).
Sol: Let us take the hypothesis that these figures support to the general result in the ratio
4:3:2:1
The expected frequencies in the respective category are
4 3 2 1
∗ 500, 10 ∗ 500, 10 ∗ 500, 10 ∗ 500 or 200, 150, 100, 50
10
𝑂𝑖 220 170 90 20
(𝑶𝒊 − 𝑬𝒊 )𝟐
𝒏
𝟐
𝝌 =∑ 𝑬𝒊
𝒊=𝟏
76
4. 4 coins are tossed 100 times and the following results were obtained. Fit a binomial
distribution for the data and calculate the theoretical frequencies(𝝌20.05 = 9.49 𝑓𝑜𝑟 4 𝑑. 𝑓).
Number 0 1 2 3 4
of heads
Frequency 5 29 36 25 5
Sol: Let 𝑥 denote the number of heads and 𝑓 the corresponding frequency. Since the data is
in the form of a frequency distribution we shall first calculate the mean.
∑ 𝑓𝑥 (0)(5)+(1)(29)+(2)(36)+(3)(25)+(4)(5) 196
Mean, 𝜇 = ∑𝑓
= = 100 = 1.96
5+29+36+25+5
𝑂𝑖 5 29 36 25 5
𝐸𝑖 7 26 37 24 6
(𝑶𝒊 − 𝑬𝒊 )𝟐
𝒏
𝝌𝟐 = ∑ 𝑬𝒊
𝒊=𝟏
77
4 9 1 1 1
𝜒2 = + + + +
7 26 37 24 6
2
𝜒 2 = 1.15 < 𝜒0.05 = 9.49
Thus the hypothesis that the fitness is good can be accepted.
5. Fit a poisson distribution for the following data and test the goodness of fit given that
𝝌20.05 = 7.815 𝑓𝑜𝑟 3 𝑑. 𝑓
𝑥 0 1 2 3 4
𝑓 122 60 15 2 1
Sol: Let 𝑥 denote the number of heads and 𝑓 the corresponding frequency. Since the data is
in the form of a frequency distribution we shall first calculate the mean.
∑ 𝑓𝑥 (0)(122)+(1)(60)+(2)(15)+(3)(2)+(4)(1) 100
Mean, 𝜇 = ∑𝑓
= = 200 = 0.2
122+60+15+2+1
(0.5)𝑥 𝑒 −(0.5)
𝐹(𝑥) = 200 ∗ But 𝑒 −(0.5) = 0.6065
𝑥!
(0.5)𝑥
𝐹(𝑥) = 121.3
𝑥!
𝑂𝑖 122 60 15 2+1=3
𝐸𝑖 121 61 15 3+0=3
(𝑶𝒊 − 𝑬𝒊 )𝟐
𝒏
𝟐
𝝌 =∑ 𝑬𝒊
𝒊=𝟏
1 1
𝜒 2 = 121 + 61 + 0 + 0
78
2
𝜒 2 = 0.025 < 𝜒0.05 = 7.815
Thus the hypothesis that the fitness is good can be accepted.
6. The number of accidents per day (𝑥) as recorded in a textile industry over a period of 400
days is given below. Test the goodness of fit in respect of Poisson distribution of fit to
the given data(𝝌20.05 = 9.49 𝑓𝑜𝑟 4 𝑑. 𝑓).
x 0 1 2 3 4 5
f 173 168 37 18 3 1
Sol: Let 𝑥 denote the number of heads and 𝑓 the corresponding frequency. Since the data is
in the form of a frequency distribution we shall first calculate the mean.
∑ 𝑓𝑥 (0)(173)+(1)(168)+(2)(37)+(3)(18)+(4)(3)+(5)(1) 313
Mean, 𝜇 = ∑𝑓
= = 400 = 0.7825
173+168+37+18+3+1
(0.7825)𝑥 𝑒 −(0.7825)
𝐹(𝑥) = 400 ∗ But 𝑒 −(0.7825) = 0.4573
𝑥!
(0.7825)𝑥
𝐹(𝑥) = 182.92 𝑥!
𝐹(0) = 183 , 𝐹(1) = 143, 𝐹(2) = 56, 𝐹(3) = 15, 𝐹(4) = 3, 𝐹(5) = 0
Thus the required theoretical frequencies are 183,143,56,15,3,0.
Since the last of the expected frequency is 0 we shall club it with the previous one.
We have the following table
(𝑶𝒊 − 𝑬𝒊 )𝟐
𝒏
𝟐
𝝌 =∑ 𝑬𝒊
𝒊=𝟏
79
𝜒 2 = 12.297 > 𝝌20.05 = 9.49
80
MODULE-5
DESIGN OF EXPERIMENTS &ANOVA
ANALYSIS OF VARIANCE
The analysis of variance is a powerful statistical tool for tests of significance.
WORKING PROCEDURE:
Null Hypothesis 𝐻0 : 𝑇ℎ𝑒𝑟𝑒 𝑖𝑠 𝑛𝑜 𝑠𝑖𝑔𝑛𝑖𝑓𝑖𝑐𝑎𝑛𝑐𝑒 𝑑𝑖𝑓𝑓𝑒𝑟𝑒𝑛𝑐𝑒 𝑏𝑒𝑡𝑤𝑒𝑒𝑛 … … …
𝑇 = ∑ 𝑋1 + ∑ 𝑋2 + ∑ 𝑋3 + ⋯
𝑇2
Correction factor: 𝐶. 𝐹 = 𝑁
PROBLEM 1:
The following figures relate to production in kgs. of three variables A, B, C of wheat sown on 12 plots
A 14 16 18
B 14 13 15 22
C 18 16 19 19 22
SOLUTION:
Null Hypothesis:
22 19 484 361
20 400
∑𝑋1 = 48
STEP1: To find the following
𝑁 = 𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛𝑠 = 12
𝑇 = ∑ 𝑋1 + ∑ 𝑋2 + ∑ 𝑋3 = 48 + 64 + 92 = 204
𝑇2 (204)2
Correction factor: 𝐶. 𝐹 = 𝑁
=
12
= 3468
16.8 8.4
= =
2 7.47
= 𝟖. 𝟒 = 1.13
TOTAL
RESULT:
WORKING PROCEDURE:
𝑁 = 𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛𝑠
𝑇 = ∑ 𝑋1 + ∑ 𝑋2 + ∑ 𝑋3 + ⋯
𝑇2
Correction factor: 𝐶. 𝐹 =
𝑁
(∑ 𝑌1 )2 (∑ 𝑌2 )2
𝑆𝑆𝐶 = 𝑛1
+ 𝑛2
+ ⋯ − 𝐶. 𝐹
Four doctors each test four treatments for a certain disease and observe the number of days each
patient takes to recover. The results are as follows (recovery time in days)
_____________________________________________________________________________________
Treatment
Doctor T1 T2 T3 T4
D1 10 14 19 20
D2 11 15 17 21
D3 9 12 16 19
D4 8 13 17 20
SOLUTION:
Null Hypothesis: 𝑯0 ∶
𝑇ℎ𝑒𝑟𝑒 𝑖𝑠 𝑛𝑜 𝑠𝑖𝑔𝑛𝑖𝑓𝑖𝑐𝑎𝑛𝑐𝑒 𝑑𝑖𝑓𝑓𝑒𝑟𝑒𝑛𝑐𝑒 𝑏𝑒𝑡𝑤𝑒𝑒𝑛 𝑖𝑛 𝑡ℎ𝑒 𝑝𝑒𝑟𝑓𝑜𝑚𝑒𝑛𝑐𝑒 𝑜𝑓 𝑑𝑜𝑐𝑡𝑜𝑟𝑠 𝑎𝑛𝑑 𝑡𝑟𝑒𝑎𝑡𝑚𝑒𝑛𝑡𝑠.
Alternative Hypothesis:
𝑯 𝟏 : 𝑇ℎ𝑒𝑟𝑒 𝑖𝑠 𝑠𝑖𝑔𝑛𝑖𝑓𝑖𝑐𝑎𝑛𝑐𝑒 𝑑𝑖𝑓𝑓𝑒𝑟𝑒𝑛𝑐𝑒 𝑏𝑒𝑡𝑤𝑒𝑒𝑛 𝑖𝑛 𝑡ℎ𝑒 𝑖𝑛 𝑡ℎ𝑒 𝑝𝑒𝑟𝑓𝑜𝑚𝑒𝑛𝑐𝑒 𝑜𝑓 𝑑𝑜𝑐𝑡𝑜𝑟𝑠 𝑎𝑛𝑑 𝑡𝑟𝑒𝑎𝑡𝑚𝑒𝑛𝑡𝑠
USE CODE: 15
−5 −1 4 5 ∑𝑌1 = 3 25 1 16 25
−4 0 2 6 ∑𝑌2 = 4 16 0 4 36
−6 −3 1 4 ∑𝑌3 36 9 1 16
= −4
−7 −2 2 5 ∑𝑌4 49 4 4 25
= −2
𝑁 = 𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛𝑠 = 16
𝑇 = ∑ 𝑋1 + ∑ 𝑋2 + ∑ 𝑋3 + ∑ 𝑋4
= −22 − 6 + 9 + 20 = 1
𝑇2 ( 1) 2
Correction factor: 𝐶. 𝐹 = 𝑁
=
16
= 𝟎. 𝟎𝟔𝟐𝟓
(∑ 𝑌1 )2 (∑ 𝑌2 )2 (∑ 𝑌3 )2 (∑ 𝑌4 )2
𝑆𝑆𝑅 = + + + – 𝐶. 𝐹
𝑛1 𝑛2 𝑛3 𝑛4
(3) 2 (4)2 (−4)2 (−2)2
= + + + − 0.0625
4 4 4 4
= 11.19
=
11.19 3.73
3 =
= 3.73 0.62
= 𝟔. 𝟎𝟐
ERROR SSE = 5.56 (C – 1) X(R – 1) = 9 𝑆𝑆𝐸
𝑀𝑆𝐸 =
𝑁−𝐶
5.56
=
9
= 0.62
TOTAL
RESULT:
Hence the difference between doctors is significant and that between the treatments is highly
significant.
LATIN SQUARE DESIGN:
WORKING PROCEDURE:
𝑁 = 𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛𝑠
𝑇 = ∑ 𝑋1 + ∑ 𝑋2 + ∑ 𝑋3 + ⋯
𝑇2
Correction factor: 𝐶. 𝐹 =
𝑁
(∑ 𝑋1 )2 (∑ 𝑋2 )2
𝑆𝑆𝐶 = + + ⋯ − 𝐶. 𝐹
𝑛1 𝑛2
(∑ 𝑌1 )2 (∑ 𝑌2 )2
𝑆𝑆𝑅 = 𝑛1
+ 𝑛2
+ ⋯ − 𝐶. 𝐹
(∑ 𝑍1 )2 (∑ 𝑍2 )2
𝑆𝑆𝑇 = + + ⋯ − 𝐶. 𝐹
𝑛1 𝑛2
TOTAL
PROBLEM 3:
Analysis the variance in the following Latin square of yields (in kgs ) of wheat where A, B,C,D denote
different methods of cultivation
Examine whether the different methods of cultivation have given significantly different yields.
SOLUTION:
Null Hypothesis: 𝐻0 ∶
𝑇ℎ𝑒𝑟𝑒 𝑖𝑠 𝑛𝑜 𝑠𝑖𝑔𝑛𝑖𝑓𝑖𝑐𝑎𝑛𝑐𝑒 𝑑𝑖𝑓𝑓𝑒𝑟𝑒𝑛𝑐𝑒 𝑏𝑒𝑡𝑤𝑒𝑒𝑛 𝑖𝑛 𝑡ℎ𝑒 𝐿𝑒𝑡𝑡𝑒𝑟𝑠 𝑎𝑛𝑑 𝑡ℎ𝑒 𝑑𝑖𝑓𝑓𝑒𝑟𝑒𝑛𝑡 𝑚𝑒𝑡ℎ𝑜𝑑 𝑜𝑓 𝑐𝑢𝑙𝑡𝑖𝑣𝑎𝑡𝑖𝑜𝑛.
Alternative Hypothesis:
𝐻 1 : 𝑇ℎ𝑒𝑟𝑒 𝑖𝑠 𝑠𝑖𝑔𝑛𝑖𝑓𝑖𝑐𝑎𝑛𝑐𝑒 𝑑𝑖𝑓𝑓𝑒𝑟𝑒𝑛𝑐𝑒 𝑏𝑒𝑡𝑤𝑒𝑒𝑛 𝑖𝑛 𝑡ℎ𝑒 𝑖𝑛 𝑡ℎ𝑒 𝐿𝑒𝑡𝑡𝑒𝑟𝑠 𝑎𝑛𝑑 𝑡ℎ𝑒 𝑑𝑖𝑓𝑓𝑒𝑟𝑒𝑛𝑡 𝑚𝑒𝑡ℎ𝑜𝑑 𝑜𝑓 𝑐𝑢𝑙𝑡𝑖𝑣𝑎𝑡𝑖𝑜𝑛.
USE CODE: 120
2 1 3 2 ∑𝑌1 = 8 4 1 9 4
4 3 2 5 ∑𝑌2 16 4 4 25
= 14
0 -1 0 1 ∑𝑌3 = 0 0 1 0 1
2 3 1 2 ∑𝑌4 = 8 4 9 1 4
A B C D
1 2 3 2
2 4 3 5
0 -1 1 0
2 1 2 3
𝑁 = 𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛𝑠 = 16
𝑇 = ∑ 𝑋1 + ∑ 𝑋2 + ∑ 𝑋3 + ∑ 𝑋4 = 8 + 6 + 6 + 10 = 30
𝑇2 (30)2
Correction factor: 𝐶. 𝐹 = 𝑁
=
16
= 56.25
= 24 + 15 + 14 + 34 − 56.25
= 35.75
(∑ 𝑋1 )2 (∑ 𝑋2 )2 (∑ 𝑋3 )2 (∑ 𝑋4 )2
𝑆𝑆𝐶 = 𝑛1
+ 𝑛2
+ 𝑛3
+ 𝑛4
– 𝐶. 𝐹
(∑ 𝑌1 )2 (∑ 𝑌2 )2 (∑ 𝑌3 )2 (∑ 𝑌4 )2
𝑆𝑆𝑅 = + + + – 𝐶. 𝐹
𝑛1 𝑛2 𝑛3 𝑛4
(8) 2 (14)2 (0)2 (8)2
= + + + − 56.25
4 4 4 4
= 24.75
(∑ 𝑍1 )2 (∑ 𝑍2 )2 (∑ 𝑍3 )2 (∑ 𝑍4 )2
𝑆𝑆𝑇 = + + + – 𝐶. 𝐹
𝑛1 𝑛2 𝑛3 𝑛4
(5) 2 (6)2 (9)2 (10)2
= + + + − 56.25
4 4 4 4
= 4.25
𝑆𝑆𝐸 = 𝑇𝑆𝑆 − 𝑆𝑆𝐶 − 𝑆𝑆𝑅 − 𝑆𝑆𝑇
= 35.75 − 2.75 − 24.75 − 4.25
= 4.0
STEP 3: ANOVA TABLE
= 𝟔. 𝟎𝟐
BETWEEN SST = 4.25 K–1=4–1=3 𝑆𝑆𝑇 𝑀𝑆𝑇
𝑀𝑆𝑇 = 𝐹𝑇 =
TREATMENTS 𝐾−1 𝑀𝑆𝐸
4.25
= 3
1.42
= 1.42 =
0.67
= 𝟐. 𝟏𝟏𝟗𝟒
ERROR SSE = 4.0 (K – 1) X(K – 2) = 6 𝑆𝑆𝐸
𝑀𝑆𝐸 =
𝐷. 𝐹
4.0
=
6
= 0.67
TOTAL
RESULT:
Since with respect to letters, the difference between the method of cultivation is not significant.
.
THANK YOU