The Mathematics of Volleyball
The Mathematics of Volleyball
the game matter? It seemed like it didn't matter much, but did
it?
I decided to analyze the game mathematically. I made the About the site
simplifying assumption that each team had 50-50 odds of
Contact info and site index
winning each point. I found the analysis interesting, and it turns
out to have close ties to Pascal's Triangle, so I'm posting it
here in case anyone else is interested. Popular Posts
Volleyball games are scored using the rally point system, Reverse-
engineering
which means that one team gets a point on every serve. (Back
Ethernet
in the olden days, volleyball used side-out scoring, which backoff on
meant that only the serving team could get a point. Fortunately, the Intel
rally point scoring is more mathematically tractable. Rally point 82586 network chip's die
More interesting is if the score is 24-22, half the time the first Search This Blog
team will score a point and win, and half the time the second
Search
team will score, making the score 24-23. We know from above
that the first team has a 75% chance of winning from 24-23, so
Labels
P(24, 22) = 1/2 * 100% + 1/2 * 75% = 87.5%.
386 6502 8008 8085 8086
We can use the recurrence to work backwards and find the
8087 aerospacealto analog
probability of winning from any score. The following table Apollo apple arc arduino
shows the probability of winning for each score. The first team arm beaglebone bitcoin c#
has the score on the left, and the second team has the score cadc calculator chips css
on the top. datapoint dx7
Note that each score is the average of the score below and the
score to the right - these are the cases where the first team
gets the point and the second team gets the point. This
corresponds directly to the equation above.
The first two arguments are the current score, and the last
argument is the amount to win (25 in this case). For instance:
Mathematical analysis
Instead of computing the probabilities through dynamic
programming, it is possible to come up with a mathematical
solution. After studying the values for a while, I realized rather
surprisingly that the probabilities are closely tied to Pascal's
Triangle. You may be familiar with Pascal's Triangle, where
each element is the sum of the two elements above it (with 1's
along the edges), forming a table of binomial coefficients:
Pascal's Triangle
This can proved straighforwardly from the equation for P(x, y).
For example, if the score is 18-12, the importance is C(7+13-2,
6) / 2^(7+13-2) = 18564 / 262144 = 7.08%.
Conclusions
How useful is this model? Well, it depends on the assumption
that each team has an equal chance of winning each point. Of
course, most teams are not evenly matched. Even more
important is the fact that if a team has a good server, they can
quickly rack up 10 points in a row, which throws the model out
the window.
Python version
P.S. The code above is in Arc, an obscure language. Here's a
version of the code in Python that will be more useful:
14 comments:
human mathematics said...
Very cool! This definitely makes me more interested in arc
and in functional languages generally. I didn't know you
could solve dynamic programming problems so easily in a
functional language.
Anonymous said...
all high school coaches need to read this.
April 14, 2013 at 6:28 AM
Unknown said...
so how could you mathematically included factors such as
a mismatched team into the calculations?
this is an extremely interesting topic and i hope that you
delve more into it
June 2, 2013 at 2:32 PM
Tim Roberts said...
Wonderful article--fascinating. One question: On the
importance of next point chart--when the team on the Left
is at 25 and the team on the Top is at 23, the chart shows
25%. Shouldn't it be 0% as the team on the Left would
have already won the match?
September 24, 2015 at 12:19 PM
Unknown said...
This comment has been removed by the author.
September 9, 2016 at 8:47 AM
Ghyeb said...
Hi Mr.Shirriff, I was looking over your mathematics of
volleyball pascal's triangle and I was confused as to what
your (n) and (i) values meant in your formula following the
triangle? Would you mind explaining the significance of
them? Thanks!
September 18, 2016 at 7:41 PM
Anonymous said...
Hi! I love your ideas! one question: where did you get the
formula's from? I could not find the formula that you used
at the beginning.
October 12, 2016 at 4:52 AM
Unknown said...
Hello. I came across this after I had essentially derived its
equivalent using a cumulative binomial probability
model(Pascal's Triangle link you mention). I decided to
allow for a future point win rate assumption different than
50%, if the reader desired this to be variable. The fact that
serving win rates and receiving win rates are typically
differing by nearly 25% throws off the model in terms of
game flow (average rally lengths are less than a
cumulative binomial model would suggest, closer to 1.6
than 2.0, I believe),but regarding the end state probability
to win the game it remains very useful. Below is a link to a
newsletter I am working on and it has links to
Googlesheets which help inform the ideas within. I am
apparently a little late to the discussion, but still found
your post interesting!
https://drive.google.com/file/d/1RAc34dA6QUUbHh2-
6IOZnK3Xz-rWHOpw/view?usp=sharing
beejels said...
The $64,000 question is this...how does knowing this help
us earn a point? The concept is interesting of course. The
big assumption necessary to do any of this work is that
the teams are evenly matched, which they never are, but
that's why it's an assumption. What can we say or do in
the match to make this useful?
September 6, 2018 at 11:41 AM
Unknown said...
Hello Sir, really like your work! I would like to know how
did you come up with both the numerator and the
denominators for these two equations:
(a+b-2
a-1 ) / 2^ a+b-2