Introduction to Discrete Probability Distributions (Business)

What is a Discrete Probability distribution?

The probability distribution of a discrete random variable $X$ is the set $S$ of all the possible outcomes $X$ can take and corresponding probabilities $\mathrm{P}(x)$ (or equivalently $\mathrm{P}(X=x)$) for each $x \in S$ (each value of $x$ in the set $S$). Remember, for a discrete probability distribution to be valid, the probabilities must sum to one.

Illustrative Example 1

Suppose that exactly 4 loan applications are reviewed each day by a small bank branch. Suppose further that these loans are accepted independently of each other so the probability that one of the four loans reviewed in a given day is accepted does not affect the probability that one of the other four loans is accepted. We have the following probabilities:

  • No applications are accepted with probability $0.2$,
  • $1$ application is accepted with probability $0.25$
  • $2$ applications are accepted with probability $0.3$,
  • $3$ applications are accepted with probability $0.15$ and
  • $4$ are accepted with probability $0.1$.

This is an example of a discrete probability distribution and is valid because the probabilities sum to 1 ($0.2+0.25+0.3+0.15+0.1=1$). We can present this probability distribution as a table:

|center

|center

Cumulative Probabilities

If $X$ is a random variable then the cumulative probability ($\mathrm{P}(X \leq 4)$ for example) is the probability that $X$ takes any value less than or equal to 4.

If $X$ is a discrete random variable taking only positive values, then we can obtain this cumulative probability by summing each of the probabilities $\mathrm{P}(X=j)$ for $j=0, 1, 2, 3.$. That is,

\[\mathrm{P}(X \leq 4)=\mathrm{P}(X=0)+\mathrm{P}(X=1)+\mathrm{P}(X=2)+\mathrm{P}(X=3).\] Note: Since the sum of all of the probabilities must be 1, we have for any value $j$:

\[\mathrm{P}(X \leq j)=1-\mathrm{P}(X \gt j).\]

Thus if $\mathrm{P}(X \leq j)$ is more straightforward to calculate than $\mathrm{P}(X \gt j)$ (or vice versa), we can use the above fact to obtain one from the other. We will see examples of this below.

The Expectation and Variance

The expectation $\mathrm{E}[X]$ of a discrete random variable $X$ is the sum of each of the possible outcomes multiplied by their associated probabilities.

\begin{equation} \mathrm{E}[X] = \sum\limits_{x \in S}{x \times \mathrm{P}(x)}. \end{equation}

The variance $\mathrm{Var}(X)$ of a discrete random variable $X$ is defined as:

\begin{equation} \mathrm{Var}[X] = \mathrm{E}[(X- \mathrm{E}[X])^2 ]\text{.} \end{equation}

However it can be simplified to this expression:

\begin{equation} \mathrm{Var}(X) = \mathrm{E}[X^2] - (\mathrm{E}[X])^2\text{.} \end{equation}

where $\mathrm{E}[X^2]$ is the sum of each outcome squared and multiplied by its corresponding probability:

\[\sum\limits_{x \in S}{x^2 \times \mathrm{P}(x)}\]

and $\mathrm{E}[X]^2$ is the mean, $\mathrm{E}[X]$, squared:

\[\left(\sum\limits_{x \in S}{x \times \mathrm{P}(x)}\right)^2\].

Note: We also denote $\mathrm{E}[X]$ as $\mu$.

Worked Example 1

Worked Example 1

In a game, there is a spinner with twelve equally sized segments. There are four green segments, two blue segments, one purple segment, one white segment and four black segments. If a player spins:

  • green, they must move forward $2$ spaces;
  • blue, they must move forward $1$ space;
  • purple, they must move forward $3$ spaces;
  • white, they move back $1$ space and
  • black, they do not move.

(A) What is the expected number of spaces a player will move in a turn?

(B) What is the variance of the number of spaces moved?

Solution

A) Firstly, we can think of each colour as a number: a positive number represents moving forwards in the game and a negative number represents moving backwards.

|center

|center

Now we need to calculate the associated probabilities for these values.

Let $X$ denote the number of spaces moved.

\begin{align} \mathrm{P}(X = 2) &= \dfrac{\text{Number of Green Segments}~}{\text{Total Number of Segments}~}\\ &=\dfrac{4}{12}\\ &=\dfrac{1}{3}\\ \end{align}

\begin{align} \mathrm{P}(X = 1) &= \dfrac{\text{Number of Blue Segments}~}{\text{Total Number of Segments}~}\\ &=\dfrac{2}{12}\\ &=\dfrac{1}{6}\\ \end{align}

\begin{align} \mathrm{P}(X = 3) &= \dfrac{\text{Number of Purple Segments}~}{\text{Total Number of Segments}~}\\ &=\dfrac{1}{12}\\ \end{align}

\begin{align} \mathrm{P}(X = -1) &= \dfrac{\text{Number of White Segments}~}{\text{Total Number of Segments}~}\\ &=\dfrac{1}{12}\\ \end{align}

\begin{align} \mathrm{P}(X = 0) &= \dfrac{\text{Number of Black Segments}~}{\text{Total Number of Segments}~}\\ &=\dfrac{4}{12}\\ &=\dfrac{1}{3}\\ \end{align}

Now we can put these values into a table.

|center

|center

Now we can see that we have a discrete probability distributions and the distribution is valid as the probabilities sum to one.

To calculate the expectation we use the above formula, $\mathrm{E}[X] = \sum\limits_{x \in S}{x \times P(X = x)}$ as follows:

\begin{align} \mathrm{E}[X] &= 2 \times \dfrac{1}{3} + 1 \times \dfrac{1}{6} + 3 \times \dfrac{1}{12} + (-1) \times \dfrac{1}{12} + 0 \times \dfrac{1}{3}\\ &= \dfrac{2}{3} + \dfrac{1}{6} + \dfrac{1}{4} - \dfrac{1}{12} + 0\\ &= 1\\ \end{align}

So the expected number of spaces moved in one turn is $\mathrm{E}[X]=1$.

B)

To calculate the variance we use the formula: $\mathrm{Var}(X) = \mathrm{E}[X^2] - (\mathrm{E}[X])^2$.

First we need to square each value of $x$ and multiply by the associated probabilities.

|center

|center

Thus: \begin{align} \mathrm{E}[X^2] &= 4 \times \dfrac{1}{3} + 1 \times \dfrac{1}{6} + 9 \times \dfrac{1}{12} + 1 \times \dfrac{1}{12} + 0 \times \dfrac{1}{3}\\ &= \dfrac{4}{3} + \dfrac{1}{6} + \dfrac{3}{4} + \dfrac{1}{12} + 0\\ &= \dfrac{7}{3}\\ \end{align}

From above we have $\mathrm{E}[X] = 1$, so $(\mathrm{E}[X])^2=1^2 = 1$.

The variance is thus:

\begin{align} \mathrm{Var}(X) &= \mathrm{E}[X^2] - (\mathrm{E}[X])^2\\ &=\dfrac{7}{3} - 1\\ &=\dfrac{4}{3}.\\ \end{align}

Combinations

A combination is a way of selecting a number of items from a collection of items where the order of selection is not important. Combinations can be used to calculate probabilities, which we shall demonstrate with an example.

Illustrative Example 2

Imagine you are about to be shipwrecked on a deserted island. There is some rope, a knife, a box of matches and a blanket, but you can only take two items with you before the ship sinks. How many different ways are there of choosing two items? The different ways of choosing two items are combinations (since order is not important) and are listed below:

  1. Rope & knife
  2. Rope & matches
  3. Rope & blanket
  4. Knife & matches
  5. Knife & blanket
  6. Matches & blanket.

So, there are six different ways of selecting two items.

Now imagine the items are all in unmarked identical wooden boxes, so you do not know which items you are selecting. What is the probability you have selected the knife and matches?

\begin{equation} \mathrm{P}(\text{select knife and matches}) = \dfrac{\text{Number of ways of selecting the knife and matches}}{\text{Total number of combinations}} =\dfrac{1}{6} \end{equation}

With this number of items, it is quite easy to list all the possible selections. However, for larger numbers of items listing all of the possible selections can become tedious and time consuming. Fortunately there is a much faster way of calculating such combinations. The formula is:

\begin{equation} {}^n\mathrm{C}_r=\frac{n!}{r!(n-r)!} \end{equation}

where $n!$ denotes the factorial of $n$ ($n!=n\times(n-1)\times(n-2)\times...\times1$), $n$ is the total number of items (e.g. people. numbers, objects...), $r$ is how many items we want to “choose” and $r \leq n$. We pronounce this formula as “$n$ C $r$” or “$n$ choose $r$”.

We can now use this formula to calculate how many ways there are of selecting six items from twelve as follows: \begin{align} {}^{12}\mathrm{C}_6 &=\dfrac{12!}{6!(12-6)!}\\ &=\dfrac{12\times11\times\times10\times9\times8\times7\times6\times5\times4\times3\times2\times1}{(6\times5\times4\times3\times2\times1)\times(6\times5\times4\times3\times2\times1)}\\ &=\dfrac{479001600}{720\times720}\\ &=924. \end{align}

Selecting lottery numbers is an example of choosing a combination. A person must pick $6$ numbers from $49$. With each draw there is only one winning combination. But how many combinations are possible?

There are ${}^{49}\mathrm{C}_6 = 13983816$ possible combinations or ways of picking $6$ numbers from the $49$ possible numbers where order doesn't matter. This means that the probability of winning the lottery is $\dfrac{1}{13983816}.$

Permutations

A permutation is a way of selecting a specified number of items from a collection of items where the order of selection is important. Another way of thinking about this is that it is a way of arranging a number of items.

We use the formula:

\begin{equation} {}^n\mathrm{P}_r=\frac{n!}{(n-r)!} \end{equation}

where $n!$ denotes the factorial of $n$ ($n!=n\times(n-1)\times(n-2)\times...\times1$), $n$ is the total number of items (e.g. people. numbers, objects...), $r$ is how many items we want to arrange from the $n$ items and $r \leq n$. where $n$ is the total number of things, $r$ is how many things we are choosing and $r \leq n$. As with combinations, we can also use permutations to help calculate probabilities.

Worked Example

Worked Example

Imagine you have been on the deserted island for some time. There are five survivors, including you. You decide to have a race as there is not much to do on the island. How many different outcomes are there for this race? How many different options are possible for first, second and third position?

Solution

Here we have a total of $5$ “items” (survivors). Each outcome of the race is a different arrangement or permutation of these 5 items so we can use the above formula:

\[{}^5\mathrm{P}_5 = 120\]. There are 120 different outcomes for this race.

The number of different options for the first three positions is an arrangement of $3$ items ($r$ in our formula) from a total of $5$ items ($n$ in the formula). There are thus:

\[{}^5\mathrm{P}_3 = 60\] different options for first, second and third position.

Test Yourself

To practice questions on the Binomial distribution and Poisson distribution click the following link:

Numbas test.

See Also

For information on continuous probability distributions see the pages on continuous probability distributions.