Grade 11 · Statistics 1 · Cambridge A-Level 9709 · Age 16–17
A discrete random variable (DRV) is a variable whose value is determined by a random experiment and can take only a countable set of values. In Cambridge A-Level Statistics 1 (9709), you learn to define probability distributions, calculate expected values and variance, apply linear transformations, and set up distributions from real-world contexts. These ideas underpin all further statistics work at A-Level and beyond.
Countable values x⊂i; each with probability P(X=x⊂i); all probs sum to 1
E(X) = Σ x·P(X=x) — the long-run average value
Var(X) = E(X²) − [E(X)]² — measures spread
SD(X) = √Var(X) — in the same units as X
E(aX+b) = aE(X)+b; Var(aX+b) = a²Var(X)
F(x) = P(X ≤ x) — cumulative sum of probabilities
Use ΣP=1 to find unknown constant in distribution
Game is fair iff E(profit) = 0
A random variable X is a variable whose numerical value is determined by the outcome of a random experiment. X is discrete if it can take only a countable number of distinct values x⊂1;, x⊂2;, x⊂3;, …
The probability distribution of X lists every possible value and its corresponding probability. It is usually displayed in a table:
| x | 1 | 2 | 3 | 4 |
|---|---|---|---|---|
| P(X=x) | 0.1 | 0.3 | 0.4 | 0.2 |
A table defines a valid probability distribution if and only if:
| x | 0 | 1 | 2 | 3 |
|---|---|---|---|---|
| P(X=x) | 0.2 | 0.35 | 0.3 | 0.15 |
The cumulative distribution function F(x) gives the probability that X takes a value less than or equal to x:
The CDF is a step function that increases at each value of X. Key properties:
| x | 1 | 2 | 3 | 4 |
|---|---|---|---|---|
| P(X=x) | 0.1 | 0.3 | 0.4 | 0.2 |
| F(x) | 0.1 | 0.4 | 0.8 | 1.0 |
The expected value (or mean) of a discrete random variable X is the weighted average of its values, where the weights are the probabilities:
E(X) represents the long-run average value of X if the experiment were repeated many times. It need not be a value that X can actually take.
| x | 1 | 2 | 3 | 4 |
|---|---|---|---|---|
| P(X=x) | 0.1 | 0.3 | 0.4 | 0.2 |
We frequently need E(X²) to calculate variance. Apply the same idea but square each x first:
For any function g, apply g to each value of X then weight by probabilities:
Two distributions can have the same mean but very different spreads. The variance measures how spread out the values of X are around the mean E(X). A large variance means values are widely scattered; a small variance means they cluster close to the mean.
This is the computational formula. It equals the definitional formula E[(X − μ)²] but is much easier to calculate (see Proof Bank for the derivation).
The standard deviation is in the same units as X (variance is in squared units). From the example above:
| x | −2 | 0 | 1 | 3 |
|---|---|---|---|---|
| P(X=x) | 0.25 | 0.40 | 0.20 | 0.15 |
When we scale a random variable by a constant a and shift it by b, the mean and variance change according to simple rules:
Note: adding a constant b shifts the distribution but does NOT change the spread, so b drops out of the variance formula. Multiplying by a stretches the distribution, scaling the variance by a².
If X and Y are independent random variables:
Many exam questions ask you to construct the probability distribution of X. Follow this systematic process:
| x | 0 | 1 | 2 |
|---|---|---|---|
| P(X=x) | 1/10 | 3/5 | 3/10 |
| x (profit) | −2 | 1 | 6 |
|---|---|---|---|
| P(X=x) | 1/2 | 1/3 | 1/6 |
Determine whether the following is a valid probability distribution for X, where x = 1, 2, 3, 4, 5 and P(X=x) = 0.15, 0.30, 0.25, 0.20, 0.10.
X has probability distribution: x = 0, 1, 2, 3; P(X=x) = 1/8, 3/8, 3/8, 1/8. Find E(X).
Using the distribution from Example 2 (x = 0,1,2,3; P = 1/8, 3/8, 3/8, 1/8), find Var(X) and SD(X).
X has E(X) = 4.2 and Var(X) = 2.56. Find E(2X+3) and Var(2X+3).
Two fair dice are rolled. X = |difference of the two scores|. Set up the probability distribution of X.
A spinner has 4 equal sectors labelled 1, 2, 3, 4. You pay £1.50 to spin. If it lands on 4 you win £4; on 3 you win £2; otherwise nothing. Find the expected profit.
The random variable X has P(X=x) = k(x²+1) for x = 0, 1, 2, 3. Find the value of k, then find E(X).
X has distribution: x = 1,2,3,4,5; P = 0.05, 0.20, 0.45, 0.25, 0.05. Build the CDF and find P(2 ≤ X < 4).
| Formula | Expression | Notes |
|---|---|---|
| Valid Distribution | Σ P(X=x) = 1, P(X=x) ≥ 0 | Both conditions required |
| CDF | F(x) = P(X ≤ x) = Σt ≤ x P(X=t) | Non-decreasing step function |
| P from CDF | P(X=x) = F(x) − F(x−1) | For integer-valued X |
| Range probability | P(a < X ≤ b) = F(b) − F(a) | Note strict vs non-strict inequality |
| E(X) | Σ x · P(X=x) | Weighted mean; long-run average |
| E(X²) | Σ x² · P(X=x) | NOT equal to [E(X)]² |
| E(g(X)) | Σ g(x) · P(X=x) | Apply g before weighting |
| Var(X) | E(X²) − [E(X)]² | Always ≥ 0 |
| SD(X) | √Var(X) | Same units as X |
| E(aX+b) | a·E(X) + b | Linear scaling of mean |
| Var(aX+b) | a²·Var(X) | b has no effect; a is squared |
| E(X+Y) | E(X) + E(Y) | Always true (linearity) |
| Var(X+Y) | Var(X) + Var(Y) | Only if X, Y independent |
| Var(X−Y) | Var(X) + Var(Y) | Only if X, Y independent (always add) |
| Fairness condition | E(profit) = 0 | Expected net gain = 0 |
The definitional formula for variance is Var(X) = E[(X − μ)²] where μ = E(X).
Expand the square inside the expectation:
E[(X − μ)²] = E[X² − 2μX + μ²]
Apply the linearity of expectation (E of a sum = sum of E's, constants factor out):
= E(X²) − 2μE(X) + μ²
Since μ = E(X), we substitute E(X) for μ:
= E(X²) − 2E(X)·E(X) + [E(X)]²
= E(X²) − 2[E(X)]² + [E(X)]²
= E(X²) − [E(X)]² Q.E.D.
By definition, E(aX+b) = Σ (ax+b) · P(X=x)
Split the sum using linearity of summation:
= Σ ax · P(X=x) + Σ b · P(X=x)
= a Σ x · P(X=x) + b Σ P(X=x)
Since Σ P(X=x) = 1 (valid distribution) and Σ x·P(X=x) = E(X):
= aE(X) + b·1 = aE(X) + b Q.E.D.
Let Y = aX + b. Then E(Y) = aE(X) + b (from Proof 2).
Var(Y) = E[(Y − E(Y))²]
Substitute Y = aX+b and E(Y) = aE(X)+b:
Y − E(Y) = (aX+b) − (aE(X)+b) = aX − aE(X) = a(X − E(X))
Therefore:
Var(Y) = E[{a(X − E(X))}²] = E[a²(X − E(X))²]
= a² E[(X − E(X))²]
= a² Var(X) Q.E.D.
Note: b disappears because it cancels when we subtract the mean. This is why the constant b has no effect on variance.
Enter a probability distribution (up to 5 values). Type the x-values in the top row and probabilities in the bottom row, then click Compute to see E(X), Var(X) and a bar chart.
| x | |||||
|---|---|---|---|---|---|
| P(X=x) |
These questions mirror Cambridge A-Level 9709 style. Show all working. Click "Show Mark Scheme" to reveal the answer.
The discrete random variable X has the probability distribution shown in the table.
| x | 1 | 2 | 3 | 4 | 5 |
|---|---|---|---|---|---|
| P(X=x) | 0.05 | p | 0.35 | 0.25 | 0.15 |
(a) Find the value of p. [1]
(b) Find E(X). [2]
(c) Find Var(X). [2]
The random variable X has P(X=r) = kr for r = 1, 2, 3, 4, 5.
(a) Find k. [2]
(b) Find E(X) and E(3X − 1). [2]
A bag contains 4 red balls and 2 blue balls. Three balls are drawn at random without replacement. The random variable X denotes the number of red balls drawn.
(a) Show that P(X=2) = 12/20 = 3/5. [2]
(b) Find the complete probability distribution of X. [2]
(c) Find E(X) and Var(X). [2]
X and Y are independent random variables. E(X) = 3, Var(X) = 2, E(Y) = 5, Var(Y) = 3.
Find: (a) E(2X + 3Y) [2] (b) Var(2X − Y + 1) [3]
The CDF of X is given by F(x) = 0 for x<1; F(1)=0.15; F(2)=0.40; F(3)=0.72; F(4)=0.91; F(5)=1.
(a) Write down P(X=3). [1]
(b) Find P(2 ≤ X < 5). [2]
(c) Find P(X > 3). [2]
A game costs £c to play. A fair die is rolled. Score 5 or 6: win £6. Score 3 or 4: win £2. Score 1 or 2: win nothing.
(a) Let X = profit. For c = 2.5, find the distribution of X and show E(X) = −0.5. [4]
(b) Find the value of c that makes the game fair. [2]
X has E(X) = μ and Var(X) = σ². Given that E(3X−2) = 7 and Var(3X−2) = 36, find μ and σ.
The discrete random variable X has P(X=x) = a + bx for x = 1, 2, 3, 4, where a and b are constants.
Given that E(X) = 2.6, find the values of a and b, and hence find Var(X).
These questions are representative of Cambridge 9709 S1 past papers on Discrete Random Variables. Attempt fully before revealing the solution.
The random variable X has the following probability distribution.
P(X=0) = 2k, P(X=1) = 3k, P(X=2) = 4k, P(X=3) = k.
(a) Find k. [1]
(b) Find E(X) and show that Var(X) = 0.86 (to 2 d.p.). [4]
(c) Find P(X > E(X)). [1]
Two fair dice are rolled and X is the smaller of the two scores (or the common score if they are equal).
(a) Show that P(X=1) = 11/36. [2]
(b) Given that P(X=2)=9/36, P(X=3)=7/36, P(X=4)=5/36, P(X=5)=3/36, P(X=6)=1/36, find E(X). [3]
X has E(X) = 5 and Var(X) = 4. Y = 3X − 2.
(a) Find E(Y) and Var(Y). [2]
(b) Given also that E(X²) = c, write down the value of c. [1]
(c) Z = X + Y. Find E(Z) and, assuming X and Y are independent, Var(Z). [3]
A bag contains 3 white and 5 black counters. Counters are drawn one at a time, without replacement, until a white counter is obtained or 3 counters have been drawn. X = total number of counters drawn.
(a) Find P(X=1), P(X=2), P(X=3). [4]
(b) Show that E(X) = 79/40. [2]
(c) Given that X=2, find the probability that exactly 1 counter was black. [1]
The probability distribution of X is given by P(X=x) = k(5−x) for x = 0, 1, 2, 3, 4.
(a) Find k. [1]
(b) Find E(X) and Var(X). [4]
(c) Find E(2−3X) and Var(2−3X). [2]
(d) Find P(X < E(X)). [1]