Probability and Distribution Notes
Exam Study Guide: Probability & Distributions
Course: IIM Lucknow — IPMX Batch 2026-27 — QAB-I
Topics covered: Probability, Random Variables, Binomial, Poisson, Uniform, Exponential, Normal
Also see the solved questions from the lectures here
BLOCK 1: Probability Foundations
Core Ideas in 5 Bullets
- Sample space (S) = list of all possible outcomes. Event (A) = a subset of S.
- P(A) = favorable outcomes / total outcomes, with
0 ≤ P(A) ≤ 1andP(S) = 1. - Addition rule:
P(A ∪ B) = P(A) + P(B) − P(A ∩ B). If mutually exclusive, the last term is 0. - Conditional probability:
P(A|B) = P(A ∩ B) / P(B)— probability of A given B already happened. - Multiplication rule:
P(A ∩ B) = P(A|B) · P(B). If independent:P(A ∩ B) = P(A) · P(B).
Bayes' Theorem
The only formula you need for inverse problems:
P(Eᵢ | D) = P(D | Eᵢ) · P(Eᵢ) / Σⱼ P(D | Eⱼ) · P(Eⱼ)
Plain English: Given you saw D, what's the chance it came from cause Eᵢ?
💡 Pattern recognition: Whenever a question gives you P(D | cause₁), P(D | cause₂)... and asks "given D, what's P(cause)?", it's Bayes. Always.
BLOCK 2: Random Variables + Uniform
Random Variable (RV) — Concept
A random variable X assigns a number to each outcome of a random experiment. Two types:
- Discrete: X takes countable values (0, 1, 2...). E.g., number of heads in 3 tosses.
- Continuous: X takes any value in an interval. E.g., heights, times, weights.
Expectation and Variance
For a discrete RV:
- E[X] = Σ x · p(x)
- V[X] = E[X²] − (E[X])², where E[X²] = Σ x² · p(x)
Linear Transformation Rule — Memorize This
If Y = aX + b (a, b are constants):
- E[Y] = a · E[X] + b
- V[Y] = a² · V[X]
+b shifts the mean but does NOT affect variance. The coefficient gets squared in variance.
Uniform Distribution — X ~ U(a, b)
PDF is flat (rectangle) on [a, b]: f(x) = 1/(b−a)
| Quantity | Formula |
|---|---|
| Mean | E[X] = (a + b) / 2 |
| Variance | V[X] = (b − a)² / 12 |
BLOCK 3: Binomial + Poisson
Binomial Distribution — X ~ B(n, p)
Use when:
- Fixed number n of independent trials
- Each trial: success (prob p) or failure (prob 1−p)
- X = number of successes
PMF: P(X = x) = C(n,x) · pˣ · (1−p)ⁿ⁻ˣ
| Quantity | Formula |
|---|---|
| Mean | E[X] = np |
| Variance | V[X] = np(1−p) |
Excel:
=BINOM.DIST(x, n, p, FALSE)→ P(X = x)=BINOM.DIST(x, n, p, TRUE)→ P(X ≤ x)
💡 "At least one" trick: P(X ≥ 1) = 1 − P(X = 0). Most common Binomial pattern in exams.
What do x, n, and p mean?
- n = total number of trials (e.g., 5 HR directors surveyed)
- p = probability of success in one trial (e.g., 0.30 probability of saying "NOT important")
- x = number of successes (e.g., exactly 2 directors say "NOT important")
"Success" = whatever the question is counting. It does NOT always mean something good.
Poisson Distribution — X ~ Poisson(λ)
Use when:
- Counting rare events over a fixed interval (time, area, volume)
- You know the average rate λ for that interval
PMF: P(X = x) = e⁻λ · λˣ / x!
| Quantity | Formula |
|---|---|
| Mean | E[X] = λ |
| Variance | V[X] = λ |
Critical scaling rule: If λ is "events per hour" and you're asked about T minutes, scale λ proportionally to match the time window.
Excel:
=POISSON.DIST(x, λ, FALSE)→ P(X = x)=POISSON.DIST(x, λ, TRUE)→ P(X ≤ x)
How to find λ:
λ = rate × time
Example: 15 calls/hour, time = 20 minutes = 1/3 hour → λ = 15 × 1/3 = 5
BLOCK 4: Exponential + Normal
Exponential Distribution — X ~ Exp(λ)
Use when measuring waiting time (continuous time between Poisson events).
| Quantity | Formula |
|---|---|
| Mean | E[X] = 1/λ |
| Variance | V[X] = 1/λ² |
| P(X ≤ x) | 1 − e^(−λx) |
| P(X > x) | e^(−λx) |
💡 Poisson-Exponential link: If events arrive Poisson with rate λ, time between events is Exponential with the same λ.
Key relationship: λ = 1 / mean
- If average time between events = 20 minutes → λ = 1/20 = 0.05
Excel: =EXPON.DIST(x, λ, TRUE) → P(X ≤ x)
Complement: P(X > x) = 1 − P(X ≤ x) — use when question asks "more than" or "after" x time.
Normal Distribution — X ~ N(μ, σ)
Continuous, symmetric. Mean = median = mode = μ. Spread controlled by σ.
Empirical rule:
- ~68% of data within μ ± 1σ
- ~95% within μ ± 2σ
- ~99.7% within μ ± 3σ
Z-score: Z = (X − μ) / σ
Converts X to Z ~ N(0,1). Z = "how many SDs above/below the mean."
Two problem types:
- Forward: Given X, find probability → compute z, look up Φ(z). Excel:
=NORM.DIST(x, μ, σ, TRUE) - Inverse: Given probability, find X → find z, solve z = (X − μ)/σ. Excel:
=NORM.S.INV(p)
Critical z-values to memorize:
| Cumulative prob | z-value |
|---|---|
| 0.10 | −1.2816 |
| 0.05 | −1.6449 |
| 0.025 | −1.96 |
| 0.90 | +1.2816 |
| 0.95 | +1.6449 |
| 0.975 | +1.96 |
BLOCK 5: Final Cheat Sheet
Distribution Decision Tree
Is the variable a COUNT?
- YES → Discrete
- Fixed n trials, each yes/no? → Binomial
- Counting rare events in an interval? → Poisson
- NO → Continuous
- Equally likely on [a,b]? → Uniform
- Time between rare events? → Exponential
- Bell-shaped, real-world measurement? → Normal
Formula Card
| Distribution | E[X] | V[X] | Key Formula |
|---|---|---|---|
| Binomial(n,p) | np | np(1−p) | P(X=x) = C(n,x)·pˣ·(1−p)ⁿ⁻ˣ |
| Poisson(λ) | λ | λ | P(X=x) = e⁻λ·λˣ/x! |
| Uniform(a,b) | (a+b)/2 | (b−a)²/12 | f(x) = 1/(b−a) |
| Exponential(λ) | 1/λ | 1/λ² | P(X≤x) = 1−e⁻λˣ |
| Normal(μ,σ) | μ | σ² | Z = (X−μ)/σ |
Linear transform Y = aX + b (any distribution):
- E[Y] = a · E[X] + b
- V[Y] = a² · V[X]
Excel Functions
| Need | Excel |
|---|---|
| Binomial P(X=x) | =BINOM.DIST(x, n, p, FALSE) |
| Binomial P(X≤x) | =BINOM.DIST(x, n, p, TRUE) |
| Poisson P(X=x) | =POISSON.DIST(x, λ, FALSE) |
| Poisson P(X≤x) | =POISSON.DIST(x, λ, TRUE) |
| Exponential P(X≤x) | =EXPON.DIST(x, λ, TRUE) |
| Normal P(X≤x) | =NORM.DIST(x, μ, σ, TRUE) |
| Standard normal Φ(z) | =NORM.S.DIST(z, TRUE) |
| Inverse normal: find x | =NORM.INV(p, μ, σ) |
| Inverse standard normal: find z | =NORM.S.INV(p) |
The "Cumulative" Toggle
- TRUE → P(X ≤ x) — use for "at most", "less than"
- FALSE → P(X = x) — use for "exactly"
The "At Least" Trick
P(X ≥ x) = 1 − P(X ≤ x−1)
Excel (Binomial, at least 1): =1 - BINOM.DIST(0, n, p, TRUE)
When to Use "S" (Standard) Versions
NORM.S.DIST/NORM.S.INV→ when working with Z-scores (mean=0, SD=1)NORM.DIST/NORM.INV→ when working with actual values
Five Common Mistakes to Avoid
- Forgetting to scale Poisson λ when the time window changes — always match rate to interval first.
- Using "at least one" the long way — go straight to
1 − P(X=0). - Mixing up "less than" vs "more than" in Normal — always sketch the curve and shade the region.
- Not squaring the coefficient in variance — V[3X−4] = 9·V[X], not 3·V[X].
- Choosing the wrong distribution — "time between events" → Exponential (not Normal). "Counts" → Poisson (not Binomial, unless n is fixed).
Before the Exam
- Excel is allowed — use the functions above. Don't compute by hand if you don't have to.
- Always sketch the distribution and shade the region you want.
- For Bayes — draw the partition + evidence diagram before writing any formula.
- For Poisson — rescale λ first, then plug in.
- For "at least one" Binomial — immediately go to
1 − (1−p)ⁿ.