Random Variables

Overview

This topic builds the entire probabilistic foundation. We start from the formal probability space $(\Omega, \mathcal{F}, P)$ , define random variables and their key characteristics, then cover the essential tools — moments, inequalities, and transforms — that appear directly in quant interview problems.

1. Probability Space

1.1 Probability Space $(\Omega, \mathcal{F}, P)$

Definition— Sample Space $\Omega$

The sample space $\Omega$ (also written $\Gamma$ ) is the set of all possible outcomes of a random experiment. Each element $\omega \in \Omega$ is called an outcome.

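Definition— Event and $\sigma$-algebra $\mathcal{F}$

An event is a subset $A \subseteq \Omega$. The collection $\mathcal{F}$ of all observable events is a $\sigma$-algebra on $\Omega$, satisfying:

$\Omega \in \mathcal{F}$
$A \in \mathcal{F} \Rightarrow \bar{A} \in \mathcal{F}$ (closed under complement)
$A_1, A_2, \ldots \in \mathcal{F} \Rightarrow \bigcup_{i \geq 1} A_i \in \mathcal{F}$ (closed under countable unions)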

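Definition— Probability Measure $P$

A probability measure is a function $P : \mathcal{F} \to [0, 1]$ satisfying the Kolmogorov axioms:

$P(\Omega) = 1$
For pairwise disjoint events $A_1, A_2, \ldots \in \mathcal{F}$: $P\left(\bigcup_{i \geq 1} A_i\right) = \sum_{i \geq 1} P(A_i)$ (countable additivity)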

Remark

In practice, you never need to manipulate $\mathcal{F}$ explicitly in interview problems. What matters is the probability measure $P$ and its properties below.

1.2 Basic Properties of $P$

Properties

$P(\emptyset) = 0$
$P(\bar{A}) = 1 - P(A)$
$A \subseteq B \Rightarrow P(A) \leq P(B)$ (monotonicity)

Theorem— Boole's Inequality (Union Bound)

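For any events $A_1, A_2, \ldots, A_n$ (not necessarily disjoint):

\boxed{P\left(\bigcup_{i=1}^{n} A_i\right) \leq \sum_{i=1}^{n} P(A_i)}

The same bound holds for a countable sequence of events, with $n$ replaced by $\infty$.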

💡Tip

Boole's inequality is the go-to tool to upper bound the probability that at least one of many bad events occurs. If the sum of the $P(A_i)$ is small, the probability of the union is also small.
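As a quick sanity check on the union bound, the sketch below enumerates two fair dice and compares the exact probability of a union with the sum of the individual probabilities. The sample space and the three events are illustrative choices, not taken from the text.

```python
from fractions import Fraction
from itertools import product

# Sample space: ordered outcomes of two fair six-sided dice (illustrative choice).
omega = list(product(range(1, 7), repeat=2))

def prob(event):
    """Exact probability of an event given as a predicate on outcomes."""
    favourable = sum(1 for w in omega if event(w))
    return Fraction(favourable, len(omega))

# Three overlapping "bad events" (hypothetical, chosen so that they are not disjoint).
events = [
    lambda w: w[0] == 6,          # first die shows 6
    lambda w: w[1] == 6,          # second die shows 6
    lambda w: w[0] + w[1] >= 11,  # sum is at least 11
]

p_union = prob(lambda w: any(e(w) for e in events))
union_bound = sum(prob(e) for e in events)

print("exact P(union):", p_union)
print("union bound   :", union_bound)
```

The bound is loose here because the events overlap heavily; it is tight only when the events are disjoint.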

1.3 Continuity of $P$

Theorem— Sequential Continuity

For any monotone sequence of events:

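If $A_1 \subseteq A_2 \subseteq \cdots$ (increasing), then:

\boxed{P\left(\bigcup_{n=1}^{\infty} A_n\right) = \lim_{n \to \infty} P(A_n)}

If $A_1 \supseteq A_2 \supseteq \cdots$ (decreasing), then:

\boxed{P\left(\bigcap_{n=1}^{\infty} A_n\right) = \lim_{n \to \infty} P(A_n)}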

Remark

This theorem justifies passing limits through probability signs for monotone sequences — a technique used often in convergence proofs.

2. Random Variables

2.1 Definition of a Random Variable

Definition

A random variable $X$ is a measurable function $X : \Omega \to \mathbb{R}$ , i.e. for all $x \in \mathbb{R}$ :

\{\omega \in \Omega : X(\omega) \leq x\} \in \mathcal{F}

Remark

In practice, a random variable is simply a quantity whose value depends on the outcome of a random experiment. The measurability condition ensures that $P(X \leq x)$ is always well-defined.

2.2 Cumulative Distribution Function (CDF)

Definition

The cumulative distribution function (CDF) of a random variable $X$ is:

\boxed{F_X(x) = P(X \leq x), \quad x \in \mathbb{R}}

Properties

$F_X$ is non-decreasing
$\lim_{x \to -\infty} F_X(x) = 0$ and $\lim_{x \to +\infty} F_X(x) = 1$
$F_X$ is right-continuous

Remark

The CDF uniquely characterizes the distribution of $X$ . Two random variables with the same CDF have the same distribution.

2.3 Independence

Definition

Two events $A, B \in \mathcal{F}$ are independent if:

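\boxed{P(A \cap B) = P(A)\, P(B)}

Two random variables $X, Y$ are independent if $P(X \leq x,\, Y \leq y) = P(X \leq x)\, P(Y \leq y)$ for all $x, y \in \mathbb{R}$.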

Properties

$X, Y$ independent $\Rightarrow E[XY] = E[X]\,E[Y]$
$X, Y$ independent $\Rightarrow \text{Cov}(X, Y) = 0$ (the converse is false in general)

Remark

Pairwise independence does not imply mutual independence. A classic counterexample: toss two fair coins, let $A = \{\text{first is H}\}$ , $B = \{\text{second is H}\}$ , $C = \{\text{both same}\}$ — pairwise independent but not mutually independent.
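The counterexample can be checked by brute force. The sketch below enumerates the four outcomes of two fair coin tosses and tests the product rule for each pair and for the triple; the helper names `prob` and `both` are introduced here for illustration.

```python
from fractions import Fraction
from itertools import product

# Four equally likely outcomes: ('H','H'), ('H','T'), ('T','H'), ('T','T').
omega = list(product("HT", repeat=2))

def prob(event):
    """Exact probability of an event given as a predicate on outcomes."""
    return Fraction(sum(1 for w in omega if event(w)), len(omega))

def both(e1, e2):
    """Intersection of two events."""
    return lambda w: e1(w) and e2(w)

A = lambda w: w[0] == "H"   # first toss is heads
B = lambda w: w[1] == "H"   # second toss is heads
C = lambda w: w[0] == w[1]  # both tosses agree

# Pairwise independence: the product rule holds for every pair.
pairwise = all(
    prob(both(e1, e2)) == prob(e1) * prob(e2)
    for e1, e2 in [(A, B), (A, C), (B, C)]
)

# Mutual independence additionally requires the product rule for the triple.
p_all = prob(lambda w: A(w) and B(w) and C(w))
mutual = p_all == prob(A) * prob(B) * prob(C)

print("pairwise independent:", pairwise)
print("P(A and B and C) =", p_all, "vs product", prob(A) * prob(B) * prob(C))
```

The triple intersection is just the outcome HH, so its probability is 1/4 rather than the 1/8 that mutual independence would require.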

3. Moments

3.1 Expected Value

Definition

The expected value (expectation) of a random variable $X$ is:

Discrete: $\displaystyle E[X] = \sum_{x} x\, P(X = x)$
Continuous (density $f_X$): $\displaystyle E[X] = \int_{-\infty}^{+\infty} x\, f_X(x)\, dx$

Formula— Transfer Formula (Law of the Unconscious Statistician)

For any measurable function $g : \mathbb{R} \to \mathbb{R}$ :

\boxed{E[g(X)] = \int_{-\infty}^{+\infty} g(x)\, f_X(x)\, dx}

Formula— Tail Formula

For a non-negative random variable $X \geq 0$ :

\boxed{E[X] = \int_0^{+\infty} P(X > t)\, dt}

Properties

$E[aX + b] = aE[X] + b$ $\quad$ (linearity)
$E[X + Y] = E[X] + E[Y]$ (linearity, always)

💡Tip

The tail formula $E[X] = \int_0^\infty P(X > t)\,dt$ is extremely powerful for non-negative integer-valued RVs, where it becomes $E[X] = \sum_{k=0}^{\infty} P(X > k) = \sum_{k=1}^{\infty} P(X \geq k)$. Use it for geometric distributions, waiting times, and coupon collector-type problems.
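A minimal numerical sketch of the discrete tail formula, assuming a geometric variable on $\{1, 2, \ldots\}$ with success probability $p$, so that $P(X > k) = (1-p)^k$ and $E[X] = 1/p$. The function names and the truncation length are illustrative choices.

```python
def geometric_mean_via_tail(p, terms=10_000):
    """Approximate E[X] = sum_{k>=0} P(X > k) for X ~ Geometric(p) on {1, 2, ...}."""
    q = 1.0 - p
    # P(X > k) = q**k: the first k trials all fail.
    return sum(q ** k for k in range(terms))

def geometric_mean_via_definition(p, terms=10_000):
    """Approximate E[X] = sum_{k>=1} k * P(X = k) for the same distribution."""
    q = 1.0 - p
    return sum(k * q ** (k - 1) * p for k in range(1, terms))

p = 0.2
print(geometric_mean_via_tail(p))        # close to 1 / p
print(geometric_mean_via_definition(p))  # close to 1 / p
```

The tail version is a plain geometric series, which is why it is usually the faster route by hand.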

3.2 Variance

Definition

The variance of $X$ measures the spread around its mean $\mu = E[X]$ :

\boxed{\text{Var}(X) = E\!\left[(X - \mu)^2\right] = E[X^2] - (E[X])^2}

Properties

$\text{Var}(aX + b) = a^2\,\text{Var}(X)$
$\text{Var}(X) \geq 0$, and $\text{Var}(X) = 0 \iff X = E[X]$ a.s.

Formula— Covariance

\boxed{\text{Cov}(X, Y) = E[XY] - E[X]\,E[Y]}

Formula— Correlation

\boxed{\rho(X, Y) = \frac{\text{Cov}(X,Y)}{\sigma_X\, \sigma_Y} \in [-1, 1]}

💡Tip

In interviews, reach for $\text{Var}(X) = E[X^2] - (E[X])^2$ to compute variance: it avoids expanding $(X - \mu)^2$ directly. Compute $E[X^2]$ via the transfer formula with $g(x) = x^2$.
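The shortcut can be illustrated on a fair six-sided die, an example chosen here rather than taken from the text. The sketch computes the variance both ways with exact fractions; `expectation` is a hypothetical helper implementing the discrete transfer formula $E[g(X)] = \sum_x g(x)\,P(X = x)$.

```python
from fractions import Fraction

def expectation(pmf, g=lambda x: x):
    """E[g(X)] for a discrete pmf given as {value: probability}."""
    return sum(g(x) * p for x, p in pmf.items())

# Fair six-sided die (illustrative distribution).
die = {x: Fraction(1, 6) for x in range(1, 7)}

mean = expectation(die)                            # E[X]
second_moment = expectation(die, lambda x: x * x)  # E[X^2]

variance = second_moment - mean ** 2               # shortcut formula
variance_direct = expectation(die, lambda x: (x - mean) ** 2)  # definition

print(mean, second_moment, variance, variance_direct)
```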

3.3 Higher Moments

Definition

The moment of order $k$ of $X$ is:

\boxed{m_k = E[X^k], \quad k \in \mathbb{N}}

Properties

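Central moment of order $k$: $\mu_k = E[(X - \mu)^k]$, so $\mu_1 = 0$ and $\mu_2 = \text{Var}(X)$
Skewness: $\gamma_1 = \dfrac{\mu_3}{\sigma^3}$ (measures asymmetry of the distribution)
Excess kurtosis: $\gamma_2 = \dfrac{\mu_4}{\sigma^4} - 3$ (measures tail weight relative to the normal distribution)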

Remark

For a normal distribution: $\gamma_1 = 0$ (symmetric) and $\gamma_2 = 0$ (excess kurtosis = 0). A distribution with $\gamma_2 > 0$ is leptokurtic (heavy tails), which is relevant in finance, where asset returns exhibit excess kurtosis.

4. Inequalities

4.1 Markov's Inequality

Theorem— Markov's Inequality

For any random variable $X \geq 0$ and $a > 0$ :

\boxed{P(X \geq a) \leq \frac{E[X]}{a}}

💡Tip

Markov only requires $X \geq 0$ and knowledge of $E[X]$ — it is the weakest but most general bound. In interviews, use it when you only know the mean and need an upper bound on a tail probability.
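To see how loose Markov can be, the sketch below compares the bound $E[X]/a$ with the exact tail $P(X \geq a) = e^{-\lambda a}$ of an exponential variable with rate $\lambda$ (mean $1/\lambda$). The exponential distribution and the function names are assumptions made for illustration.

```python
import math

def markov_bound(mean, a):
    """Markov upper bound on P(X >= a) for X >= 0, capped at 1."""
    if a <= 0:
        raise ValueError("a must be positive")
    return min(1.0, mean / a)

def exponential_tail(rate, a):
    """Exact P(X >= a) for X ~ Exponential(rate)."""
    return math.exp(-rate * a)

rate = 1.0
for a in (0.5, 1.0, 2.0, 5.0, 10.0):
    exact = exponential_tail(rate, a)
    bound = markov_bound(1.0 / rate, a)
    print(f"a={a:5.1f}  exact={exact:.6f}  markov={bound:.6f}")
```

The exact tail decays exponentially while the bound decays only like $1/a$, which is the price of using nothing but the mean.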

4.2 Bienaymé-Chebyshev Inequality

Theorem— Bienaymé-Chebyshev

For any random variable $X$ with mean $\mu$ and variance $\sigma^2$, and for any $k > 0$:

\boxed{P(|X - \mu| \geq k\sigma) \leq \frac{1}{k^2}}

💡Tip

Chebyshev is the standard tool to prove the weak Law of Large Numbers and to bound probabilities when only mean and variance are known. It is distribution-free: valid for any $X$ with finite variance.
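A sketch of how Chebyshev leads to the weak law: for $n$ i.i.d. draws with variance $\sigma^2$, the sample mean $\bar{X}_n$ has variance $\sigma^2 / n$, so $P(|\bar{X}_n - \mu| \geq \varepsilon) \leq \sigma^2 / (n \varepsilon^2)$, which tends to 0 as $n$ grows. The fair-die variance $35/12$ and the tolerance $\varepsilon = 1/2$ below are illustrative assumptions.

```python
from fractions import Fraction

def chebyshev_sample_mean_bound(variance, n, eps):
    """Chebyshev bound on P(|sample mean - mu| >= eps) for n i.i.d. draws, capped at 1."""
    return min(Fraction(1), Fraction(variance) / (n * Fraction(eps) ** 2))

die_variance = Fraction(35, 12)  # variance of one fair six-sided die
eps = Fraction(1, 2)

bounds = {n: chebyshev_sample_mean_bound(die_variance, n, eps) for n in (10, 100, 1000)}
for n, b in bounds.items():
    print(f"n={n:5d}  bound={b}")
```

For small $n$ the raw bound exceeds 1 and carries no information, which is why it is capped.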

4.3 Cantelli's Inequality (One-sided Chebyshev)

Theorem— Cantelli's Inequality

For any random variable $X$ with mean $\mu$ and variance $\sigma^2$, and for any $\lambda > 0$:

\boxed{P(X - \mu \geq \lambda) \leq \frac{\sigma^2}{\sigma^2 + \lambda^2}}

Remark

Cantelli is sharper than Chebyshev for a one-sided tail: applying Chebyshev to $P(X \geq \mu + \lambda)$ only gives $\sigma^2 / \lambda^2$, whereas Cantelli gives the smaller bound $\sigma^2 / (\sigma^2 + \lambda^2)$, which never exceeds 1. Useful when the direction of deviation matters.
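The gap between the two one-sided bounds can be tabulated directly. This sketch assumes the standard forms $\sigma^2 / \lambda^2$ (Chebyshev applied to one tail) and $\sigma^2 / (\sigma^2 + \lambda^2)$ (Cantelli); the function names and the unit variance are illustrative.

```python
def chebyshev_one_tail(variance, lam):
    """Bound on P(X - mu >= lam) inherited from two-sided Chebyshev, capped at 1."""
    return min(1.0, variance / lam ** 2)

def cantelli(variance, lam):
    """Cantelli bound on P(X - mu >= lam)."""
    return variance / (variance + lam ** 2)

variance = 1.0
for lam in (0.5, 1.0, 2.0, 3.0):
    print(f"lambda={lam:3.1f}  chebyshev={chebyshev_one_tail(variance, lam):.4f}  "
          f"cantelli={cantelli(variance, lam):.4f}")
```

The improvement matters most for deviations of about one standard deviation, where Chebyshev is trivial and Cantelli still gives 1/2.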

4.4 Jensen's Inequality

Theorem— Jensen's Inequality

Let $\varphi : \mathbb{R} \to \mathbb{R}$ be a convex function and $X$ a random variable with $E[|X|] < \infty$. Then:

\boxed{\varphi(E[X]) \leq E[\varphi(X)]}

Properties

$\varphi(x) = x^2$ convex $\Rightarrow (E[X])^2 \leq E[X^2]$ (i.e. $\text{Var}(X) \geq 0$)

💡Tip

Jensen is the key inequality when dealing with convex/concave transformations of expectations — log-returns, utility functions, option pricing bounds. Whenever you see $E[f(X)]$ vs $f(E[X])$ , ask yourself if $f$ is convex or concave.
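A small numerical check of both directions, assuming a variable uniform on $\{1, 2, 4\}$ (an illustrative choice): the convex map $x \mapsto x^2$ gives $E[X^2] \geq (E[X])^2$, and the concave map $\log$ reverses the inequality, $E[\log X] \leq \log E[X]$.

```python
import math

# Illustrative discrete distribution: uniform on {1, 2, 4}.
values = [1.0, 2.0, 4.0]
probs = [1 / 3, 1 / 3, 1 / 3]

def expect(g):
    """E[g(X)] for the discrete distribution above."""
    return sum(p * g(x) for x, p in zip(values, probs))

mean = expect(lambda x: x)

# Convex case: E[X^2] - (E[X])^2 >= 0.
convex_gap = expect(lambda x: x ** 2) - mean ** 2

# Concave case: log(E[X]) - E[log X] >= 0.
concave_gap = math.log(mean) - expect(math.log)

print("convex gap :", convex_gap)
print("concave gap:", concave_gap)
```

The concave gap is the reason the expected log-return sits below the log of the expected gross return.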

4.5 Cauchy-Schwarz Inequality

Theorem— Cauchy-Schwarz

For any two random variables $X, Y$ with finite second moments:

\boxed{\left(E[XY]\right)^2 \leq E[X^2]\, E[Y^2]}

Properties

Equality holds if and only if $Y = cX$ a.s. for some constant $c$ (or $X = 0$ a.s.)
Implies $|\rho(X,Y)| \leq 1$: applied to the centered variables, $\text{Cov}(X, Y)^2 \leq \text{Var}(X)\, \text{Var}(Y)$

4.6 Hölder's Inequality

Theorem— Hölder's Inequality

For any random variables $X, Y$ and conjugate exponents $p, q \geq 1$ with $\dfrac{1}{p} + \dfrac{1}{q} = 1$:

\boxed{E[|XY|] \leq \left(E[|X|^p]\right)^{1/p} \left(E[|Y|^q]\right)^{1/q}}

Remark

Cauchy-Schwarz is the special case $p = q = 2$. Hölder generalizes to any conjugate pair. The case $p = 1, q = \infty$ gives $E[|XY|] \leq E[|X|] \cdot \|Y\|_\infty$.

5. Generating Functions & Transforms

5.1 Probability Generating Function (PGF)

Definition

For a non-negative integer-valued random variable $X$ , the probability generating function is:

\boxed{G_X(z) = E[z^X] = \sum_{k=0}^{\infty} P(X = k)\, z^k, \quad |z| \leq 1}

Properties

$G_X(1) = 1$
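$P(X = k) = \dfrac{G_X^{(k)}(0)}{k!}$
$E[X] = G_X'(1)$ (when the mean is finite)
$X, Y$ independent $\Rightarrow G_{X+Y}(z) = G_X(z)\, G_Y(z)$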

💡Tip

PGFs are most useful for sums of independent discrete RVs and branching processes. The factorization property $G_{X+Y} = G_X \cdot G_Y$ is the key tool.
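Because a PGF is a power series whose coefficients are the probabilities $P(X = k)$, the factorization for independent variables amounts to polynomial multiplication. The sketch below applies this to the sum of two fair dice; the die example and the helper `pgf_product` are illustrative, not from the text.

```python
from fractions import Fraction

def pgf_product(a, b):
    """Coefficients of G_X(z) * G_Y(z); index k holds P(X + Y = k) for independent X, Y."""
    out = [Fraction(0)] * (len(a) + len(b) - 1)
    for i, pa in enumerate(a):
        for j, pb in enumerate(b):
            out[i + j] += pa * pb
    return out

# PGF coefficients of one fair die: P(X = 0) = 0 and P(X = k) = 1/6 for k = 1..6.
die = [Fraction(0)] + [Fraction(1, 6)] * 6

two_dice = pgf_product(die, die)

for total, p in enumerate(two_dice):
    if p:
        print(total, p)
```

Evaluating the product at $z = 1$ sums the coefficients, which recovers $G_{X+Y}(1) = 1$.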

5.2 Moment Generating Function (MGF)

Definition

The moment generating function of $X$ is:

\boxed{M_X(t) = E[e^{tX}] = \int_{-\infty}^{+\infty} e^{tx}\, f_X(x)\, dx}

Properties

$M_X(0) = 1$
$E[X^k] = M_X^{(k)}(0)$ (moments from derivatives)

Remark

The MGF does not always exist (e.g. heavy-tailed distributions like Cauchy). In that case, use the characteristic function instead.

💡Tip

In interviews, the MGF is used to: (1) identify a distribution by matching its MGF to a known one, (2) compute moments quickly via differentiation, (3) prove that a sum of independents follows a known law.
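As a worked illustration of these three uses, consider independent Poisson variables, an example chosen here rather than taken from the text. It relies on two standard facts: the MGF of a sum of independent variables is the product of their MGFs, and an MGF that is finite in a neighborhood of 0 determines the distribution.

```latex
\begin{aligned}
&X \sim \text{Poisson}(\lambda_1), \quad Y \sim \text{Poisson}(\lambda_2), \quad X, Y \text{ independent} \\[4pt]
&M_X(t) = E[e^{tX}] = \sum_{k=0}^{\infty} e^{tk}\, e^{-\lambda_1} \frac{\lambda_1^k}{k!}
        = e^{-\lambda_1} \sum_{k=0}^{\infty} \frac{(\lambda_1 e^t)^k}{k!}
        = e^{\lambda_1 (e^t - 1)} \\[4pt]
&M_{X+Y}(t) = M_X(t)\, M_Y(t) = e^{(\lambda_1 + \lambda_2)(e^t - 1)}
        \;\Rightarrow\; X + Y \sim \text{Poisson}(\lambda_1 + \lambda_2) \\[4pt]
&M_X'(t) = \lambda_1 e^t\, e^{\lambda_1 (e^t - 1)}
        \;\Rightarrow\; E[X] = M_X'(0) = \lambda_1
\end{aligned}
```

The third line is the identification step: the product has the form of a Poisson MGF with parameter $\lambda_1 + \lambda_2$.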

5.3 Characteristic Function

Definition

The characteristic function of $X$ is:

\boxed{\varphi_X(t) = E[e^{itX}] = \int_{-\infty}^{+\infty} e^{itx}\, f_X(x)\, dx, \quad t \in \mathbb{R}}

Properties

$\varphi_X(0) = 1$ and $|\varphi_X(t)| \leq 1$ for all $t \in \mathbb{R}$

Remark

The characteristic function is the Fourier transform of the density. It always exists, making it more general than the MGF. When $\varphi_X$ is integrable, the inversion formula recovers the density: $f_X(x) = \dfrac{1}{2\pi}\int_{-\infty}^{+\infty} e^{-itx}\varphi_X(t)\,dt$.

5.4 Laplace Transform

Definition

For a non-negative random variable $X \geq 0$ , the Laplace transform is:

\boxed{\mathcal{L}_X(s) = E[e^{-sX}] = \int_0^{+\infty} e^{-sx}\, f_X(x)\, dx, \quad s \geq 0}

Properties

$\mathcal{L}_X(0) = 1$
$E[X^k] = (-1)^k \mathcal{L}_X^{(k)}(0)$

Remark

The Laplace transform is essentially the MGF evaluated at $-s$ : $\mathcal{L}_X(s) = M_X(-s)$ . It is preferred for non-negative RVs (exponential, gamma) and in queuing theory / reliability contexts.
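A numerical sketch of the definition, assuming $X \sim \text{Exponential}(\lambda)$ with density $\lambda e^{-\lambda x}$, for which the closed form is $\mathcal{L}_X(s) = \lambda / (\lambda + s)$, i.e. $M_X(t) = \lambda / (\lambda - t)$ evaluated at $t = -s$. The midpoint rule, the truncation point, and the function names are illustrative choices.

```python
import math

def laplace_exponential_numeric(rate, s, upper=50.0, steps=200_000):
    """Midpoint-rule approximation of the integral of e^{-s x} * rate * e^{-rate x} over [0, upper]."""
    h = upper / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        total += math.exp(-(s + rate) * x)
    return rate * h * total

def laplace_exponential_exact(rate, s):
    """Closed form rate / (rate + s) for X ~ Exponential(rate), s >= 0."""
    return rate / (rate + s)

rate, s = 2.0, 1.0
print(laplace_exponential_numeric(rate, s))
print(laplace_exponential_exact(rate, s))
```

Setting $s = 0$ in either function returns 1 (up to numerical error), matching $\mathcal{L}_X(0) = 1$.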

6. Quick Reference

Inequalities Summary

Inequality	Condition	Bound
Markov	$X \geq 0$ , $a > 0$	$P(X \geq a) \leq \dfrac{E[X]}{a}$

Transforms Summary

Transform	Definition	Exists always?	Recovers moments?
PGF	$E[z^X]$	For integer $X \geq 0$	Yes, factorial moments via derivatives at $z = 1$
MGF	$E[e^{tX}]$	No (heavy tails)	Yes, via derivatives at $t = 0$ (when it exists)

All Key Formulas

Concept	Formula
CDF	$F_X(x) = P(X \leq x)$
PDF from CDF	$f_X(x) = F_X'(x)$
