
EBS mini-seminar, Monash University
29 May 2026
Ask me if interested

Developed by Wald (1945)
Birth of sequential analysis
Data (X_i): 0, 0, 1, 1, 0, 1, \dots
Hypotheses: \begin{cases} H_0\colon p = p_0 \\ H_1\colon p = p_1 \end{cases}

Test statistic: S_t = \frac{L_t(p_1)}{L_t(p_0)} = \frac{\Pr(X_1, \dots, X_t \mid H_1)} {\Pr(X_1, \dots, X_t \mid H_0)} = \frac{p_1^{Y_t} (1 - p_1)^{t - Y_t}} {p_0^{Y_t} (1 - p_0)^{t - Y_t}}
Where: Y_t = \sum_{i=1}^t X_i
Stopping rule: \begin{cases} S_t \leqslant \frac{\beta}{1 - \alpha} & \Rightarrow \text{accept } H_0 \\ S_t \geqslant \frac{1 - \beta}{\alpha} & \Rightarrow \text{accept } H_1 \\ \text{otherwise} & \Rightarrow \text{keep sampling} \end{cases}
Error control:


“Stays the same” on average

“Stays the same or reduces in value” on average
Martingale condition: \mathbb{E}\left(S_{t+1} \mid X_1, \dots, X_t\right) = S_t
Supermartingale condition: \mathbb{E}\left(S_{t+1} \mid X_1, \dots, X_t\right) \leqslant S_t
Let S_0, S_1, S_2, \dots be a test supermartingale (TSM):
has non-negative terms (S_i \geqslant 0) and starting value S_0 = 1.
For any \alpha > 0, \Pr\left(\max_t (S_t) \geqslant \frac{1}{\alpha}\right) \leqslant \alpha


Decision rule:
Ville’s inequality \Rightarrow anytime-valid type I error control
Growth rate under H_1 \rightarrow statistical efficiency (“power”)
Let S be a non-negative random variable (S \geqslant 0).
For any \alpha > 0, \Pr\left(S \geqslant \frac{1}{\alpha}\right) \leqslant \alpha \cdot \mathbb{E}(S)
Let S = E, an e-variable. Under H_0 we have: \Pr\left(E \geqslant \frac{1}{\alpha}\right) \leqslant \alpha
Reject H_0 if E \geqslant 1 / \alpha
Markov’s inequality \Rightarrow type I error control
Note
A, B, C are e-variables
A, B, C independent \Rightarrow the product is an e-variable: E = A \cdot B \cdot C
The average is always an e-variable: E = \frac{A + B + C}{3}
(Easier than combining p-values!)
\text{Let } P_t = \frac{1}{E_t}
E_t > \frac{1}{\alpha} \quad \Longrightarrow \quad P_t < \alpha
H_0 = H_0^1 \cap H_0^2 \cap \dots \cap H_0^k
Need to reject at least one of the H_0^j
…but which ones are false?
Base TSM: for H_0^j
S_{j,t} = \prod_{i = 1}^t s_{j, i}
Intersection TSM: for H_0
S_t = \prod_{i = 1}^t \frac{\sum_j w_{j, i} \, s_{j, i}}{\sum_jw_{j, i}}
The weights w_{j, i} can depend on the data up until time i - 1.
Sequential analysis allows for “safe” inference
Modern tools use TSMs and e-processes
Complex inference problems can be tackled sequentially with finite-sample guarantees.
Chugg, Ramdas, Grünwald (2026). E-values as statistical evidence: A comparison to Bayes factors, likelihoods, and p-values. arXiv:2603.24421.
Ek, Stark, Stuckey, Vukcevic (2023). Adaptively Weighted Audits of Instant-Runoff Voting Elections: AWAIRE. In: Electronic Voting. E-Vote-ID 2023. Lecture Notes in Computer Science 14230:35–51.
Ramdas, Grünwald, Vovk, Shafer (2023). Game-Theoretic Statistics and Safe Anytime-Valid Inference. Statist. Sci. 38(4):576–601.
Wald (1945). Sequential tests of statistical hypotheses. Ann. Math. Stat. 16(2):117–186.
The coin flipping graphic was designed by Wannapik.
The photograph of Abraham Wald is freely available from Wikipedia.
The photograph of the 18th century print is freely available from the Digital Commonwealth; the original sits at the Boston Public Library.



Parameters:
n = 38 for fixed-n test with same power
SPRT often stops earlier
H_0\colon X \sim f_0(\cdot) \quad \text{vs} \quad H_1\colon X \sim f_1(\cdot)
Test statistic: S_t = \frac{f_1(X_1, X_2, \dots, X_t)} {f_0(X_1, X_2, \dots, X_t)}
Test statistic, with iid observations: S_t = \prod_{i=1}^t \frac{f_1(X_i)}{f_0(X_i)} = S_{t - 1} \times \frac{f_1(X_t)}{f_0(X_t)}
At time t: \mathbb{E}_{H_0} \left(\frac{f_1(X_t)}{f_0(X_t)}\right) = \int \frac{f_1(x)}{f_0(x)} f_0(x) \, dx = \int f_1(x) \, dx = 1
This implies the martingale condition.
H_0\colon \theta = \theta_0 \quad \text{vs} \quad H_1\colon \theta \in \mathcal{A}
Plug-in strategy: S_t = S_{t-1} \times \frac{f(X_t \mid \hat\theta_t)}{f(X_t \mid \theta_0)} \hat\theta_t = a(X_1, X_2, \dots, X_{t-1}) \in \mathcal{A}
H_0\colon \theta = \theta_0 \quad \text{vs} \quad H_1\colon \theta \in \mathcal{A}
Mixture strategy: S_t = S_{t-1} \times \frac{\int_{\mathcal{A}} f(X_t \mid \theta) \, g_t(\theta) \, d\theta}{f(X_t \mid \theta_0)} g_t(\theta) \text{ is a distribution over } \mathcal{A} g_t(\theta) \text{ can depend on } X_1, X_2, \dots, X_{t-1}
“Martingales”: 🎲 gambling strategies from 18th century France ♣️
These strategies cannot “beat the house”.
Martingales (in mathematics): represent the 💰 wealth over time of a gambler playing a fair game.
Test statistic: S_t, a TSM under H_0
Intrepretation:
Gambler gets rich \rightarrow evidence against H_0
Ville’s inequality: “one in a million chance of becoming a millionaire”
Fixed-sample hypothesis test \longleftrightarrow confidence interval (CI)
Sequential hypothesis test \longleftrightarrow confidence sequence (CS)


Reanalysing the data with different test statistics (“betting strategies”) and only reporting the best one.
This is not a safe procedure.
Fundamental principle:
Declare your bet before you play.
Pre-registration is still an important tool.
However, anytime-valid methods retain flexibility even after pre-specification.