Position-Based Click Model

Definition

Position-Based Click Model (PBM)

The Position-Based Click Model is the simplest practical click model. It assumes a user clicks a document $d$ shown at rank $k$ if and only if the user examines rank $k$ and the document is relevant, and, crucially, that examination depends only on the rank $k$, never on the document, the query, or the surrounding results. It is the operational instantiation of the Examination Hypothesis in which the examination probability is a per-rank constant.

Intuition

Think of each result position as having a fixed “visibility” determined purely by where it sits on the page. Rank 1 is almost always looked at; rank 8 is looked at far less. The PBM bakes this into a single number per rank, the propensity $θ_{k} = P (E_{k} = 1)$, and then treats whether the user clicks given that they looked as a clean measurement of relevance.

This factorization is what makes the model so useful: the two latent causes of a click — did they see it? (position) and did they like it? (relevance) — are assumed independent. Once you know the per-rank examination probabilities, an observed click becomes a noisy but unbiased-after-reweighting signal of relevance, which is exactly what IPW exploits.

The contrast to keep in mind: PBM says examination of rank $k$ is the same regardless of what is above it. The cascade model / Cascading Position Bias says the opposite: examination depends on the relevance of everything above $k$.

Mathematical Formulation

A click random variable $C_{d, k} \in \{0, 1\}$ for document $d$ at rank $k$ factors into two independent Bernoulli events:

$P (C_{d, k} = 1) = \underbrace{P (E_{k} = 1)}_{\text{examination (position only)}} \cdot \underbrace{P (R_{d} = 1 \mid q)}_{\text{relevance (position-free)}}$

where:

$C_{d, k}$: observed click on document $d$ displayed at rank $k$
$E_{k} \in \{0, 1\}$: latent examination event for rank $k$; $P (E_{k} = 1)$ is the propensity $θ_{k}$
$R_{d} \in \{0, 1\}$: latent relevance of $d$ to query $q$; $P (R_{d} = 1 \mid q) = γ_{d}$
$θ_{k} = P (E_{k} = 1)$: depends only on $k$ (the defining PBM assumption); in practice it is typically assumed to decrease with $k$, although the factorization itself does not require this
$γ_{d} = P (R_{d} = 1 \mid q)$: depends only on $(d, q)$, never on $k$

So the per-(document, rank) click probability is simply the product $θ_{k} γ_{d}$. The latent variables $E_{k}$ and $R_{d}$ are unobserved; only $C_{d, k}$ is observed, which is what forces an EM-style inference.
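To make the factorization concrete, here is a minimal simulation sketch. It draws the examination and relevance events independently and checks that the empirical click rate approaches $θ_{k} γ_{d}$. The function names and the numeric propensities are illustrative assumptions, not values from any real log.

```python
import random

def simulate_click(theta_k, gamma_d, rng):
    """Draw one click under PBM: a click happens iff examined AND relevant."""
    examined = rng.random() < theta_k   # latent E_k
    relevant = rng.random() < gamma_d   # latent R_d, drawn independently of E_k
    return int(examined and relevant)

def empirical_ctr(theta_k, gamma_d, n=200_000, seed=0):
    """Average n simulated clicks for one (document, rank) pair."""
    rng = random.Random(seed)
    return sum(simulate_click(theta_k, gamma_d, rng) for _ in range(n)) / n

# Hypothetical propensities per rank and one hypothetical document relevance.
theta = {1: 0.9, 2: 0.6, 3: 0.3}
gamma_d = 0.5

for k, theta_k in theta.items():
    # Empirical click rate next to the model's prediction theta_k * gamma_d.
    print(k, round(empirical_ctr(theta_k, gamma_d), 3), theta_k * gamma_d)
```

The same document loses clicks at lower ranks purely because it is examined less often, which is the position bias the model is meant to isolate.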

Likelihood and EM estimation

Given a click log of sessions $s$, each presenting document $d_{s}$ at rank $k_{s}$ with observed click $c_{s}$, the data log-likelihood is:

$L (θ, γ) = \sum_{s} [c_{s} \log (θ_{k_{s}} γ_{d_{s}}) + (1 - c_{s}) \log (1 - θ_{k_{s}} γ_{d_{s}})]$
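The log-likelihood can be evaluated directly from a log of (document, rank, click) triples. The sketch below is one possible way to do it; the function name, the dictionary layout, and the `eps` clamp (added only to avoid taking the log of zero) are assumptions made for illustration.

```python
import math

def pbm_log_likelihood(click_log, theta, gamma, eps=1e-12):
    """Sum of Bernoulli log-probabilities with click probability theta[k] * gamma[d].

    click_log: iterable of (doc, rank, click) with click in {0, 1}
    theta:     dict rank -> examination propensity
    gamma:     dict doc  -> relevance probability
    """
    total = 0.0
    for d, k, c in click_log:
        p = theta[k] * gamma[d]
        p = min(max(p, eps), 1.0 - eps)  # keep log() finite at the boundaries
        total += c * math.log(p) + (1 - c) * math.log(1.0 - p)
    return total

# Hypothetical toy log and parameters.
click_log = [("a", 1, 1), ("a", 2, 0)]
theta = {1: 0.8, 2: 0.5}
gamma = {"a": 0.5}
print(pbm_log_likelihood(click_log, theta, gamma))

# Only the product theta * gamma enters, so rescaling theta by 0.9 and
# gamma by 1/0.9 leaves the value unchanged (up to floating-point error).
theta_scaled = {k: v * 0.9 for k, v in theta.items()}
gamma_scaled = {d: v / 0.9 for d, v in gamma.items()}
print(pbm_log_likelihood(click_log, theta_scaled, gamma_scaled))
```

Because the likelihood sees only products $θ_{k} γ_{d}$, the overall scale split between $θ$ and $γ$ is not pinned down by the log alone.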

Because $E_{k}$ is latent, parameters are fit by Expectation-Maximization:

E-step: for a non-click ($c_{s} = 0$) we infer the posterior that the rank was nonetheless examined (the click failed because the doc was irrelevant), and the posterior that the doc was relevant (the click failed because the rank was not examined): $P (E_{k_{s}} = 1 \mid c_{s} = 0) = \frac{θ_{k_{s}} (1 - γ_{d_{s}})}{1 - θ_{k_{s}} γ_{d_{s}}}$, $P (R_{d_{s}} = 1 \mid c_{s} = 0) = \frac{(1 - θ_{k_{s}}) γ_{d_{s}}}{1 - θ_{k_{s}} γ_{d_{s}}}$ (for a click, $c_{s} = 1$, both $E_{k_{s}} = 1$ and $R_{d_{s}} = 1$ are certain).
M-step: re-estimate each parameter as the average of its inferred posterior over the relevant sessions: $θ_{k} \leftarrow \frac{\sum_{s : k_{s} = k} [c_{s} + (1 - c_{s}) P (E_{k_{s}} = 1 \mid c_{s} = 0)]}{|\{s : k_{s} = k\}|}$, $γ_{d} \leftarrow \frac{\sum_{s : d_{s} = d} [c_{s} + (1 - c_{s}) P (R_{d_{s}} = 1 \mid c_{s} = 0)]}{|\{s : d_{s} = d\}|}$

Iterating E and M to convergence yields the propensities $θ_{k}$ used downstream.
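
As a sketch of where the E-step posterior comes from, Bayes' rule applied to a non-click gives the expression below. An examined rank yields no click only when the document is irrelevant, so $P (C_{d, k} = 0 \mid E_{k} = 1) = 1 - γ_{d}$. The numeric values used afterwards are illustrative assumptions, not fitted quantities.

$P (E_{k} = 1 \mid C_{d, k} = 0) = \frac{P (C_{d, k} = 0 \mid E_{k} = 1) \, P (E_{k} = 1)}{P (C_{d, k} = 0)} = \frac{(1 - γ_{d}) \, θ_{k}}{1 - θ_{k} γ_{d}}$

With $θ_{k} = 0.5$ and $γ_{d} = 0.5$ this evaluates to $0.25 / 0.75 = 1/3$: a non-click lowers the belief that the rank was examined from $1/2$ to $1/3$. The relevance posterior follows by the same argument with the roles of $θ_{k}$ and $γ_{d}$ exchanged.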

Key Properties / Variants

Two latent factors, one product: the entire model is $P (C_{d, k} = 1) = θ_{k} γ_{d}$; everything else (EM, IPW) is bookkeeping on top of this factorization.
Propensities for counterfactual LTR: the fitted $θ_{k}$ are the propensities whose reciprocals serve as weights in Inverse Propensity Weighting; a click at rank $k$ counts as $1/θ_{k}$ units of relevance evidence, which debiases Counterfactual Learning to Rank objectives.
Estimating $θ_{k}$ without full EM: propensities can also be recovered by result randomization (swap a document across ranks and watch how its click rate scales) or intervention harvesting from naturally occurring rank changes, avoiding the joint EM fit.
Identifiability caveat: if documents rarely change rank, the data cannot separate “clicked because examined” from “clicked because relevant”; multiple $(θ, γ)$ explain the log equally well. Randomization breaks this degeneracy.
Position-only assumption is the weakness: PBM ignores that earlier results affect later examination. When users scan and stop, the cascade model / Cascading Position Bias is the better-suited alternative.
Does not model trust or outlier effects: top ranks attracting extra clicks (Trust Bias) and visually distinctive items grabbing attention (Outlier Bias) both violate PBM’s clean factorization and require extended models.
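
The inverse-propensity idea in the list above can be sketched in a few lines. Under PBM the expected value of $c / θ_{k}$ is $γ_{d}$, so averaging the reweighted clicks gives a relevance estimate that does not depend on where the document was shown. The function name, the toy log, and the propensity values are assumptions chosen so the arithmetic is easy to follow.

```python
from collections import defaultdict

def ipw_relevance(click_log, theta):
    """Per-document relevance estimate: mean of click / theta[rank]."""
    sums = defaultdict(float)
    counts = defaultdict(int)
    for d, k, c in click_log:
        sums[d] += c / theta[k]   # a click at rank k counts as 1/theta_k
        counts[d] += 1
    return {d: sums[d] / counts[d] for d in sums}

# Hypothetical propensities: rank 1 is examined often, rank 3 rarely.
theta = {1: 0.8, 3: 0.2}

# Doc "a" always shown at rank 1 (4 clicks in 10 impressions),
# doc "b" always shown at rank 3 (1 click in 10 impressions).
click_log = (
    [("a", 1, 1)] * 4 + [("a", 1, 0)] * 6
    + [("b", 3, 1)] * 1 + [("b", 3, 0)] * 9
)

print(ipw_relevance(click_log, theta))
```

The raw click rates (0.4 versus 0.1) suggest that "a" is far more relevant, but after reweighting both documents receive the same estimate, because the gap is fully explained by the assumed propensities.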

Algorithm: PBM Parameter Estimation via EM
──────────────────────────────────────────────
Input: click log {(d_s, k_s, c_s)} over sessions s
Initialize θ_k, γ_d ∈ (0,1) for all ranks k, docs d
 
Repeat until convergence:
  # E-step: posteriors over latent E, R for non-clicks
  For each session s:
    if c_s == 1:
      P(E=1) ← 1 ;  P(R=1) ← 1
    else:                                  # c_s == 0
      denom    ← 1 - θ_{k_s} * γ_{d_s}
      P(E=1)   ← θ_{k_s} * (1 - γ_{d_s}) / denom
      P(R=1)   ← (1 - θ_{k_s}) * γ_{d_s} / denom
 
  # M-step: re-estimate as posterior averages
  For each rank k:
    θ_k ← mean over {s : k_s = k} of [ c_s + (1-c_s)*P(E=1)_s ]
  For each doc d:
    γ_d ← mean over {s : d_s = d} of [ c_s + (1-c_s)*P(R=1)_s ]
 
Return propensities {θ_k}  →  feed as 1/θ_k weights to IPW
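
The pseudocode above translates fairly directly into Python. The following is a minimal sketch rather than a tuned implementation: the function name, the fixed iteration count used in place of a convergence test, the uniform initialization, and the small guard on the denominator are all assumptions made for illustration.

```python
from collections import defaultdict

def fit_pbm_em(click_log, n_iters=200, init=0.5):
    """Fit PBM by EM. click_log: list of (doc, rank, click). Returns (theta, gamma)."""
    ranks = {k for _, k, _ in click_log}
    docs = {d for d, _, _ in click_log}
    theta = {k: init for k in ranks}
    gamma = {d: init for d in docs}

    for _ in range(n_iters):
        e_sum, e_cnt = defaultdict(float), defaultdict(int)
        r_sum, r_cnt = defaultdict(float), defaultdict(int)
        # E-step: posterior of examination and relevance for every session.
        for d, k, c in click_log:
            if c == 1:
                p_e, p_r = 1.0, 1.0   # a click implies both events occurred
            else:
                denom = max(1.0 - theta[k] * gamma[d], 1e-12)
                p_e = theta[k] * (1.0 - gamma[d]) / denom
                p_r = (1.0 - theta[k]) * gamma[d] / denom
            e_sum[k] += p_e
            e_cnt[k] += 1
            r_sum[d] += p_r
            r_cnt[d] += 1
        # M-step: each parameter becomes the average of its posteriors.
        theta = {k: e_sum[k] / e_cnt[k] for k in ranks}
        gamma = {d: r_sum[d] / r_cnt[d] for d in docs}
    return theta, gamma

def make_cell(doc, rank, clicks, impressions):
    """Hypothetical helper: expand click counts into individual sessions."""
    return [(doc, rank, 1)] * clicks + [(doc, rank, 0)] * (impressions - clicks)

# Hypothetical log in which every document appears at every rank.
click_log = (
    make_cell("a", 1, 72, 100) + make_cell("a", 2, 40, 100)
    + make_cell("b", 1, 36, 100) + make_cell("b", 2, 20, 100)
)
theta, gamma = fit_pbm_em(click_log)
print(theta, gamma)
```

Each document appears at both ranks in this toy log, which is the kind of rank variation the fit needs in order to separate position from relevance. Only the products $θ_{k} γ_{d}$ are constrained by the data, so the absolute scale of the returned values depends on the initialization.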

Connections

Instantiates: Examination Hypothesis (examination probability made a per-rank constant)
Member of: Click Models (simplest member of the family)
Quantifies: Position Bias via the propensities $θ_{k}$
Feeds: Inverse Propensity Weighting and Counterfactual Learning to Rank / Unbiased Learning to Rank
Contrasted with: cascade model / Cascading Position Bias (examination depends on items above)
Violated by: Trust Bias, Outlier Bias, Surrounding Item Bias
Robust alternative when assumptions break: Doubly Robust Estimation
