Position-Based Click Model

Definition

Position-Based Click Model (PBM)

The Position-Based Click Model is the simplest practical click model. It assumes a user clicks a document shown at rank iff the user examines rank and the document is relevant — and crucially that examination depends only on the rank , never on the document, the query, or the surrounding results. It is the operational instantiation of the Examination Hypothesis in which the examination probability is a per-rank constant.

Intuition

Think of each result position as having a fixed “visibility” determined purely by where it sits on the page. Rank 1 is almost always looked at; rank 8 is looked at far less. The PBM bakes this into a single number per rank, the propensity , and then treats whether the user clicks given that they looked as a clean measurement of relevance.

This factorization is what makes the model so useful: the two latent causes of a click — did they see it? (position) and did they like it? (relevance) — are assumed independent. Once you know the per-rank examination probabilities, an observed click becomes a noisy but unbiased-after-reweighting signal of relevance, which is exactly what IPW exploits.

The contrast to keep in mind: PBM says examination of rank is the same regardless of what is above it. The cascade model / Cascading Position Bias says the opposite — examination depends on the relevance of everything above .

Mathematical Formulation

A click random variable for document at rank factors into two independent Bernoulli events:

where:

  • — observed click on document displayed at rank
  • latent examination event for rank ; is the propensity
  • latent relevance of to query ;
  • — depends only on (the defining PBM assumption), monotonically decreasing in
  • — depends only on , never on

So the per-(document, rank) click probability is simply the product . The latent variables are unobserved; only is observed, which is what forces an EM-style inference.

Likelihood and EM estimation

Given a click log of sessions , each presenting document at rank with observed click , the data log-likelihood is:

Because is latent, parameters are fit by Expectation-Maximization:

  • E-step — for a non-click () we infer the posterior that the rank was nonetheless examined (the click failed because the doc was irrelevant): (for a click both and are certain).
  • M-step — re-estimate each parameter as the average of its inferred posterior over the relevant sessions:

Iterating E and M to convergence yields the propensities used downstream.

Key Properties / Variants

  • Two latent factors, one product — the entire model is ; everything else (EM, IPW) is bookkeeping on top of this factorization.
  • Propensities for counterfactual LTR — the fitted are exactly the inverse weights used by Inverse Propensity Weighting: a click at rank counts as units of relevance evidence, debiasing Counterfactual Learning to Rank objectives.
  • Estimating without full EM — propensities can also be recovered by result randomization (swap a document across ranks and watch how its click rate scales) or intervention harvesting from naturally occurring rank changes, avoiding the joint EM fit.
  • Identifiability caveat — if documents rarely change rank, the data cannot separate “clicked because examined” from “clicked because relevant”; multiple explain the log equally well. Randomization breaks this degeneracy.
  • Position-only assumption is the weakness — PBM ignores that earlier results affect later examination. When users scan-and-stop, the cascade model / Cascading Position Bias is the correct alternative.
  • Does not model trust or outlier effects — top ranks attracting extra clicks (Trust Bias) and visually distinctive items grabbing attention (Outlier Bias) both violate PBM’s clean factorization and require extended models.
Algorithm: PBM Parameter Estimation via EM
──────────────────────────────────────────────
Input: click log {(d_s, k_s, c_s)} over sessions s
Initialize θ_k, γ_d ∈ (0,1) for all ranks k, docs d
 
Repeat until convergence:
  # E-step: posteriors over latent E, R for non-clicks
  For each session s:
    if c_s == 1:
      P(E=1) ← 1 ;  P(R=1) ← 1
    else:                                  # c_s == 0
      denom    ← 1 - θ_{k_s} * γ_{d_s}
      P(E=1)   ← θ_{k_s} * (1 - γ_{d_s}) / denom
      P(R=1)   ← (1 - θ_{k_s}) * γ_{d_s} / denom
 
  # M-step: re-estimate as posterior averages
  For each rank k:
    θ_k ← mean over {s : k_s = k} of [ c_s + (1-c_s)*P(E=1)_s ]
  For each doc d:
    γ_d ← mean over {s : d_s = d} of [ c_s + (1-c_s)*P(R=1)_s ]
 
Return propensities {θ_k}  →  feed as 1/θ_k weights to IPW

Connections

Appears In