Implicit and Explicit Feedback

Definition

Implicit and Explicit Feedback

The two kinds of preference signal a Recommender System learns from.

Explicit feedback is a deliberate, stated judgement of preference: a 1–5 star rating, a thumbs up/down, a review score. The value is (roughly) on an interpretable preference scale, and a missing entry means “not rated”, i.e. genuinely unknown.

Implicit feedback is an indirect behavioural signal interpreted as preference: a click, play, watch-duration, purchase, skip, add-to-playlist. It is observed as a by-product of using the system. The signal is (usually) positive-only — we see what the user did, never an explicit “I dislike this”.

Both are recorded in the user–item Interaction Matrix $R$ with users $U = {u_{1}, \dots, u_{n}}$ and items $I = {i_{1}, \dots, i_{m}}$ , but the semantics of an observed value and of a missing value differ, which changes how we model and evaluate.

Intuition

What does a missing cell mean?

The crux of the distinction is the meaning of an unobserved entry $r_{u i}$ .

With explicit ratings, an unrated cell is missing data: the user simply has not told us. We should not treat it as a negative. Models are fit on the observed entries only.

With implicit feedback, there is no “negative” channel. A user who never clicked an item might (a) dislike it, or (b) never have seen it. Treating all non-interactions as negatives is wrong (most are unseen), but ignoring them entirely leaves only positive examples and the model collapses to predicting “everything is good”.

Practical reality (RS-L01 case studies): explicit ratings are scarce, expensive, and biased (few users rate; those who do are not a random sample). Implicit signals are abundant and cheap — every play, skip, and purchase is logged — which is why production systems (Spotify Daily Mix, bol.com deals, YouTube feed) run almost entirely on implicit feedback. The price is noise and ambiguity: a play could be a mis-click, a purchase could be a gift.

Mathematical Formulation

The same architecture can be trained on either signal; the loss function is what differs. RS-L01 frames Neural Collaborative Filtering as binary classification with label $y_{u i}$ and prediction $\overset{y}{^}_{u i}$ , and gives two losses:

Explicit feedback — (weighted) squared loss

$L_{exp} = \sum_{(u, i) \in O} w_{u i} (r_{u i} - \overset{r}{^}_{u i})^{2}$ where:

$O$ — set of observed (rated) user–item pairs; the sum runs over these only

$r_{u i}$ — the actual rating (e.g. on a 1–5 scale)

$\overset{r}{^}_{u i}$ — predicted rating (e.g. dot product $\overset{u}{ˉ}_{u} \cdot \overset{v}{ˉ}_{i}$ in Matrix Factorization, or NCF output)

$w_{u i}$ — optional per-entry weight (set $w_{u i} = 1$ for plain squared error)

Implicit feedback — binary cross-entropy with negative sampling

$L_{imp} = - \sum_{(u, i) \in O^{+}} lo g \overset{y}{^}_{u i} - \sum_{(u, j) \in O^{-}} lo g (1 - \overset{y}{^}_{u j})$ where:

$O^{+}$ — observed interactions, treated as positives ( $y_{u i} = 1$ )

$O^{-}$ — sampled unobserved pairs treated as negatives ( $y_{u j} = 0$ ); drawn by Negative Sampling to avoid summing over the huge set of all non-interactions

$\overset{y}{^}_{u i} \in (0, 1)$ — predicted probability that item $i$ is relevant to $u$ (sigmoid output)

An alternative for implicit data is to rank a positive above a sampled negative rather than classify each. This is Bayesian Personalized Ranking (BPR) (Rendle et al., 2012), the canonical implicit-feedback objective:

BPR — pairwise ranking from implicit feedback

$L_{BPR} = - \sum_{(u, i, j) \in D_{S}} lo g σ (\overset{r}{^}_{u i} - \overset{r}{^}_{u j})$ where:

$D_{S}$ — triples $(u, i, j)$ with $i$ an observed (positive) item for $u$ and $j$ a sampled unobserved item

$\overset{r}{^}_{u i}, \overset{r}{^}_{u j}$ — model scores for the positive and negative item

$σ (\cdot)$ — sigmoid; the loss pushes the positive’s score above the negative’s, encoding " $i ≻_{u} j$ " rather than fitting an absolute value

Key Properties / Variants

Missing-data semantics (the central exam point):
- Explicit → missing = unknown; fit on observed entries only; a regression/rating-prediction task evaluated with error metrics (the implicit assumption being “missing at random”, which is itself questionable).
- Implicit → missing = unlabeled (mostly unseen, some disliked); a ranking / one-class task; you must synthesise negatives (Negative Sampling) because there is no negative channel.
Task framing. Explicit feedback naturally suits rating prediction ( $\overset{r}{^}_{u i}$ , evaluated by RMSE/MAE). Implicit feedback naturally suits Top-K Recommendation / Next-Item Prediction, evaluated by ranking metrics — Recall/HR@K, MRR, NDCG (RS-L01/RS-L02). The modern course default is implicit + ranking.
Confidence weighting. Implicit signals carry strength: watching a film twice or purchasing is stronger evidence than a single click. A common trick is to weight the positive term by an interaction-derived confidence $c_{u i}$ (e.g. $c_{u i} = 1 + α count_{u i}$ ), the implicit analogue of $w_{u i}$ above.
Bias. Implicit logs are not an unbiased sample of preference: they are filtered through what the system already exposed and through Position Bias (RS-L02 motivates simulation and debiasing precisely because logged implicit data is biased). Explicit ratings carry selection bias (users rate what they feel strongly about).
Where each model sits:
- Matrix Factorization / user-based rating prediction — classically explicit (squared loss on observed ratings).
- Neural Collaborative Filtering — either, via the loss switch above.
- Bayesian Personalized Ranking (BPR), GRU4Rec (BPR loss), SASRec (BCE + negatives), BERT4Rec (masked CE over the catalogue) — implicit / sequential, learned from interaction order.
- Generative Recommendation (TIGER, OneRec) — implicit by construction: trained on the sequence of items the user actually interacted with, decoding the next item’s identifier.

Choosing the objective from the feedback type
─────────────────────────────────────────────
if feedback is EXPLICIT (ratings / scores):
    task   = rating prediction
    train  = squared loss over OBSERVED entries only      (do NOT impute 0)
    eval   = RMSE / MAE   (and/or rank the predicted ratings)
else feedback is IMPLICIT (clicks / plays / purchases):
    positives = observed interactions
    negatives = SAMPLE from unobserved pairs              (negative sampling)
    train  = BCE  (pointwise)   or   BPR  (pairwise ranking)
    weight positives by confidence c_ui if signal strength varies
    eval   = ranking metrics: Recall/HR@K, MRR, NDCG
    beware: logged implicit data is BIASED (exposure / position)

Connections

Recorded in: Interaction Matrix / User-Item Interaction
Trained via: Negative Sampling, Bayesian Personalized Ranking (BPR), Matrix Factorization, Neural Collaborative Filtering
Drives task choice: Top-K Recommendation vs rating prediction; Next-Item Prediction in Sequential Recommendation
Evaluated with: Recall, Hit Rate, MRR, NDCG (ranking) for implicit; error metrics for explicit
Confounded by: Position Bias, Popularity Bias (implicit logs are not unbiased preference samples)
Underlies: Collaborative Filtering, GRU4Rec, SASRec, BERT4Rec, Generative Recommendation

Study Notes

Explorer

Implicit and Explicit Feedback

Implicit and Explicit Feedback

Definition

Intuition

Mathematical Formulation

Key Properties / Variants

Connections

Appears In

Graph View

Table of Contents

Backlinks