ColBERT
ColBERT
ColBERT (Contextualized Late Interaction over BERT) is a retrieval model that combines the efficiency of Bi-Encoder (pre-computed document representations) with fine-grained token-level matching via late interaction.
MaxSim Scoring
For each query token, find its best-matching document token (max similarity), then sum across all query tokens.
Key properties:
- Documents encoded independently → can be pre-computed and indexed
- Token-level representations (not single-vector) → richer matching than standard bi-encoder
- Uses Approximate Nearest Neighbor search with deferred interaction
- Much faster than cross-encoder, more effective than single-vector bi-encoder