Vector Space Model
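The vector space model (VSM) represents documents and queries as vectors in a high-dimensional term space. Each dimension corresponds to a term in the vocabulary, and each component holds that term's weight, typically TF-IDF. Documents are ranked by the cosine similarity between their vectors and the query vector.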
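As a rough illustration of the pipeline described above, the following sketch builds sparse TF-IDF vectors and ranks documents against a query. The function names (`tfidf_vectors`, `cosine`, `rank`), the whitespace tokenization, the toy documents, and the unsmoothed weighting `tf * log(N / df)` are all assumptions made for this example; real systems use many TF-IDF variants.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return sparse TF-IDF vectors (term -> weight) and the IDF table.

    `docs` is a list of token lists. Weighting is raw tf * log(N / df),
    one common variant chosen here purely for illustration.
    """
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    idf = {term: math.log(n / count) for term, count in df.items()}
    vectors = [{t: tf * idf[t] for t, tf in Counter(doc).items()} for doc in docs]
    return vectors, idf


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # an all-zero vector matches nothing
    return dot / (norm_u * norm_v)


def rank(query, docs):
    """Return (score, doc_index) pairs sorted from best to worst match."""
    vectors, idf = tfidf_vectors(docs)
    # Query terms that never occur in the collection are ignored.
    q = {t: tf * idf[t] for t, tf in Counter(query).items() if t in idf}
    scores = [(cosine(q, v), i) for i, v in enumerate(vectors)]
    return sorted(scores, reverse=True)


docs = [
    "the cat sat on the mat".split(),
    "the dog chased the cat".split(),
    "stock markets fell sharply".split(),
]
ranking = rank("cat mat".split(), docs)
for score, index in ranking:
    print(f"doc {index}: {score:.3f}")
```

In this toy collection the first document contains both query terms and ranks highest, the second matches only "cat" and scores lower, and the third shares no terms with the query and scores zero.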
Cosine Similarity
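For reference, the standard definition of cosine similarity between a query vector and a document vector can be written as follows. The symbols are notation chosen for this note: q and d are the query and document vectors, and w_{t,q} and w_{t,d} are the weights of term t in each.

```latex
\[
\cos(\mathbf{q}, \mathbf{d})
  = \frac{\mathbf{q} \cdot \mathbf{d}}{\lVert \mathbf{q} \rVert \, \lVert \mathbf{d} \rVert}
  = \frac{\sum_{t} w_{t,q} \, w_{t,d}}
         {\sqrt{\sum_{t} w_{t,q}^{2}} \; \sqrt{\sum_{t} w_{t,d}^{2}}}
\]
```

With non-negative weights such as TF-IDF, the value lies between 0 (no shared terms) and 1 (vectors pointing in the same direction).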
Properties:
- Handles partial matching (unlike Boolean retrieval)
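- Length normalization via cosine: the score depends on a vector's direction, not its magnitude, so longer documents are not favored merely for their length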
- Foundation extended by BM25 and Dense Retrieval (learned dense vectors)
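The first two properties can be checked with a small sketch. To keep it short it uses raw term counts rather than TF-IDF weights, and the helper names (`cosine`, `boolean_and`) and the example strings are hypothetical, introduced only for this illustration.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two token lists, using raw term counts."""
    u, v = Counter(a), Counter(b)
    dot = sum(count * v[term] for term, count in u.items())
    norm = math.sqrt(sum(c * c for c in u.values())) * math.sqrt(
        sum(c * c for c in v.values())
    )
    return dot / norm if norm else 0.0


def boolean_and(query, doc):
    """Strict Boolean AND: every query term must appear in the document."""
    return set(query) <= set(doc)


query = "vector space retrieval".split()
doc = "the vector space model".split()

# Partial matching: the document lacks "retrieval", so Boolean AND rejects
# it, while cosine similarity still assigns a positive score.
strict = boolean_and(query, doc)
partial = cosine(query, doc)

# Length normalization: repeating the document doubles every term count
# but leaves the direction of its vector, and hence the cosine, unchanged.
doubled = cosine(query, doc * 2)

print(strict, round(partial, 3), round(doubled, 3))
```

The Boolean check rejects the document outright, whereas the cosine score stays positive and is the same for the original and the doubled document.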