Vector Space Model

Vector Space Model

The VSM represents documents and queries as vectors in a high-dimensional term space. Each dimension corresponds to a term; weights are typically TF-IDF. Similarity is computed via cosine similarity.

Cosine Similarity

Properties:

  • Handles partial matching (unlike Boolean retrieval)
  • Length normalization via cosine
  • Foundation extended by BM25 and Dense Retrieval (learned dense vectors)

Appears In