Pooling
Pooling
Pooling is a technique used to create relevance judgments (Qrels) for large-scale document collections without judging every single document. It involves taking the top- results from multiple diverse retrieval systems and forming a “pool” for human judges to evaluate.
The Process
- Submit Queries: Run the same set of queries through different retrieval systems.
- Collect Top-k: Take the top (e.g., ) documents returned by each system.
- Union: Create a unique set of documents from the union of these results.
- Judge: Human annotators judge only the documents in this union.
- Assumption: Any document not in the pool is assumed to be not relevant.
Why it works
It is impossible to judge millions of documents for every query. Pooling assumes that if a document is highly relevant, at least one of the many retrieval systems will find it and place it in its top- list.
Evaluation and Bias
- Unjudged = Non-relevant: This is the core assumption. If a new system finds a relevant document that wasn’t in the original pool, that system might be unfairly penalized (its precision will look lower than it is).
- Leave-One-Out (LOO) Tests: Used to check if the pool is “fair” by removing one system’s contributions and seeing if the rankings of other systems change.
Connections
- Necessary for: Cranfield Paradigm in large collections like TREC.
- Relates to: Precision and Recall, MAP (which are calculated based on these judgments).