Cross-encoder reranker trained on the MS MARCO passage retrieval dataset, designed to score query-document pairs jointly rather than encoding them independently. Distilled from a 12-layer cross-encoder into 6 layers to reduce latency while retaining re-ranking accuracy. Used as a second-stage ranker on top of fast first-stage retrieval (BM25 or bi-encoder).
78,976,309 ↓ · 267 ♡
A 4-layer MiniLM cross-encoder fine-tuned on MS MARCO passage ranking for fast query-document relevance scoring. Offers the lowest latency in the MS MARCO cross-encoder series at the cost of some ranking accuracy.
3,414,783 ↓ · 16 ♡
The 12-layer variant of MiniLM fine-tuned on MS MARCO, providing the highest accuracy in the MiniLM cross-encoder family at the cost of latency. A strong default choice for reranking when compute allows.
2,662,662 ↓ · 106 ♡
gte-reranker-modernbert-base is a cross-encoder for text ranking. It produces fine-grained relevance judgments at the cost of higher per-pair inference latency.
1,832,291 ↓ · 95 ♡
jina-reranker-v2-base-multilingual is a cross-encoder for text ranking. It produces fine-grained relevance judgments at the cost of higher per-pair inference latency.
1,570,802 ↓ · 351 ♡
Qwen3-Reranker-0.6B ranks candidate passages against a query by fully attending to both strings in a single forward pass, making it more accurate but slower than bi-encoders.
1,533,311 ↓ · 365 ♡
Qwen3-Reranker-4B ranks candidate passages against a query by fully attending to both strings in a single forward pass, making it more accurate but slower than bi-encoders.
1,496,645 ↓ · 143 ♡
mmarco-mMiniLMv2-L12-H384-v1 is a cross-encoder for text ranking. It produces fine-grained relevance judgments at the cost of higher per-pair inference latency.
1,429,256 ↓ · 75 ♡
ms-marco-MiniLM-L2-v2 is a cross-encoder for text ranking. It produces fine-grained relevance judgments at the cost of higher per-pair inference latency.
1,257,135 ↓ · 14 ♡
Qwen3-Reranker-8B ranks candidate passages against a query by fully attending to both strings in a single forward pass, making it more accurate but slower than bi-encoders.
1,194,253 ↓ · 239 ♡
jina-reranker-v3 is a cross-encoder for text ranking. It produces fine-grained relevance judgments at the cost of higher per-pair inference latency.
1,008,955 ↓ · 138 ♡
mxbai-rerank-base-v1 ranks candidate passages against a query by fully attending to both strings in a single forward pass, making it more accurate but slower than bi-encoders.
803,502 ↓ · 46 ♡
Qwen3-VL-Reranker-8B is Qwen's multimodal cross-encoder reranker built on the Qwen3-VL-8B architecture, capable of reranking both text-only and text+image query-document pairs. It is trained with contrastive reranking objectives to score relevance between a query and a set of retrieved passages or image-text chunks, making it the first VLM-based reranker in the MTEB ecosystem.
741,676 ↓ · 152 ♡
mxbai-rerank-xsmall-v1 ranks candidate passages against a query by fully attending to both strings in a single forward pass, making it more accurate but slower than bi-encoders.
583,024 ↓ · 57 ♡
crossencoder-camembert-base-mmarcoFR is an open-source text-ranking model available on HuggingFace. Details are sourced from the public model registry.
477,058 ↓ · 7 ♡
rank1-7b is an open-source text-ranking model available on HuggingFace. Details are sourced from the public model registry.
444,839 ↓ · 4 ♡
llama-nemotron-rerank-1b-v2 ranks candidate passages against a query by fully attending to both strings in a single forward pass, making it more accurate but slower than bi-encoders.
357,535 ↓ · 52 ♡
Qwen3-VL-Reranker-2B scores query-document pairs for relevance. Used as a cross-encoder reranker, it jointly encodes the pair and outputs a single relevance score.
318,777 ↓ · 194 ♡
mxbai-rerank-large-v2 is Mixedbread AI's cross-encoder reranker built on Qwen2, covering 100+ languages. Rerankers score query-document pairs for relevance, improving retrieval quality when used after a fast bi-encoder retrieval step. Apache-2.0 licensed with text-embeddings-inference compatibility.
314,720 ↓ · 140 ♡
A sequence-classification-format wrapper of the Qwen3-Reranker-0.6B by Tom Aarsen (sentence-transformers maintainer), making it compatible with the CrossEncoder class without generative decoding. This format is faster for reranking than the generative format.
306,747 ↓ · 30 ♡
A 2-layer TinyBERT cross-encoder fine-tuned on MS MARCO passage ranking by the sentence-transformers team. Extremely small and fast, it trades accuracy for speed in reranking pipelines where a larger cross-encoder would be the throughput bottleneck.
300,817 ↓ · 44 ♡
mxbai-rerank-large-v1 is an open-source text-ranking model available on HuggingFace. Details are sourced from the public model registry.
299,638 ↓ · 142 ♡
japanese-reranker-cross-encoder-small-v1 is an open-source text-ranking model available on HuggingFace. Details are sourced from the public model registry.
296,854 ↓ · 5 ♡