AI Tools.

Search

text ranking models

23 models · ranked by HuggingFace downloads

ms-marco-MiniLM-L6-v2

Cross-encoder reranker trained on the MS MARCO passage retrieval dataset, designed to score query-document pairs jointly rather than encoding them independently. Distilled from a 12-layer cross-encoder into 6 layers to reduce latency while retaining re-ranking accuracy. Used as a second-stage ranker on top of fast first-stage retrieval (BM25 or bi-encoder).

78,976,309 ↓ · 267 ♡

ms-marco-MiniLM-L4-v2

A 4-layer MiniLM cross-encoder fine-tuned on MS MARCO passage ranking for fast query-document relevance scoring. Offers the lowest latency in the MS MARCO cross-encoder series at the cost of some ranking accuracy.

3,414,783 ↓ · 16 ♡

ms-marco-MiniLM-L12-v2

The 12-layer variant of MiniLM fine-tuned on MS MARCO, providing the highest accuracy in the MiniLM cross-encoder family at the cost of latency. A strong default choice for reranking when compute allows.

2,662,662 ↓ · 106 ♡

gte-reranker-modernbert-base

gte-reranker-modernbert-base is a cross-encoder for text ranking. It produces fine-grained relevance judgments at the cost of higher per-pair inference latency.

1,832,291 ↓ · 95 ♡

jina-reranker-v2-base-multilingual

jina-reranker-v2-base-multilingual is a cross-encoder for text ranking. It produces fine-grained relevance judgments at the cost of higher per-pair inference latency.

1,570,802 ↓ · 351 ♡

Qwen3-Reranker-0.6B

Qwen3-Reranker-0.6B ranks candidate passages against a query by fully attending to both strings in a single forward pass, making it more accurate but slower than bi-encoders.

1,533,311 ↓ · 365 ♡

Qwen3-Reranker-4B

Qwen3-Reranker-4B ranks candidate passages against a query by fully attending to both strings in a single forward pass, making it more accurate but slower than bi-encoders.

1,496,645 ↓ · 143 ♡

mmarco-mMiniLMv2-L12-H384-v1

mmarco-mMiniLMv2-L12-H384-v1 is a cross-encoder for text ranking. It produces fine-grained relevance judgments at the cost of higher per-pair inference latency.

1,429,256 ↓ · 75 ♡

ms-marco-MiniLM-L2-v2

ms-marco-MiniLM-L2-v2 is a cross-encoder for text ranking. It produces fine-grained relevance judgments at the cost of higher per-pair inference latency.

1,257,135 ↓ · 14 ♡

Qwen3-Reranker-8B

Qwen3-Reranker-8B ranks candidate passages against a query by fully attending to both strings in a single forward pass, making it more accurate but slower than bi-encoders.

1,194,253 ↓ · 239 ♡

jina-reranker-v3

jina-reranker-v3 is a cross-encoder for text ranking. It produces fine-grained relevance judgments at the cost of higher per-pair inference latency.

1,008,955 ↓ · 138 ♡

mxbai-rerank-base-v1

mxbai-rerank-base-v1 ranks candidate passages against a query by fully attending to both strings in a single forward pass, making it more accurate but slower than bi-encoders.

803,502 ↓ · 46 ♡

Qwen3-VL-Reranker-8B

Qwen3-VL-Reranker-8B is Qwen's multimodal cross-encoder reranker built on the Qwen3-VL-8B architecture, capable of reranking both text-only and text+image query-document pairs. It is trained with contrastive reranking objectives to score relevance between a query and a set of retrieved passages or image-text chunks, making it the first VLM-based reranker in the MTEB ecosystem.

741,676 ↓ · 152 ♡

mxbai-rerank-xsmall-v1

mxbai-rerank-xsmall-v1 ranks candidate passages against a query by fully attending to both strings in a single forward pass, making it more accurate but slower than bi-encoders.

583,024 ↓ · 57 ♡

crossencoder-camembert-base-mmarcoFR

crossencoder-camembert-base-mmarcoFR is an open-source text-ranking model available on HuggingFace. Details are sourced from the public model registry.

477,058 ↓ · 7 ♡

rank1-7b

rank1-7b is an open-source text-ranking model available on HuggingFace. Details are sourced from the public model registry.

444,839 ↓ · 4 ♡

llama-nemotron-rerank-1b-v2

llama-nemotron-rerank-1b-v2 ranks candidate passages against a query by fully attending to both strings in a single forward pass, making it more accurate but slower than bi-encoders.

357,535 ↓ · 52 ♡

Qwen3-VL-Reranker-2B

Qwen3-VL-Reranker-2B scores query-document pairs for relevance. Used as a cross-encoder reranker, it jointly encodes the pair and outputs a single relevance score.

318,777 ↓ · 194 ♡

mxbai-rerank-large-v2

mxbai-rerank-large-v2 is Mixedbread AI's cross-encoder reranker built on Qwen2, covering 100+ languages. Rerankers score query-document pairs for relevance, improving retrieval quality when used after a fast bi-encoder retrieval step. Apache-2.0 licensed with text-embeddings-inference compatibility.

314,720 ↓ · 140 ♡

Qwen3-Reranker-0.6B-seq-cls

A sequence-classification-format wrapper of the Qwen3-Reranker-0.6B by Tom Aarsen (sentence-transformers maintainer), making it compatible with the CrossEncoder class without generative decoding. This format is faster for reranking than the generative format.

306,747 ↓ · 30 ♡

ms-marco-TinyBERT-L2-v2

A 2-layer TinyBERT cross-encoder fine-tuned on MS MARCO passage ranking by the sentence-transformers team. Extremely small and fast, it trades accuracy for speed in reranking pipelines where a larger cross-encoder would be the throughput bottleneck.

300,817 ↓ · 44 ♡

mxbai-rerank-large-v1

mxbai-rerank-large-v1 is an open-source text-ranking model available on HuggingFace. Details are sourced from the public model registry.

299,638 ↓ · 142 ♡

japanese-reranker-cross-encoder-small-v1

japanese-reranker-cross-encoder-small-v1 is an open-source text-ranking model available on HuggingFace. Details are sourced from the public model registry.

296,854 ↓ · 5 ♡