all-MiniLM-L6-v2 vs all-mpnet-base-v2

all-MiniLM-L6-v2 and all-mpnet-base-v2 are both sentence-similarity models. See each entry for specifics.

all-MiniLM-L6-v2

Pipeline: sentence similarity
Downloads: 239,973,503
Likes: 4,754

Distilled BERT model that encodes sentences into 384-dimensional vectors for measuring semantic similarity. Trained on over a billion sentence pairs spanning scientific papers, web QA, NLI datasets, and community forums. At 22M parameters and 6 transformer layers, it is fast enough for CPU inference while remaining competitive on standard sentence similarity benchmarks.

all-mpnet-base-v2

Pipeline: sentence similarity
Downloads: 36,513,639
Likes: 1,287

Sentence embedding model based on the MPNet architecture, producing 768-dimensional vectors. Trained on over a billion sentence pairs from MS MARCO, NLI datasets, and community QA forums, it is frequently used when accuracy matters more than inference speed among English embedding models. The MPNet backbone enables masked and permuted prediction during pre-training for stronger representations.

Key differences

See individual model pages for architecture and use cases.

Common ground

Both are open-source models on HuggingFace.

Which should you pick?

Pick based on your compute budget and specific task requirements.