
bge-small-en-v1.5 vs multilingual-e5-large

bge-small-en-v1.5 and multilingual-e5-large are both open-source text embedding (feature-extraction) models. Details for each are below.

bge-small-en-v1.5

Pipeline
feature extraction
Downloads
34,386,222
Likes
451

Small English dense embedding model from BAAI's BGE (BAAI General Embedding) series, producing 384-dimensional vectors; MIT licensed. Its retrieval-focused training strategy yields MTEB retrieval scores that are competitive for its parameter count. Suited for embedding workflows where throughput and cost matter more than peak accuracy.
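To illustrate how the 384-dimensional vectors this model produces are typically used, here is a minimal retrieval sketch. The embedding values below are made-up stand-ins (a real pipeline would obtain them from the model, e.g. via the sentence-transformers library); only the cosine-similarity ranking logic is shown.

```python
from math import sqrt

def cosine(a, b):
    # Cosine similarity between two equal-length vectors.
    dot = sum(x * y for x, y in zip(a, b))
    na = sqrt(sum(x * x for x in a))
    nb = sqrt(sum(y * y for y in b))
    return dot / (na * nb)

# Hypothetical embeddings standing in for bge-small-en-v1.5 output
# (the real model returns 384-dim vectors; 3 dims shown for brevity).
query_vec = [0.9, 0.1, 0.2]
doc_vecs = {
    "doc_a": [0.8, 0.2, 0.1],
    "doc_b": [0.1, 0.9, 0.3],
}

# Rank documents by similarity to the query vector.
ranked = sorted(doc_vecs, key=lambda d: cosine(query_vec, doc_vecs[d]), reverse=True)
print(ranked[0])  # doc_a is closest to the query
```

The same ranking step applies to either model in this comparison; only the embedding dimension and the input formatting differ.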

multilingual-e5-large

Pipeline
feature extraction
Downloads
7,225,099
Likes
1,186

Multilingual-E5-Large is a 560-million-parameter multilingual embedding model from Microsoft Research, supporting 100+ languages via an XLM-RoBERTa backbone. Trained with the E5 contrastive recipe, it expects a 'query: ' or 'passage: ' prefix on every input text, and it achieves strong MTEB multilingual retrieval scores. MIT licensed with ONNX and OpenVINO export.
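The prefix convention matters in practice: queries and passages embedded without the expected prefix tend to retrieve poorly. A small sketch of the input formatting (the helper name is ours for illustration, not part of any E5 API):

```python
def format_e5_input(text: str, is_query: bool) -> str:
    # multilingual-e5-large expects a role prefix on every input:
    # "query: " for search queries, "passage: " for documents.
    prefix = "query: " if is_query else "passage: "
    return prefix + text

print(format_e5_input("how tall is the Eiffel Tower?", is_query=True))
# query: how tall is the Eiffel Tower?
```

The prefixed strings are then passed to the model as ordinary inputs; the prefix is part of the text, not a separate parameter.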

Key differences

  • Size and language coverage: bge-small-en-v1.5 is a small English-only model producing 384-dimensional vectors; multilingual-e5-large is a 560M-parameter model covering 100+ languages.
  • Input formatting: multilingual-e5-large requires 'query: ' / 'passage: ' prefixes on inputs; bge-small-en-v1.5 does not.

Common ground

  • Both are MIT-licensed, open-source feature-extraction (embedding) models hosted on Hugging Face.

Which should you pick?

Pick bge-small-en-v1.5 for English-only workloads where throughput and cost matter more than peak accuracy. Pick multilingual-e5-large when you need coverage across 100+ languages and can afford the larger 560M-parameter model.