bge-base-en-v1.5 vs multilingual-e5-large

bge-base-en-v1.5 and multilingual-e5-large are both embedding models served under the feature-extraction pipeline. Each entry below covers the specifics.

bge-base-en-v1.5

Pipeline
feature extraction
Downloads
8,365,829
Likes
414

BGE-Base-EN-v1.5 is BAAI's mid-tier English embedding model in the v1.5 series, producing 768-dimensional vectors. It balances accuracy and compute cost between the small (384d) and large (1024d) variants, making it a practical default for English retrieval tasks where storage and inference overhead of the large model are undesirable. MIT licensed with ONNX export.
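The storage trade-off between the three v1.5 variants is easy to quantify. A minimal sketch, assuming float32 vectors (a common default; quantized indexes would be smaller):

```python
# Rough index-size math for float32 embeddings across the BGE v1.5
# variants: small (384d), base (768d), large (1024d).
def index_size_mb(num_docs: int, dim: int, bytes_per_float: int = 4) -> float:
    """Return approximate raw vector storage in megabytes."""
    return num_docs * dim * bytes_per_float / 1e6

for dim in (384, 768, 1024):  # small / base / large
    print(f"{dim}d: {index_size_mb(1_000_000, dim):.0f} MB per million docs")
# 768d sits at 3072 MB per million documents, half of what a
# hypothetical 1024d large-model index would need over the small one.
```

So moving from base to large grows every index by a third; whether the accuracy gain justifies that depends on your corpus size and latency budget.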

multilingual-e5-large

Pipeline
feature extraction
Downloads
7,225,099
Likes
1,186

Multilingual-E5-Large is a 560-million-parameter multilingual embedding model from Microsoft Research, supporting 100+ languages via an XLM-RoBERTa backbone. Trained with E5's prefix convention (inputs must be prepended with 'query:' or 'passage:'), it achieves strong MTEB multilingual retrieval scores. MIT licensed with ONNX and OpenVINO export.
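The prefix convention matters in practice: queries and passages are embedded with different prefixes, then compared by cosine similarity. A minimal sketch of the flow, where `embed()` is a random stand-in for the real model (it only mimics the unit-normalized 1024-dimensional output, an assumption about the model's vector size):

```python
import numpy as np

def add_e5_prefix(text: str, is_query: bool) -> str:
    """Apply E5's required input prefixes before embedding."""
    return ("query: " if is_query else "passage: ") + text

rng = np.random.default_rng(0)

def embed(text: str) -> np.ndarray:
    # Stand-in for multilingual-e5-large: in real use, this would run the
    # model; here we return a random unit-normalized 1024-d vector.
    v = rng.standard_normal(1024)
    return v / np.linalg.norm(v)

query_vec = embed(add_e5_prefix("how to brew coffee", is_query=True))
passage_vec = embed(add_e5_prefix("Pour hot water over ground beans.", is_query=False))

# With unit-normalized vectors, the dot product IS the cosine similarity.
score = float(query_vec @ passage_vec)
```

Skipping the prefixes silently degrades retrieval quality, since the model was trained to see them; wire them into your pipeline rather than relying on callers to remember.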

Key differences

  • Language coverage: bge-base-en-v1.5 is English-only; multilingual-e5-large supports 100+ languages.
  • Size: bge-base-en-v1.5 is a mid-tier model producing 768-dimensional vectors; multilingual-e5-large has 560M parameters on an XLM-RoBERTa backbone.
  • Input format: multilingual-e5-large requires 'query:' / 'passage:' prefixes on its inputs; bge-base-en-v1.5 does not use E5's prefix scheme.

Common ground

  • Both are open-source, MIT-licensed embedding models on HuggingFace.
  • Both serve the feature-extraction pipeline and ship ONNX exports.
  • Both are heavily used, with over 7 million downloads each.

Which should you pick?

If your corpus is English-only and you want lower storage and inference cost, bge-base-en-v1.5 is the pragmatic default. If you need non-English or cross-lingual retrieval, multilingual-e5-large is the stronger choice despite its larger footprint; just remember to prepend its 'query:' / 'passage:' prefixes.