Update README.md
README.md
nomic-embed-text-v2-moe is a SoTA multilingual MoE text embedding model:
- **High Performance**: SoTA multilingual performance compared to ~300M parameter models, competitive with models 2x its size
- **Multilinguality**: Supports ~100 languages and is trained on over 1.6B pairs
- **Flexible Embedding Dimension**: Trained with [Matryoshka Embeddings](https://arxiv.org/abs/2205.13147), enabling 3x reductions in storage cost with minimal performance degradation
- **Fully Open-Source**: Model weights, [code](https://github.com/nomic-ai/contrastors), and training data (see code repo) released

| Model | Params (M) | Emb Dim | BEIR | MIRACL | Pretrain Data | Finetune Data | Code |
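The Matryoshka property above means embeddings can simply be truncated to a prefix of their dimensions and re-normalized. A minimal sketch, assuming a 768-dimensional output truncated to 256 dimensions (the 3x storage saving); the random array stands in for real model output:

```python
import numpy as np

# Stand-in for real model output; the array values here are random,
# only the shape (batch of 4, 768 dims) mirrors the model's embeddings.
emb = np.random.randn(4, 768).astype(np.float32)

# Matryoshka-trained models keep most of their quality when embeddings
# are truncated to a prefix of the dimensions: 768 -> 256 is 3x smaller.
truncated = emb[:, :256].copy()

# Re-normalize so cosine similarity remains meaningful after truncation.
truncated /= np.linalg.norm(truncated, axis=1, keepdims=True)
```

Downstream search indexes then store only the 256-dimensional vectors.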
The model can be used through SentenceTransformers and Transformers.

For best performance on GPU, please install:

```bash
pip install torch transformers einops git+https://github.com/nomic-ai/megablocks.git
```

**Important**: the text prompt *must* include a *task instruction prefix*, instructing the model which task is being performed.

For queries/questions, please use the `search_query: ` prefix, and use `search_document: ` for the corresponding documents.
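A minimal sketch of applying the prefix rule before encoding; the helper function and example strings are illustrative, not part of the model's API:

```python
def add_task_prefix(texts, task):
    """Prepend the task instruction prefix the model expects on every input."""
    prefixes = {"query": "search_query: ", "document": "search_document: "}
    return [prefixes[task] + t for t in texts]

# Queries get the search_query prefix, documents get search_document.
queries = add_task_prefix(["What is multilingual retrieval?"], "query")
docs = add_task_prefix(["nomic-embed-text-v2-moe supports ~100 languages."], "document")
```

The prefixed strings are what you then pass to the model, e.g. via `SentenceTransformer(...).encode(queries)`.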