---
datasets:
- quarkss/stsb-indo-mt
- rzkamalia/stsb-indo-mt-modified
language:
- id
base_model:
- sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2
tags:
- Sentence Similarity
- text-embeddings-inference
- feature-extraction
---
# Fine-tuned sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 on an Indonesian dataset
This is a sentence-transformers model fine-tuned from [sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2](https://huggingface.co/sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2) on an Indonesian dataset in CSV format, [rzkamalia/stsb-indo-mt-modified](https://huggingface.co/datasets/rzkamalia/stsb-indo-mt-modified).
## Usage
### Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
```bash
pip install -U sentence-transformers
```
Then you can load this model and run inference.
```python
from sentence_transformers import SentenceTransformer
# Download the model from the Hugging Face Hub
model = SentenceTransformer("rzkamalia/fine-tune-paraphrase-multilingual-MiniLM-L12-v2-version-2")
# Encode the sentences into embeddings
sentences = [
    '...',
    '...',
    '...',
]
embeddings = model.encode(sentences)
# Compute pairwise similarity scores between the embeddings
similarities = model.similarity(embeddings, embeddings)
```
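To make the similarity scores easier to interpret, here is a minimal sketch of pairwise cosine similarity, which is the measure `model.similarity` typically uses unless the model is configured otherwise. The function name `cosine_similarity_matrix` and the toy vectors are hypothetical and only for illustration. Real embeddings from the model are much longer, and this is not the library's own implementation.

```python
import math


def cosine_similarity_matrix(embeddings):
    """Return pairwise cosine similarities for a list of equal-length vectors."""
    # Euclidean length of each vector
    norms = [math.sqrt(sum(x * x for x in vec)) for vec in embeddings]
    matrix = []
    for i, a in enumerate(embeddings):
        row = []
        for j, b in enumerate(embeddings):
            dot = sum(x * y for x, y in zip(a, b))
            # Cosine similarity: dot product divided by the product of the lengths
            row.append(dot / (norms[i] * norms[j]))
        matrix.append(row)
    return matrix


# Hypothetical 3-dimensional vectors standing in for real sentence embeddings
toy_embeddings = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [1.0, 1.0, 0.0],
]
scores = cosine_similarity_matrix(toy_embeddings)
```

Each entry `scores[i][j]` compares sentence `i` with sentence `j`. The diagonal is 1 because every vector is identical to itself, and the matrix is symmetric. Values closer to 1 indicate more similar sentences.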