|
--- |
|
datasets: |
|
- quarkss/stsb-indo-mt |
|
- rzkamalia/stsb-indo-mt-modified |
|
language: |
|
- id |
|
base_model: |
|
- sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 |
|
tags: |
|
- Sentence Similarity |
|
- text-embeddings-inference |
|
- feature-extraction |
|
--- |
|
# Fine-tune sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 using Indonesia dataset |
|
|
|
This is a sentence-transformers model fine-tuned from [sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2](https://huggingface.co/sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2) using the CSV dataset. The dataset source is [rzkamalia/stsb-indo-mt-modified](https://huggingface.co/datasets/rzkamalia/stsb-indo-mt-modified). |
|
|
|
## Usage |
|
|
|
### Direct Usage (Sentence Transformers) |
|
|
|
First install the Sentence Transformers library: |
|
|
|
```bash |
|
pip install -U sentence-transformers |
|
``` |
|
|
|
Then you can load this model and run inference. |
|
```python |
|
from sentence_transformers import SentenceTransformer |
|
|
|
# download model |
|
model = SentenceTransformer("rzkamalia/fine-tune-paraphrase-multilingual-MiniLM-L12-v2-version-2") |
|
|
|
# run inference |
|
sentences = [ |
|
'...', |
|
'...', |
|
'...' |
|
] |
|
embeddings = model.encode(sentences) |
|
|
|
# get the similarity scores for the embeddings |
|
similarities = model.similarity(embeddings, embeddings) |
|
``` |