File size: 1,041 Bytes
2359bda |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 |
# Datasets
`sentence_transformers.datasets` contains classes to organize your training input examples.
## ParallelSentencesDataset
`ParallelSentencesDataset` is used for multilingual training. For details, see [multilingual training](../../examples/training/multilingual/README.md).
```eval_rst
.. autoclass:: sentence_transformers.datasets.ParallelSentencesDataset
```
## SentenceLabelDataset
`SentenceLabelDataset` can be used if you have labeled sentences and want to train with triplet loss.
```eval_rst
.. autoclass:: sentence_transformers.datasets.SentenceLabelDataset
```
## DenoisingAutoEncoderDataset
`DenoisingAutoEncoderDataset` is used for unsupervised training with the TSDAE method.
```eval_rst
.. autoclass:: sentence_transformers.datasets.DenoisingAutoEncoderDataset
```
## NoDuplicatesDataLoader
`NoDuplicatesDataLoader`can be used together with MultipleNegativeRankingLoss to ensure that no duplicates are within the same batch.
```eval_rst
.. autoclass:: sentence_transformers.datasets.NoDuplicatesDataLoader
```
|