File size: 39,242 Bytes
6c4d153
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
---
tags:
- sentence-transformers
- sentence-similarity
- feature-extraction
- generated_from_trainer
- dataset_size:6433
- loss:Infonce
base_model: BAAI/bge-m3
widget:
- source_sentence: What are the factors that influence the treatment and healing of
    equine lameness
  sentences:
  - 'Masterkraft may refer to:


    Masterkraft (producer), a record producer

    MSTRKRFT, a Canadian electronic music group'
  - 'The treatment of equine lameness is a complex subject. Lameness in horses has
    a variety of causes, and treatment must be tailored to the type and degree of
    injury, as well as the financial capabilities of the owner. Treatment may be applied
    locally, systemically, or intralesionally, and the strategy for treatment may
    change as healing progresses. The end goal is to reduce the pain and inflammation
    associated with injury, to encourage the injured tissue to heal with normal structure
    and function, and to ultimately return the horse to the highest level of performance
    possible following recovery. The process of healing


    Bone


    Bone heals by formation of a callus over the defective area. Speed and quality
    of healing is directly related to the blood supply and fracture stability. Rest
    is required immediately following injury to reduce movement of the fracture site.
    Stability may be improved through use of surgical implants or casting, depending
    on the location of extent of the fracture. Shock wave therapy is sometimes employed
    in the case of splint bone fracture or stress fractures to the cannon bones, to
    improve blood flow to the area. Fractures within a joint, such as chip fractures
    in the knee, hock, or fetlock, require arthroscopic surgery to prevent secondary
    arthritis of that joint. In some cases, the callus may place pressure on surrounding
    soft tissue structures. The callus of a splint bone fracture can push on the adjacent
    suspensory ligament, leading to lameness from secondary suspensory desmitis. Treatment
    usually involves the removal of the offending callus. On average, bone heals better
    than soft tissue. It requires less time to heal and, unlike soft tissue which
    is always weaker after healing, bone heals to 100% strength. However, fracture
    healing in horses is complicated by their size, flightiness, and desire to stand.
    Horses are at risk of re-injury of the fracture site, especially when trying to
    rise after lying down, or when recovering from anesthesia following fracture repair.
    Forced recumbency is not an option for horses, making healing more difficult.
    Weight bearing on a single front or hind limb increases the likelihood of support
    limb laminitis. Additionally, the cost of casting or surgical fixation makes treatment
    financially unattainable for some owners. While limb fractures are no longer a
    death-sentence for horses, it is still considered a very serious injury. In general,
    a horse is more likely to survive if it is small in stature and has a good temperament
    that will tolerate the months of inactivity required for healing. Fractures that
    are open, comminuted (very fragmented), or located higher on the limb tend to
    have a poorer prognosis. Synovial joints

    Lameness is most commonly associated with injury to synovial joints, or those
    joints containing articular cartilage, a joint capsule, and a synovial membrane.
    Joint disease may affect the joint capsule and synovial membrane, articular cartilage,
    subchondral bone (the bone underneath the cartilage), menisci, or any ligaments
    associated with the joint. Damage to any of these tissues leads to inflammation,
    which is especially problematic in the joint. While degeneration of articular
    cartilage is a common disease process in working animals, resulting in osteoarthritis,
    cartilage is aneural (does not contain nerves) and does not produce pain. Pain
    associated with osteoarthritis is secondary to joint capsule pain, due to joint
    distention and reduced range of motion, or to pain from the underlying bone, which
    may become damaged following erosion of the articular cartilage. Inflammatory
    products, such as inflammatory mediators and cytokines, damage articular cartilage
    and have been shown to weaken intra-articular ligaments. Therefore, treatment
    of joint disease should not only address the primary injury producing inflammation,
    but also the inflammatory cycle that leads to further tissue damage. Cryotherapy,
    joint lavage, systemic anti-inflammatories, or intra-articular medications are
    used to reduce joint inflammation. In the case of severe joint pathology such
    as an osteochondral chip, intra-articular fracture, osteochondritis dissecans
    lesion, or ligamentous or meniscal injury, arthroscopy may be required to ensure
    normal function of that joint. Debris within the joint, such as from a chip fracture,
    can cause long term damage to the synovium and articular cartilage leading to
    osteoarthritis, and is therefore best removed. Following acute injury, joints
    often benefit from specialized physical therapy, such as swimming, to prevent
    the loss of range of motion associated with joint capsule fibrosis. Treatment
    of joint cartilage injury is difficult and often unrewarding. Partial-thickness
    defects do not heal. The body will try to repair full-thickness cartilage defects
    using scar tissue or fibrocartilage, both of which are poor substitutes for normal,
    healthy articular cartilage. Current treatment includes arthroscopy-produced microfratures
    within the subchondral plate. These microfractures encourage an inflammatory response
    within the defect, which recruits stem cells to the area. Unfortunately, these
    cells differentiate into fibrocartilage, rather than normal joint (hyaline) cartilage,
    leading to inferior tissue repair at the site of injury. Bone marrow aspirate
    concentrate (BMAC) has shown some benefits when grafted into the area following
    microtrauma. However, the primary treatment for degenerative joint disease involves
    reducing the inflammatory process that is known to accelerate articular cartilage
    degeneration. Tendon and ligament


    Tendon is primarily composed of elastic type I collagen. However, mature tendon
    contains cells that have a limited ability to regenerate. Following injury, tendon
    lays down type III collagen, or scar tissue, which is stronger than type I collagen
    but stiffer and less-elastic. This makes it less distensible and more likely to
    re-injure when the horse begins to stretch the tendon during strenuous work. Certain
    treatments may improve the final tendon fiber quality, and subsequently increase
    the likelihood that the horse will return to full performance post-injury. Healing
    of soft tissue injury is often monitored using ultrasound to assess the lesion
    size and fiber pattern. Monitoring soft tissue injury with ultrasound allows for
    a more scientific determination of when to introduce exercise back into the horse''s
    rehabilitation program, and for quick intervention should the injury worsen. Recently,
    a new ultrasound technique called color Doppler ultrasonography has been used
    to assess equine tendon injuries. Color Doppler measures the degree of blood flow
    to a lesion, allowing for more accurate assessment of healing. Rest and hand-walking

    Rest is almost always recommended for treatment of lameness because it reduces
    forces placed on damaged tissue, allowing the normal process of healing to occur.
    Type and severity of injury determines the duration and degree of rest required.
    Aggressive limitations of activity may be required in cases of fracture. Horses
    are kept tied for the several-month duration of healing, to prevent them from
    lying down and potentially re-injuring the bone while trying to stand. In other
    cases, rest may be contraindicated. Animals with a history of upward fixation
    of the patella, polysaccharide storage myopathy, and equine recurrent rhabdomyolysis
    are often best kept on a schedule of regular exercise. Rest may be counterproductive
    if the lameness is secondary to osteoarthritis. In this case, mild exercise improves
    joint mobility and lameness can worsen with confinement. Rest may vary from strict
    confinement (“stall rest”), to small paddock or pasture turnout, to reduction
    of exercise intensity. Horses are often unpredictable when on prolonged stall
    rest, which greatly increases the risk of re-injury when hand-walking is begun.
    Sedation or additional forms of restraint may be needed to help control the horse
    during this initial period of'
  - As we stand right now, our broadcasts resemble a mixture between a graduate-level
    college seminar and a JV football game with the volume turned down 11 notches.
    Everything the commentators narrate is an explanation simply of what is going
    on, not speculation as to why they are doing it. They simply state what an ejection
    means, what a particular whistle is for, or what the purpose was of taking a time
    out. There is no substance. If Water Polo hopes and expects to be a household
    name, or at least something that households want on their television instead of
    the Wednesday Bowling Championships, then we need to institute a change. Most
    broadcasts are viewed by Water Polo fans…fans that already understand the game
    and the rules. We don’t need to be spoken to like a freshman college student who’s
    learning algebra for the first time. Most of us have already done that, and I
    feel confident in claiming that no one cares to do it again, especially not for
    entertainment. Production and entertainment value are key to the growth and expansion
    of any sport, especially Water Polo. Invite a color commentator that knows the
    game well enough to look for the nuances in the action.
- source_sentence: What architectural features make Rosslyn Chapel unique
  sentences:
  - 'Star Wars: Galaxy’s Edge may not officially open until the end of June, but for
    some fans, it could happen even sooner.'
  - Visual artist Megan Jonas painted urban Phoenix landscapes, and musician Jordan
    Ignacio composed music to accompany the images. Director Joshua J. Provost is
    an Arizona native and explores how the artists are inspired, allowing them to
    express their creative process through animation and sound.
  - 'In 1736, James St. Clair worked to restore the building to its former state.
    Since then, the Earls of St. Clair have worked to keep the building in good condition.
    There are a few intersting things to mention about Rosslyn Chapel. First, the
    town Roslin was founded as a home for the stone masons who worked on the original
    church. This is the same Roslin where scientists produced the first cloned animal,
    Dolly the sheep, in 1996. Secondly, the chapel''s design is quite eccentric. There
    are 32 different arch designs, which is strange enough, and some are even hollow.
    There are numerous carvings, including the "Dance of Death" motif, Abraham about
    to sacrifice Isaac, Samson destroying Philistines, David killing a lion, the Prodigal
    son, the Crucifixion, and scenes of the St. Clairs. The building is oriented east-west:

    marked out by the solar ray, according to the ancient tradition. A geometrical
    figure is usually used to regulate the proportions, and a double equilateral triangle
    is used in Rosslyn.Then there is the case of the Apprentice Pillar and its possible
    significance. The story associated with it is somewhat reminiscent of the Widow''s
    Son of the Freemasons, and there are numerous claims--unsubstantiated--which say
    that the Knights Templar worshipped here after escaping from the Continent, and
    that they brought the Holy Grail and hid it in the pillar. --The Rosslyn Chapel
    website

    All in all, an unusual place. Back to "R" | Back to JCE

    Mary Jones © 2003'
- source_sentence: What is the liability of Norwest regarding the information provided
    on their website
  sentences:
  - Mulpha Norwest Pty Ltd ACN 000 004 633, Norwest City Pty Ltd ACN 603 333 615 (in
    its personal capacity and as trustee of the Norwest City Trust ABN 26 695 958
    023), Norwest Real Estate Pty Ltd ACN 000 692 213, Norwest Flexi Pty Ltd ACN 607
    435 165 (in its personal capacity and as trustee of Flexi Trust ABN 49 958 864
    033) and Circa 1 Pty Ltd (“Norwest, we, us, our ”) updates this website as often
    as practicable and tries to ensure that all information provided as part of this
    website is correct at the time of its inclusion; however, we do not guarantee
    its accuracy at any time. All information provided as part of this website is
    intended to be general in nature and you should not rely on it in connection with
    the making of any decision. We are not liable for any action you may take as a
    result of, or loss incurred from relying on this website, including from data
    corruption on download. We do not take responsibility for websites of other organisations,
    whether or not linked, or any device used to access or view our website.
  - Coaltown (1945–1965) was an American Hall of Fame Champion Thoroughbred racehorse
    of whom The New York Times said "was probably the most underrated Thoroughbred
    of the 20th Century."
  - '“As a proud Cal Alum, I’m excited to help lay the foundation for Berkeley to
    become a globally recognized hub for Artificial Intelligence.”

    We’re excited to share that we are expanding our team to execute on this vision.
    Today, we welcome six UC Berkeley Faculty as Co-Founders of AI@The House and Faculty
    Partners at The House Fund. Their professional and academic expertise spans across
    the full stack of AI — theory and statistics, computer vision, systems, robotics,
    security, algorithms, hardware, and more.'
- source_sentence: In what year were the book reviews mentioned in various publications
    published
  sentences:
  - Vol 13, No 1. Pages 84 to 86. JSTOR. Julius Goebel Jr. (September 1926) Political
    Science Quarterly. Vol 41, No 3. Pages 476 to 479. JSTOR. Plucknett, Thoedore
    F T. "Book Reviews" (1926) 39 Harvard Law Review 405 JSTOR. Zane, John M. "Current
    Legal Literature". (1926) 12 American Bar Association Journal 172. JSTOR. Putnam,
    Bertha Haven. "Reviews of Books" (July 1926) The American Historical Review. Vol
    31, No 4. Page 763. JSTOR. H D H. "Book Reviews" (1926) 2 Cambridge Law Journal
    408. JSTOR.
  - 'Wednesday, September 24, 2014

    Kiwi Lane & Bella Blvd "That''s My Boy"

    Alicia with you today for Kiwi Lane. I have for you a single page layout using
    Bella Blvd''s Max Collections paper That''s My Boy, Borders, Treasures & Text
    stickers with some die cut pieces from that collection.'
  - Memoir '44 is a historical boardgame where players face-off in stylized battles
    of some of the most famous historic battles of World War II including Omaha Beach,
    Pegasus Bridge, Operation Cobra and the Ardennes. Memoir '44 includes over 15
    different battle scenarios and features a double-sided hex game board for both
    beach landings and countryside combat.
- source_sentence: What rights do individuals have regarding their personal data according
    to European Regulation no. 679/2016
  sentences:
  - 'The regional economy has benefited from these investments in infrastructure.
    The project owner has also funded the construction of a local school, which is
    providing benefits to local children. Project impacts and benefits:

    - The project has generated hydropower plant operation/ maintenance jobs for local
    people. - 24 operational staff have benefited from six months of capacity building
    in the form of technical training. - The construction of a new transmission line
    is reducing electricity loss and increasing the electricity supply in the region.
    - Former low-quality infrastructure systems in the region have improved, e.g.
    by upgrading roads, and by building bridges and irrigation canals. - A local school
    has been built. - The project has provided local farmers with support to broaden
    their agricultural activities to make them more sustainable (e.g. by implementing
    aquaculture, which reduces the need for logging for farmland). - The project has
    reduced the need for wood for heating, cooking, and lighting, thus allowing the
    forest to regenerate and improving soil conditions, hydrology and biodiversity.
    - The project has improved regional air quality by reducing the need for diesel
    generators and wood fires.'
  - 'How did you get there??? 05-20-2003, 05:57 AM

    FAQ (on this forum) -> How do I get a picture under my username'
  - By consulting this site, data relating to identified or identifiable persons may
    be processed. The consent mechanisms will be evident, brief and easily understandable;
    if the original conditions for which consent was requested were to be changed,
    for example if the purpose of data processing changed, further consent will be
    required pursuant to European Regulation no. 679/2016. All the documents related
    to consents collected will be kept separate from any other corporate document.
    Your personal data will not be disclosed and you are granted the exercise of the
    rights referred to in Articles. 11-20 of the European Regulation n. 679/2016 by
    writing to Promoviaggi S.p.A., Viale Gian Galeazzo, nr. 3, 20136 Milano (Italy)
    or by sending an e-mail to [email protected].
pipeline_tag: sentence-similarity
library_name: sentence-transformers
---

# SentenceTransformer based on BAAI/bge-m3

This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [BAAI/bge-m3](https://huggingface.co/BAAI/bge-m3). It maps sentences & paragraphs to a 1024-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.

## Model Details

### Model Description
- **Model Type:** Sentence Transformer
- **Base model:** [BAAI/bge-m3](https://huggingface.co/BAAI/bge-m3) <!-- at revision 5617a9f61b028005a4858fdac845db406aefb181 -->
- **Maximum Sequence Length:** 1024 tokens
- **Output Dimensionality:** 1024 dimensions
- **Similarity Function:** Cosine Similarity
<!-- - **Training Dataset:** Unknown -->
<!-- - **Language:** Unknown -->
<!-- - **License:** Unknown -->

### Model Sources

- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
- **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
- **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)

### Full Model Architecture

```
SentenceTransformer(
  (0): Transformer({'max_seq_length': 1024, 'do_lower_case': False}) with Transformer model: XLMRobertaModel 
  (1): Pooling({'word_embedding_dimension': 1024, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
  (2): Normalize()
)
```

## Usage

### Direct Usage (Sentence Transformers)

First install the Sentence Transformers library:

```bash
pip install -U sentence-transformers
```

Then you can load this model and run inference.
```python
from sentence_transformers import SentenceTransformer

# Download from the 🤗 Hub
model = SentenceTransformer("Jrinky/mpnet-base-all-nli-triplet")
# Run inference
sentences = [
    'What rights do individuals have regarding their personal data according to European Regulation no. 679/2016',
    'By consulting this site, data relating to identified or identifiable persons may be processed. The consent mechanisms will be evident, brief and easily understandable; if the original conditions for which consent was requested were to be changed, for example if the purpose of data processing changed, further consent will be required pursuant to European Regulation no. 679/2016. All the documents related to consents collected will be kept separate from any other corporate document. Your personal data will not be disclosed and you are granted the exercise of the rights referred to in Articles. 11-20 of the European Regulation n. 679/2016 by writing to Promoviaggi S.p.A., Viale Gian Galeazzo, nr. 3, 20136 Milano (Italy) or by sending an e-mail to [email protected].',
    'The regional economy has benefited from these investments in infrastructure. The project owner has also funded the construction of a local school, which is providing benefits to local children. Project impacts and benefits:\n- The project has generated hydropower plant operation/ maintenance jobs for local people. - 24 operational staff have benefited from six months of capacity building in the form of technical training. - The construction of a new transmission line is reducing electricity loss and increasing the electricity supply in the region. - Former low-quality infrastructure systems in the region have improved, e.g. by upgrading roads, and by building bridges and irrigation canals. - A local school has been built. - The project has provided local farmers with support to broaden their agricultural activities to make them more sustainable (e.g. by implementing aquaculture, which reduces the need for logging for farmland). - The project has reduced the need for wood for heating, cooking, and lighting, thus allowing the forest to regenerate and improving soil conditions, hydrology and biodiversity. - The project has improved regional air quality by reducing the need for diesel generators and wood fires.',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 1024]

# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]
```

<!--
### Direct Usage (Transformers)

<details><summary>Click to see the direct usage in Transformers</summary>

</details>
-->

<!--
### Downstream Usage (Sentence Transformers)

You can finetune this model on your own dataset.

<details><summary>Click to expand</summary>

</details>
-->

<!--
### Out-of-Scope Use

*List how the model may foreseeably be misused and address what users ought not to do with the model.*
-->

<!--
## Bias, Risks and Limitations

*What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
-->

<!--
### Recommendations

*What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
-->

## Training Details

### Training Dataset

#### Unnamed Dataset

* Size: 6,433 training samples
* Columns: <code>anchor</code> and <code>positive</code>
* Approximate statistics based on the first 1000 samples:
  |         | anchor                                                                           | positive                                                                             |
  |:--------|:---------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
  | type    | string                                                                           | string                                                                               |
  | details | <ul><li>min: 6 tokens</li><li>mean: 18.1 tokens</li><li>max: 38 tokens</li></ul> | <ul><li>min: 6 tokens</li><li>mean: 182.31 tokens</li><li>max: 1024 tokens</li></ul> |
* Samples:
  | anchor                                                                                                               | positive                                                                                                                                                                                                                                                                                                                                                                                                                                 |
  |:---------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
  | <code>What type of insect is Crambus sudanicola</code>                                                               | <code>Crambus sudanicola is a moth in the family Crambidae.</code>                                                                                                                                                                                                                                                                                                                                                                       |
  | <code>How can you improve storage capacity in standard-height kitchens with unused wall space</code>                 | <code>If you have standard-height cabinets with unused wall space above, increasing the cabinets by six inches can improve storage capacity. Utilize cutlery dividers in drawers to organize cooking utensils and tools and keep them off counters.</code>                                                                                                                                                                               |
  | <code>What new guidelines has the Library Association issued regarding the sale of rare books and manuscripts</code> | <code>The Library Association has issued new guidelines for the sale of rare books and manuscripts by institutions, writes Kam Patel. The move follows an acrimonious dispute at Keele University over the sale of rare mathematics books for Pounds 1 million. The association said that for an institution to have the authority to sell books, its library should first establish that it has a full legal title to the works.</code> |
* Loss: <code>selfloss.Infonce</code> with these parameters:
  ```json
  {
      "scale": 20.0,
      "similarity_fct": "cos_sim"
  }
  ```

### Evaluation Dataset

#### Unnamed Dataset

* Size: 804 evaluation samples
* Columns: <code>anchor</code> and <code>positive</code>
* Approximate statistics based on the first 804 samples:
  |         | anchor                                                                            | positive                                                                             |
  |:--------|:----------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
  | type    | string                                                                            | string                                                                               |
  | details | <ul><li>min: 6 tokens</li><li>mean: 17.78 tokens</li><li>max: 37 tokens</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 190.09 tokens</li><li>max: 1024 tokens</li></ul> |
* Samples:
  | anchor                                                                                                                          | positive                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |
  |:--------------------------------------------------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
  | <code>What does the speaker suggest about the relationship between unarmed civilians and the metaphor of eggs and a wall</code> | <code>One way to read the metaphor, he says, is that unarmed civilians are the eggs, while tanks, guns and white phosphorus shells are the wall. But he also offers a more nuanced interpretation:<br>Each of us is, more or less, an egg.</code>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
  | <code>When did Reed-Rowe retire from the Foreign Service</code>                                                                 | <code>Reed-Rowe completed that assignment July 26, 2013, and was succeeded by Amy J. Hyatt. Reed-Rowe then joined the United States Army War College as a member of the Command team which focuses on the development of the next generation of military, interagency and international leaders. She officially retired from the Foreign Service in July 2014. During her career, Reed-Rowe earned several Department of State Meritorious Honor Awards, recognition from the Republic of the Marshall Islands, and the Republic of Palau, and the U.S. Army Superior Civilian Service Award. Personal<br>In addition to English, Reed-Rowe speaks Spanish and French. She has two adult children, Nikkia Rowe and Kevin Anthony Rowe. See also<br><br>List of ambassadors of the United States<br><br>References<br><br>External links<br>US Department of State: Ambassadorial Nomination Statement: Helen Reed-Rowe, Ambassador-Designate to Palau, July 21, 2010<br><br>Ambassadors of the United States to Palau<br>African-American diplomats<br>American women ambassadors<br>...</code> |
  | <code>When will Star Wars: Galaxy's Edge officially open</code>                                                                 | <code>Star Wars: Galaxy’s Edge may not officially open until the end of June, but for some fans, it could happen even sooner.</code>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |
* Loss: <code>selfloss.Infonce</code> with these parameters:
  ```json
  {
      "scale": 20.0,
      "similarity_fct": "cos_sim"
  }
  ```

### Training Hyperparameters
#### Non-Default Hyperparameters

- `eval_strategy`: steps
- `per_device_train_batch_size`: 4
- `per_device_eval_batch_size`: 4
- `learning_rate`: 2e-05
- `num_train_epochs`: 1
- `warmup_ratio`: 0.1
- `fp16`: True
- `batch_sampler`: no_duplicates

#### All Hyperparameters
<details><summary>Click to expand</summary>

- `overwrite_output_dir`: False
- `do_predict`: False
- `eval_strategy`: steps
- `prediction_loss_only`: True
- `per_device_train_batch_size`: 4
- `per_device_eval_batch_size`: 4
- `per_gpu_train_batch_size`: None
- `per_gpu_eval_batch_size`: None
- `gradient_accumulation_steps`: 1
- `eval_accumulation_steps`: None
- `learning_rate`: 2e-05
- `weight_decay`: 0.0
- `adam_beta1`: 0.9
- `adam_beta2`: 0.999
- `adam_epsilon`: 1e-08
- `max_grad_norm`: 1.0
- `num_train_epochs`: 1
- `max_steps`: -1
- `lr_scheduler_type`: linear
- `lr_scheduler_kwargs`: {}
- `warmup_ratio`: 0.1
- `warmup_steps`: 0
- `log_level`: passive
- `log_level_replica`: warning
- `log_on_each_node`: True
- `logging_nan_inf_filter`: True
- `save_safetensors`: True
- `save_on_each_node`: False
- `save_only_model`: False
- `restore_callback_states_from_checkpoint`: False
- `no_cuda`: False
- `use_cpu`: False
- `use_mps_device`: False
- `seed`: 42
- `data_seed`: None
- `jit_mode_eval`: False
- `use_ipex`: False
- `bf16`: False
- `fp16`: True
- `fp16_opt_level`: O1
- `half_precision_backend`: auto
- `bf16_full_eval`: False
- `fp16_full_eval`: False
- `tf32`: None
- `local_rank`: 0
- `ddp_backend`: None
- `tpu_num_cores`: None
- `tpu_metrics_debug`: False
- `debug`: []
- `dataloader_drop_last`: False
- `dataloader_num_workers`: 0
- `dataloader_prefetch_factor`: None
- `past_index`: -1
- `disable_tqdm`: False
- `remove_unused_columns`: True
- `label_names`: None
- `load_best_model_at_end`: False
- `ignore_data_skip`: False
- `fsdp`: []
- `fsdp_min_num_params`: 0
- `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
- `fsdp_transformer_layer_cls_to_wrap`: None
- `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
- `deepspeed`: None
- `label_smoothing_factor`: 0.0
- `optim`: adamw_torch
- `optim_args`: None
- `adafactor`: False
- `group_by_length`: False
- `length_column_name`: length
- `ddp_find_unused_parameters`: None
- `ddp_bucket_cap_mb`: None
- `ddp_broadcast_buffers`: False
- `dataloader_pin_memory`: True
- `dataloader_persistent_workers`: False
- `skip_memory_metrics`: True
- `use_legacy_prediction_loop`: False
- `push_to_hub`: False
- `resume_from_checkpoint`: None
- `hub_model_id`: None
- `hub_strategy`: every_save
- `hub_private_repo`: False
- `hub_always_push`: False
- `gradient_checkpointing`: False
- `gradient_checkpointing_kwargs`: None
- `include_inputs_for_metrics`: False
- `eval_do_concat_batches`: True
- `fp16_backend`: auto
- `push_to_hub_model_id`: None
- `push_to_hub_organization`: None
- `mp_parameters`: 
- `auto_find_batch_size`: False
- `full_determinism`: False
- `torchdynamo`: None
- `ray_scope`: last
- `ddp_timeout`: 1800
- `torch_compile`: False
- `torch_compile_backend`: None
- `torch_compile_mode`: None
- `dispatch_batches`: None
- `split_batches`: None
- `include_tokens_per_second`: False
- `include_num_input_tokens_seen`: False
- `neftune_noise_alpha`: None
- `optim_target_modules`: None
- `batch_eval_metrics`: False
- `eval_on_start`: False
- `prompts`: None
- `batch_sampler`: no_duplicates
- `multi_dataset_batch_sampler`: proportional

</details>

### Training Logs
| Epoch  | Step | Training Loss | Validation Loss |
|:------:|:----:|:-------------:|:---------------:|
| 0.7407 | 100  | 0.2167        | 0.1060          |


### Framework Versions
- Python: 3.12.3
- Sentence Transformers: 3.4.0
- Transformers: 4.42.4
- PyTorch: 2.2.0+cu121
- Accelerate: 1.3.0
- Datasets: 3.2.0
- Tokenizers: 0.19.1

## Citation

### BibTeX

#### Sentence Transformers
```bibtex
@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}
```

#### Infonce
```bibtex
@misc{henderson2017efficient,
    title={Efficient Natural Language Response Suggestion for Smart Reply},
    author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
    year={2017},
    eprint={1705.00652},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}
```

<!--
## Glossary

*Clearly define terms in order to be accessible across audiences.*
-->

<!--
## Model Card Authors

*Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
-->

<!--
## Model Card Contact

*Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
-->