SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 3 days ago • 114
Article Fine-tune ModernBERT for RAG with Synthetic Data By sdiazlor and 2 others • 18 days ago • 33
MedEmbed: Embedding Models for Medical Domain Collection GitHub -> https://github.com/abhinand5/MedEmbed • 4 items • Updated Oct 21, 2024 • 9
Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28, 2024 • 178
🇮🇹 Italian NLP Resources Collection Collection of models, datasets and demos relevant to Italian NLP 🇮🇹 • 282 items • Updated 1 day ago • 24
Is Cosine-Similarity of Embeddings Really About Similarity? Paper • 2403.05440 • Published Mar 8, 2024 • 3