shisa-ai's Collections
shisa-v2-research
updated
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing
Paper • 2406.08464 • Published • 67
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Paper • 2406.20094 • Published • 100
argilla/magpie-ultra-v1.0
Viewer • Updated • 3.22M • 1.15k • 42
Viewer • Updated • 1k • 7.02k • 102
Viewer • Updated • 817 • 5.78k • 140
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
Paper • 2401.01335 • Published • 65
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Paper • 2404.03715 • Published • 61
Self-Boosting Large Language Models with Synthetic Preference Data
Paper • 2410.06961 • Published • 16
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
Paper • 2412.11605 • Published • 18
Magpie-Align/Magpie-Reasoning-V1-150K-CoT-Deepseek-R1-Llama-70B
Viewer • Updated • 150k • 296 • 17
sbintuitions/modernbert-ja-130m
Fill-Mask • Updated • 5.39k • 39
bespokelabs/Bespoke-Stratos-17k
Viewer • Updated • 16.7k • 30.7k • 298
SymNoise: Advancing Language Model Fine-tuning with Symmetric Noise
Paper • 2312.01523 • Published