view article Article Train 400x faster Static Embedding Models with Sentence Transformers 23 days ago • 138
view article Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial By open-r1 • 7 days ago • 29
view article Article 🦸🏻#9: Does AI Remember? The Role of Memory in Agentic Workflows By Kseniase • 5 days ago • 7
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 16 days ago • 301
view article Article Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO) By ariG23498 • 19 days ago • 13
view article Article Hugging Face and FriendliAI partner to supercharge model deployment on the Hub 16 days ago • 30