Qianfan-VL: A Milestone Achievement in Chinese Multimodal AI with Domestic Chips By baidu • 6 days ago • 8
Ground-up efforts to build large datasets for effective and accurate translation of Modi-Script documents into modern Marathi By Arunbiz and 1 other • 5 days ago • 6
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 225
🌎 What kind of environmental impacts are AI companies disclosing? (And can we compare them?) 🌎 By sasha and 1 other • 13 days ago • 12
Preserving Agency: Why AI Safety Needs Community, Not Corporate Control By giadap • about 20 hours ago • 5
Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face By dvgodoy • Feb 11 • 70
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment By NormalUhr • Feb 11 • 71
Qianfan-VL: A Milestone Achievement in Chinese Multimodal AI with Domestic Chips By baidu • 6 days ago • 8
Ground-up efforts to build large datasets for effective and accurate translation of Modi-Script documents into modern Marathi By Arunbiz and 1 other • 5 days ago • 6
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 225
🌎 What kind of environmental impacts are AI companies disclosing? (And can we compare them?) 🌎 By sasha and 1 other • 13 days ago • 12
Preserving Agency: Why AI Safety Needs Community, Not Corporate Control By giadap • about 20 hours ago • 5
Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face By dvgodoy • Feb 11 • 70
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment By NormalUhr • Feb 11 • 71