view article Article AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems 5 days ago • 27
view article Article The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator 11 days ago • 33
Bread&Butter Collection Top LLMs 2025: ZAI GLM-4.7 (358B) & Moonshot Kimi-K2-Thinking. Next-gen reasoning, code, multilingual. State-of-the-art performance. Production-ready. • 2 items • Updated about 1 hour ago
Bread&Butter Collection Top LLMs 2025: ZAI GLM-4.7 (358B) & Moonshot Kimi-K2-Thinking. Next-gen reasoning, code, multilingual. State-of-the-art performance. Production-ready. • 2 items • Updated about 1 hour ago
Google Gemma Scope 2 - Neuronpedia Collection Google Gemma Scope 2: JumpReLU SAEs for Gemma 2 interpretability. 270M PT/IT, 1B PT variants. Neuronpedia integration. Mechanistic analysis. • 11 items • Updated about 1 hour ago • 1
Google Gemma Scope 2 - Neuronpedia Collection Google Gemma Scope 2: JumpReLU SAEs for Gemma 2 interpretability. 270M PT/IT, 1B PT variants. Neuronpedia integration. Mechanistic analysis. • 11 items • Updated about 1 hour ago • 1