Hiring 💼

Florian Zimmermeister PRO

flozi00

AI & ML interests

ASR, German LLM

Recent Activity

liked a model 8 days ago

Qwen/Qwen3.5-397B-A17B

upvoted a paper 9 days ago

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

liked a model 11 days ago

MiniMaxAI/MiniMax-M2.5

View all activity

Organizations

$A\\Ware's profile picture$

upvoted a paper 9 days ago

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Paper • 2602.05400 • Published 20 days ago • 331

upvoted an article 22 days ago

Article

Open Responses: What you need to know

Jan 15

•

108

upvoted an article 27 days ago

Article

We Got Claude to Build CUDA Kernels and teach open models!

28 days ago

•

139

upvoted 2 papers about 1 month ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 228

Recursive Language Models

Paper • 2512.24601 • Published Dec 31, 2025 • 89

upvoted 2 papers about 2 months ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 311

Parallax: Efficient LLM Inference Service over Decentralized Environment

Paper • 2509.26182 • Published Sep 30, 2025 • 1

upvoted a collection 2 months ago

Audio2Face-3D

Collection

Open-weight Audio2Face-3D and Audio2Emotion networks and a sample dataset for training and evaluation • 8 items • Updated about 23 hours ago • 15

upvoted 2 articles 3 months ago

Article

Continuous batching from first principles

Nov 25, 2025

•

330

Article

🌳 QAT: The Art of Growing a Bonsai Model

Nov 9, 2025

•

upvoted 2 papers 3 months ago

INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

Paper • 2510.25602 • Published Oct 29, 2025 • 78

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5, 2025 • 129

upvoted a collection 4 months ago

Cerebras REAP

Collection

Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 28 items • Updated 6 days ago • 119

upvoted 4 papers 4 months ago

SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26, 2025 • 80

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Paper • 2510.04618 • Published Oct 6, 2025 • 129

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 509

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30, 2025 • 547

upvoted an article 4 months ago

Article

Building the Open Agent Ecosystem Together: Introducing OpenEnv

Oct 23, 2025

•

149

upvoted 2 papers 5 months ago

Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization

Paper • 2509.23202 • Published Sep 27, 2025 • 29

Reinforcement Learning on Pre-Training Data

Paper • 2509.19249 • Published Sep 23, 2025 • 67

Florian Zimmermeister PRO

AI & ML interests

Recent Activity

Organizations

flozi00's activity

Open Responses: What you need to know

We Got Claude to Build CUDA Kernels and teach open models!

Continuous batching from first principles

🌳 QAT: The Art of Growing a Bonsai Model

Building the Open Agent Ecosystem Together: Introducing OpenEnv