Reasoning models trained on synthetic data using reinforcement learning.
Yichao 'Peak' Ji
peakji
AI & ML interests
Agents, Small Language Models, Retrieval-Augmented Generation, Information Extraction
Recent Activity
liked
a model
2 days ago
google/siglip2-so400m-patch14-384
liked
a model
7 days ago
nvidia/NVIDIA-Nemotron-Nano-12B-v2-Base
liked
a model
12 days ago
google/gemma-3-4b-pt