chnug (Roger yau)

upvoted 2 articles 3 months ago

Article

Smol2Operator: Post-Training GUI Agents for Computer Use

+3

Sep 23

•

134

Article

Optimization story: Bloom inference

Oct 12, 2022

•

7

upvoted a paper 4 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10 • 190

upvoted an article 4 months ago

Article

🪆 Introduction to Matryoshka Embedding Models

+1

Feb 23, 2024

•

185

upvoted 2 collections 4 months ago

InternVL3.5

Collection

This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 54 items • Updated Sep 28 • 103

DINOv3

Collection

DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated Aug 21 • 429

upvoted an article 5 months ago

Article

Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios

Feb 12

•

27

upvoted a paper 6 months ago

MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent

Paper • 2507.02259 • Published Jul 3 • 5

upvoted an article 6 months ago

Article

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

Jun 11

•

121

upvoted 2 articles 7 months ago

Article

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

+4

Jun 3

•

96

Article

Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models.

May 15

•

36

Roger yau

AI & ML interests

Organizations

Smol2Operator: Post-Training GUI Agents for Computer Use

Optimization story: Bloom inference

A Survey of Reinforcement Learning for Large Reasoning Models

🪆 Introduction to Matryoshka Embedding Models

InternVL3.5

DINOv3

Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios

MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models.

Roger yau

AI & ML interests

Organizations

chnug's activity

Smol2Operator: Post-Training GUI Agents for Computer Use

Optimization story: Bloom inference

🪆 Introduction to Matryoshka Embedding Models

Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models.