5 3 6

Kian Kyars

kyars

https://sites.ualberta.ca/~kkyars/

AI & ML interests

None yet

Recent Activity

commented on a paper 22 days ago

Treasure Hunt: Real-time Targeting of the Long Tail using Training-Time Markers

commented on a paper 22 days ago

Treasure Hunt: Real-time Targeting of the Long Tail using Training-Time Markers

updated a Space 27 days ago

Agents-MCP-Hackathon/Decider-MCP

View all activity

Organizations

commented 2 papers 22 days ago

Treasure Hunt: Real-time Targeting of the Long Tail using Training-Time Markers

Paper • 2506.14702 • Published Jun 17 • 4 •

Treasure Hunt: Real-time Targeting of the Long Tail using Training-Time Markers

Paper • 2506.14702 • Published Jun 17 • 4 •

updated a Space 27 days ago

Decider MCP

🚀

Chat with a friendly AI assistant

liked 2 models 27 days ago

openai/gpt-oss-20b

Text Generation • 22B • Updated 7 days ago • 9.29M • • 3.37k

openai/gpt-oss-120b

Text Generation • 120B • Updated 7 days ago • 2.56M • • 3.7k

New activity in nanotron/README about 1 month ago

awesome resource

#1 opened about 1 month ago by

kyars

commented on SmolLM3: smol, multilingual, long-context reasoner about 1 month ago

thanks!

commented on SmolLM3: smol, multilingual, long-context reasoner about 1 month ago

makes sense

commented on SmolLM3: smol, multilingual, long-context reasoner about 1 month ago

For the reasoning mid-training we used the ChatML template (so no system prompt), for SFT we use SmolLM3's final chat template.

ChatML does have a system prompt, can you please elaborate?

commented on SmolLM3: smol, multilingual, long-context reasoner about 1 month ago

is it not possible to do on-policy distillation?

commented on SmolLM3: smol, multilingual, long-context reasoner about 1 month ago

nice thoughts

commented on SmolLM3: smol, multilingual, long-context reasoner about 1 month ago

tied embeddings were just used because llama uses it or was there an ablation?

upvoted an article about 1 month ago

Article

SmolLM3: smol, multilingual, long-context reasoner

and 22 others •

Jul 8

• 643

commented on SmolLM3: smol, multilingual, long-context reasoner about 1 month ago

Let's do this with Muon.

New activity in nanotron/ultrascale-playbook 3 months ago

TP Question

#113 opened 3 months ago by

kyars

published a Space 3 months ago

First Agent Template

⚡

Answer questions and perform searches

updated a Space 3 months ago

First Agent Template

⚡

Answer questions and perform searches

published a Space 3 months ago

Decider MCP

🚀

Chat with a friendly AI assistant

commented a paper 3 months ago

The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search

Paper • 2504.08066 • Published Apr 10 • 14 •

upvoted a paper 3 months ago

The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search

Paper • 2504.08066 • Published Apr 10 • 14

Kian Kyars

AI & ML interests

Recent Activity

Organizations

kyars's activity

Decider MCP

awesome resource

SmolLM3: smol, multilingual, long-context reasoner

TP Question

First Agent Template

First Agent Template

Decider MCP