1 32 3

Nikolai Debono

boccu2009

AI & ML interests

None yet

Recent Activity

upvoted a collection 15 days ago

Nemotron-Pre-Training-Dataset

upvoted a paper 18 days ago

A Survey on Diffusion Language Models

upvoted a paper 21 days ago

SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens

View all activity

Organizations

None yet

upvoted a collection 15 days ago

Nemotron-Pre-Training-Dataset

Collection

7 items • Updated about 2 hours ago • 31

upvoted a paper 18 days ago

A Survey on Diffusion Language Models

Paper • 2508.10875 • Published 19 days ago • 33

upvoted a paper 21 days ago

SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens

Paper • 2508.05305 • Published 26 days ago • 45

upvoted a collection about 2 months ago

SmolLM3 pretraining datasets

Collection

datasets used in SmolLM3 pretraining • 15 items • Updated 21 days ago • 28

upvoted a paper about 2 months ago

Should We Still Pretrain Encoders with Masked Language Modeling?

Paper • 2507.00994 • Published Jul 1 • 78

upvoted 2 papers 4 months ago

Rethinking the Generation of High-Quality CoT Data from the Perspective of LLM-Adaptive Question Difficulty Grading

Paper • 2504.11919 • Published Apr 16 • 12

Tina: Tiny Reasoning Models via LoRA

Paper • 2504.15777 • Published Apr 22 • 55

upvoted 9 papers 5 months ago

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Paper • 2503.22230 • Published Mar 28 • 46

Efficient Inference for Large Reasoning Models: A Survey

Paper • 2503.23077 • Published Mar 29 • 47

What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models

Paper • 2503.24235 • Published Mar 31 • 55

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

Paper • 2503.24290 • Published Mar 31 • 63

Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning

Paper • 2503.16252 • Published Mar 20 • 28

upvoted a paper 6 months ago

Personalize Anything for Free with Diffusion Transformer

Paper • 2503.12590 • Published Mar 16 • 44

upvoted a collection 6 months ago

Reasoning Datasets

Collection

Distilled synthetic Reasoning datasets • 7 items • Updated Feb 2 • 61

upvoted 2 papers 7 months ago

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published Jan 20 • 107

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 116

Nikolai Debono

AI & ML interests

Recent Activity

Organizations

boccu2009's activity