32 80 93

Somshubra Majumdar

smajumdar94

AI & ML interests

None yet

Recent Activity

upvoted a paper about 14 hours ago

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

upvoted a paper 1 day ago

Expanding RL with Verifiable Rewards Across Diverse Domains

liked a model 2 days ago

all-hands/openhands-lm-32b-v0.1

View all activity

Organizations

smajumdar94's activity

upvoted a paper about 14 hours ago

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

Paper • 2503.24290 • Published 3 days ago • 51

upvoted a paper 1 day ago

Expanding RL with Verifiable Rewards Across Diverse Domains

Paper • 2503.23829 • Published 3 days ago • 16

liked a model 2 days ago

all-hands/openhands-lm-32b-v0.1

Text Generation • Updated 3 days ago • 1.84k • 189

upvoted 2 papers 7 days ago

ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning

Paper • 2503.19470 • Published 9 days ago • 14

Open Deep Search: Democratizing Search with Open-source Reasoning Agents

Paper • 2503.20201 • Published 8 days ago • 39

upvoted a paper 8 days ago

Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking

Paper • 2503.19855 • Published 9 days ago • 24

upvoted a paper 10 days ago

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published 14 days ago • 46

liked 2 models 12 days ago

nvidia/Llama-3.1-Nemotron-Nano-8B-v1

Text Generation • Updated 18 days ago • 9.97k • 97

nvidia/Llama-3_3-Nemotron-Super-49B-v1

Text Generation • Updated 14 days ago • 40.6k • 215

liked a Space 12 days ago

Canary 1B Flash

🐤

Canary 1B Flash demo

upvoted an article 15 days ago

Article

NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets

16 days ago

• 31

liked a dataset 15 days ago

nvidia/Llama-Nemotron-Post-Training-Dataset-v1

Viewer • Updated 16 days ago • 15.2M • 10.3k • 297

liked a Space 16 days ago

269

Thera Arbitrary-Scale Super-Resolution

🔥

Enhance image quality with real-time super-resolution

upvoted a paper 17 days ago

Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond

Paper • 2503.10460 • Published 21 days ago • 27

liked a model 20 days ago

sesame/csm-1b

Text-to-Speech • Updated 18 days ago • 70.7k • • 1.78k

liked a model 22 days ago

nvidia/DeepSeek-R1-FP4

Text Generation • Updated Feb 26 • 51.1k • 231

liked a dataset 22 days ago

open-r1/codeforces

Viewer • Updated 1 day ago • 10k • 1.44k • 28

liked a model 23 days ago

RekaAI/reka-flash-3

Updated 21 days ago • 5.1k • 347

liked a dataset 27 days ago

deepmind/code_contests

Viewer • Updated Jun 11, 2023 • 4.04k • 12.4k • 161