3 8 3

Arun Prakash

arun-AiBharat

Arunprakash-a

AI & ML interests

LLMs, OCR

Recent Activity

upvoted an article 23 days ago

The N Implementation Details of RLHF with PPO

upvoted an article 27 days ago

Welcome GPT OSS, the new open-source model family from OpenAI!

upvoted an article about 1 month ago

Illustrating Reinforcement Learning from Human Feedback (RLHF)

View all activity

Organizations

upvoted an article 23 days ago

Article

The N Implementation Details of RLHF with PPO

and 2 others •

Oct 24, 2023

• 67

upvoted an article 27 days ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

and 11 others •

28 days ago

• 481

upvoted an article about 1 month ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

and 3 others •

Dec 9, 2022

• 332

upvoted 2 articles about 2 months ago

Article

Mixture of Experts Explained

and 5 others •

Dec 11, 2023

• 862

Article

SmolLM3: smol, multilingual, long-context reasoner

and 22 others •

Jul 8

• 643

upvoted 3 articles 4 months ago

Article

Let's talk about LLM evaluation

•

May 23, 2024

• 184

Article

Open LLM Leaderboard: DROP deep dive

and 4 others •

Dec 1, 2023

• 9

Article

What's going on with the Open LLM Leaderboard?

and 3 others •

Jun 23, 2023

• 43

Arun Prakash

AI & ML interests

Recent Activity

Organizations

arun-AiBharat's activity

The N Implementation Details of RLHF with PPO

Welcome GPT OSS, the new open-source model family from OpenAI!

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Mixture of Experts Explained

SmolLM3: smol, multilingual, long-context reasoner

Let's talk about LLM evaluation

Open LLM Leaderboard: DROP deep dive

What's going on with the Open LLM Leaderboard?