Dharma KC

kcdharma

AI & ML interests

Machine Learning

Recent Activity

upvoted a paper about 2 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

updated a model 5 months ago

kcdharma/PolicyGradient

updated a model 5 months ago

kcdharma/q-FrozenLake-v1-4x4-noSlippery

Organizations

kcdharma's activity

upvoted a paper about 2 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 346

updated 2 models 5 months ago

kcdharma/PolicyGradient

Reinforcement Learning • Updated Oct 19, 2024

kcdharma/q-FrozenLake-v1-4x4-noSlippery

Reinforcement Learning • Updated Oct 2, 2024

upvoted a paper 9 months ago

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Paper • 2406.11931 • Published Jun 17, 2024 • 63

upvoted 2 papers 10 months ago

An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published May 27, 2024 • 88

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2, 2024 • 121

New activity in openchat/openchat over 1 year ago

complete inference code

#1 opened over 1 year ago by

liked a model over 1 year ago

openchat/openchat

Text Generation • Updated Jul 4, 2023 • 275 • 289

New activity in bigcode/starcoder almost 2 years ago

prompts for humaneval

#50 opened almost 2 years ago by

New activity in bigcode/santacoder almost 2 years ago

max_length_generation

#32 opened almost 2 years ago by
