Adam Yanxiao Zhao
sdpkjc
2 followers · 9 following
https://sdpkjc.com
sdpkjc_adam
sdpkjc
yanxiao-zhao
AI & ML interests
Reinforcement Learning
Recent Activity
authored a paper 1 day ago: ComputerRL: Scaling End-to-End Online Reinforcement Learning for Computer Use Agents
authored a paper 1 day ago: SATQuest: A Verifier for Logical Reasoning Evaluation and Reinforcement Fine-Tuning of LLMs
authored a paper 1 day ago: CAMEL: Continuous Action Masking Enabled by Large Language Models for Reinforcement Learning
sdpkjc's datasets (17)
sdpkjc/SATQuest-RFT-3k • Updated Jul 30 • 3k • 20
sdpkjc/SATQuest • Updated Jul 30 • 140 • 13
sdpkjc/24problems_quiz-eval-n4-1-10-24 • Updated May 22 • 55.5k • 5
sdpkjc/24problems_quiz-eval-5 • Updated May 22 • 100k • 8
sdpkjc/24problems_quiz • Updated May 21 • 85.6k • 16
sdpkjc/SATQuest-RFT-1k • Updated Apr 23 • 1k • 8
sdpkjc/SATQuest-Tiny • Updated Apr 20 • 10 • 6
sdpkjc/SATQuest-G • Updated Mar 28 • 963 • 3
sdpkjc/NumBase-N01-S2g-B2g • Updated Feb 26 • 983k • 4
sdpkjc/NumBase-N01-S2g-B28 • Updated Feb 26 • 459k • 3
sdpkjc/NumBase-N01-S2g-B24 • Updated Feb 26 • 197k • 2
sdpkjc/NumBase-N01-S28-B2g • Updated Feb 26 • 3.81k • 2
sdpkjc/NumBase-N01-S28-B28 • Updated Feb 26 • 1.78k • 1
sdpkjc/NumBase-N01-S28-B24 • Updated Feb 26 • 762 • 1
sdpkjc/NumBase-N01-S24-B2g • Updated Feb 26 • 210 • 2
sdpkjc/NumBase-N01-S24-B28 • Updated Feb 26 • 98
sdpkjc/NumBase-N01-S24-B24 • Updated Feb 26 • 42