Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
10
23
3
Jiajie Zhang
NeoZ123
Follow
tsq2000's profile picture
Zacchinardi's profile picture
sbrandeis's profile picture
8 followers
·
2 following
Neo-Zhangjiajie
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 4 hours ago
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards
submitted
a paper
about 5 hours ago
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards
published
a dataset
1 day ago
THU-KEG/CaRR-DeepDive
View all activity
Organizations
NeoZ123
's models
2
Sort: Recently updated
NeoZ123/LongReward-llama3.1-8b-SFT
Text Generation
•
9B
•
Updated
Oct 29, 2024
•
8
•
1
NeoZ123/LongReward-glm4-9b-SFT
Text Generation
•
9B
•
Updated
Oct 29, 2024
•
7