Jiajie Zhang

NeoZ123 · Neo-Zhangjiajie

AI & ML interests

None yet

Recent Activity

upvoted a paper about 4 hours ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

submitted a paper about 5 hours ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

published a dataset 1 day ago

THU-KEG/CaRR-DeepDive

NeoZ123's models (2)

NeoZ123/LongReward-llama3.1-8b-SFT

Text Generation • 9B • Updated Oct 29, 2024 • 8 • 1

NeoZ123/LongReward-glm4-9b-SFT

Text Generation • 9B • Updated Oct 29, 2024 • 7