Yichi Zhang

zycheiheihei

AI & ML interests

None yet

Recent Activity

updated a model about 2 months ago

zycheiheihei/BackTrack_1e-5_SFT_1e-6_DPO

updated a model about 2 months ago

zycheiheihei/BackTrack_5e-6_SFT_1e-6_DPO

published a model about 2 months ago

zycheiheihei/BackTrack_5e-6_SFT_1e-6_DPO

Organizations

upvoted a collection 10 months ago

STAIR

Datasets and Models for STAIR (Improving Safety Alignment with Introspective Reasoning) • 7 items • Updated Feb 26, 2025 • 1

upvoted a paper 10 months ago

GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training

Paper • 2503.08525 • Published Mar 11, 2025 • 17

upvoted a paper over 1 year ago

Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study

Paper • 2406.07057 • Published Jun 11, 2024 • 17