Sjia Chen

SjiaChen · CSJDeveloper

AI & ML interests

None yet

SjiaChen's activity

liked a dataset 6 days ago

MMMU/MMMU

Viewer • Updated Sep 19, 2024 • 11.6k • 44.5k • 233

upvoted a paper 21 days ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published 24 days ago • 142

upvoted 2 papers 2 months ago

Toward Adaptive Reasoning in Large Language Models with Thought Rollback

Paper • 2412.19707 • Published Dec 27, 2024 • 1

Boosting of Thoughts: Trial-and-Error Problem Solving with Large Language Models

Paper • 2402.11140 • Published Feb 17, 2024 • 1