Shiyu Huang's picture

7 5 14

Shiyu Huang

ShiyuHuang

·

https://huangshiyu13.github.io/

AI & ML interests

RL, Game AI, NLP, CV

Recent Activity

commented on a paper 9 days ago

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents

updated a collection 14 days ago

video_benchmark

updated a collection 14 days ago

video_benchmark

View all activity

Organizations

ShiyuHuang's activity

commented a paper 9 days ago

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents

Paper • 2503.01935 • Published 11 days ago • 24 •

updated a collection 14 days ago

video_benchmark

3 items • Updated 14 days ago

upvoted a paper 14 days ago

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Paper • 2501.12380 • Published Jan 21 • 84

updated a collection 14 days ago

Reasoning

2 items • Updated 14 days ago

New activity in THUDM/cogvlm2-llama3-caption about 2 months ago

keep mentioning "bilibili" watermark

#6 opened 4 months ago by

中文效果怎么样呢

#1 opened 6 months ago by

authored a paper 2 months ago

MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models

Paper • 2501.02955 • Published Jan 6 • 40

liked a dataset 2 months ago

THUDM/MotionBench

Viewer • Updated Jan 8 • 5k • 1.48k • 2

upvoted a paper 2 months ago

MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models

Paper • 2501.02955 • Published Jan 6 • 40

authored a paper 2 months ago

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Paper • 2412.21059 • Published Dec 30, 2024 • 19

liked a dataset 2 months ago

AIWinter/LVBench

Updated Sep 13, 2024 • 288 • 3

updated a Space 2 months ago

MotionBench Leaderboard

Submit and view model evaluations on a leaderboard

liked a model 2 months ago

THUDM/VisionReward-Video

Text Generation • Updated Jan 1 • 4.32k • 5

liked a Space 3 months ago

MotionBench Leaderboard

Submit and view model evaluations on a leaderboard

updated 3 Spaces 3 months ago

LVBench Leaderboard

Submit model evaluations to a leaderboard

MotionBench Leaderboard

Submit and view model evaluations on a leaderboard

MotionBench Leaderboard

Submit and view model evaluations on a leaderboard