Yang Yue

yueyang2000

AI & ML interests

None yet

Organizations

None yet

yueyang2000's activity

upvoted a paper 18 days ago

FAST: Efficient Action Tokenization for Vision-Language-Action Models

Paper • 2501.09747 • Published 22 days ago • 23

upvoted a paper about 2 months ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 129

upvoted 2 papers 2 months ago

Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Paper • 2412.04431 • Published Dec 5, 2024 • 17

GRAPE: Generalizing Robot Policy via Preference Alignment

Paper • 2411.19309 • Published Nov 28, 2024 • 44

liked a model 2 months ago

google/t5-v1_1-base

Text2Text Generation • Updated Jan 24, 2023 • 178k • 56

liked a Space 5 months ago

Kolors Virtual Try-On

👕 Virtual try-on for clothes on a person • 7.24k

upvoted a paper 5 months ago

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 125

upvoted 2 papers 6 months ago

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

Paper • 2408.11039 • Published Aug 20, 2024 • 58

Achieving Human Level Competitive Robot Table Tennis

Paper • 2408.03906 • Published Aug 7, 2024 • 27

liked a model 8 months ago

Salesforce/xgen-mm-phi3-mini-instruct-r-v1

Image-Text-to-Text • Updated 4 days ago • 1.3k • 186

upvoted a paper 9 months ago

KAN: Kolmogorov-Arnold Networks

Paper • 2404.19756 • Published Apr 30, 2024 • 109

liked a dataset 10 months ago

AbdomenAtlas/AbdomenAtlas1.0MiniBeta

Updated 22 days ago • 216 • 7

upvoted a paper 10 months ago

InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD

Paper • 2404.06512 • Published Apr 9, 2024 • 30

upvoted 4 papers 11 months ago

upvoted a paper 12 months ago

Subobject-level Image Tokenization

Paper • 2402.14327 • Published Feb 22, 2024 • 17

upvoted 2 papers about 1 year ago

Towards Conversational Diagnostic AI

Paper • 2401.05654 • Published Jan 11, 2024 • 17

Denoising Vision Transformers

Paper • 2401.02957 • Published Jan 5, 2024 • 29