Ju He's picture

1 16 8

Ju He

turkeyju

·

https://tacju.github.io/

TACJu

AI & ML interests

None yet

Recent Activity

updated a dataset 13 days ago

ccvl/ReVision-Panda

upvoted a paper 13 days ago

UniTok: A Unified Tokenizer for Visual Generation and Understanding

upvoted a paper 13 days ago

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

View all activity

Organizations

turkeyju's activity

upvoted 3 papers 13 days ago

UniTok: A Unified Tokenizer for Visual Generation and Understanding

Paper • 2502.20321 • Published 14 days ago • 29

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

Paper • 2502.20172 • Published 14 days ago • 27

Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation

Paper • 2502.20388 • Published 14 days ago • 15

upvoted 3 papers about 1 month ago

COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation

Paper • 2502.02589 • Published Feb 4 • 10

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 111

Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step

Paper • 2501.13926 • Published Jan 23 • 37

upvoted a paper about 2 months ago

Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens

Paper • 2501.07730 • Published Jan 13 • 16

upvoted 2 papers 3 months ago

Flowing from Words to Pixels: A Framework for Cross-Modality Evolution

Paper • 2412.15213 • Published Dec 19, 2024 • 26

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published Dec 12, 2024 • 90

upvoted 3 papers 4 months ago

AnimateAnything: Consistent and Controllable Animation for Video Generation

Paper • 2411.10836 • Published Nov 16, 2024 • 22

Generative World Explorer

Paper • 2411.11844 • Published Nov 18, 2024 • 76

Randomized Autoregressive Visual Generation

Paper • 2411.00776 • Published Nov 1, 2024 • 17

upvoted a paper 5 months ago

Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens

Paper • 2410.13863 • Published Oct 17, 2024 • 37

upvoted a collection 8 months ago

ViTamin Family

Designing Scalable Vision Models in the Vision-language Era. The best performing model is 'jienengchen/ViTamin-XL-384px'. • 16 items • Updated Apr 11, 2024 • 8

upvoted a paper 9 months ago

An Image is Worth 32 Tokens for Reconstruction and Generation

Paper • 2406.07550 • Published Jun 11, 2024 • 58

upvoted a paper 11 months ago

COCONut: Modernizing COCO Segmentation

Paper • 2404.08639 • Published Apr 12, 2024 • 29