Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources Paper ā¢ 2504.00595 ā¢ Published 2 days ago ā¢ 23
Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation Paper ā¢ 2503.24379 ā¢ Published 3 days ago ā¢ 52
Dolphin: A Large-Scale Automatic Speech Recognition Model for Eastern Languages Paper ā¢ 2503.20212 ā¢ Published 8 days ago ā¢ 3
š March 2025 - Open releases from the Chinese community Collection 30 items ā¢ Updated about 18 hours ago ā¢ 12
Wan: Open and Advanced Large-Scale Video Generative Models Paper ā¢ 2503.20314 ā¢ Published 8 days ago ā¢ 44
DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning Paper ā¢ 2503.15265 ā¢ Published 15 days ago ā¢ 44
FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis Paper ā¢ 2503.13265 ā¢ Published 17 days ago ā¢ 15
RWKV-7 "Goose" with Expressive Dynamic State Evolution Paper ā¢ 2503.14456 ā¢ Published 16 days ago ā¢ 130
APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs Paper ā¢ 2502.12085 ā¢ Published Feb 17 ā¢ 4
Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model Paper ā¢ 2503.07703 ā¢ Published 24 days ago ā¢ 34