Native Multimodal Models are World Learners 🌍
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation
RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics
Organization Card
spaces
17
pinned
Restarting
on
CPU Upgrade
124
Open Chinese LLM Leaderboard
🏆
Explore and submit LLM benchmarks
pinned
Running
6
Open Flageval Vlm Leaderboard
🥇
FlagEval VLM Leaderboard
Sleeping
21
URSA-1.7B-FSQ320
🎞
URSA Text-to-Image-to-Video
Running
6
EmbodiedVerse
🐢
Explore and compare model evaluations
Running
Featured
78
MTVCraft
👁
Open Veo3-style Audio-Video Generation
Running
6
Openseek
😻
Search for information using keywords
models
163
BAAI/Emu3.5-VisionTokenizer
0.5B
•
Updated
•
134
•
21
BAAI/Emu3.5-Image
Image-Text-to-Image
•
34B
•
Updated
•
181
•
66
BAAI/Emu3.5
Any-to-Any
•
34B
•
Updated
•
179
•
164
BAAI/URSA-0.6B-IBQ1024
Text-to-Image
•
Updated
•
4
•
3
BAAI/URSA-0.6B-FSQ320
Text-to-Video
•
Updated
•
3
BAAI/URSA-1.7B-IBQ1024
Text-to-Image
•
Updated
•
25
•
3
BAAI/URSA-1.7B-FSQ320
Text-to-Video
•
Updated
•
63
•
7
BAAI/RoboBrain-X0-Preview
Robotics
•
4B
•
Updated
•
17
•
10
BAAI/bge-reasoner-embed-qwen3-8b-0923
Feature Extraction
•
8B
•
Updated
•
250
•
24
BAAI/bge-multilingual-gemma2
Feature Extraction
•
9B
•
Updated
•
216k
•
•
193
datasets
118
BAAI/CI-VID
Viewer
•
Updated
•
342k
•
7.27k
•
4
BAAI/MOVE
Updated
•
8
•
2
BAAI/Infinity-Instruct
Viewer
•
Updated
•
21.9M
•
15k
•
688
BAAI/Chinese-LiPS
Viewer
•
Updated
•
36.2k
•
246
•
7
BAAI/SeniorTalk
Viewer
•
Updated
•
60.1k
•
999
•
28
BAAI/RefSpatial-Bench
Viewer
•
Updated
•
277
•
1.64k
•
16
BAAI/RoboBrain-X0-Sample-Data
Viewer
•
Updated
•
200
•
235
•
1
BAAI/RoboBrain-X0-Dataset
Viewer
•
Updated
•
61.6k
•
2.74k
•
9
BAAI/ROME
Viewer
•
Updated
•
281
•
416
•
5
BAAI/RealTalk-CN
Updated
•
55
•
9
