Yuran Wang
Ryann829
AI & ML interests
Multimodal Large Language Model
Recent Activity
authored
a paper
about 6 hours ago
CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation
upvoted
a
paper
about 12 hours ago
CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation
updated
a dataset
2 days ago
Ryann829/SconeEval