Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Haonan Zhang's picture
4 12 29

Haonan Zhang

haonanzhang
tnlin's profile picture RainBowLuo's profile picture
·
https://zchoi.github.io/
  • zchoi

AI & ML interests

AI & ML, Multi-modal Learning,Agent,LLM, etc.

Recent Activity

liked a dataset 3 days ago
HuggingFaceM4/FineVision
liked a model 3 days ago
meituan-longcat/LongCat-Flash-Chat
new activity 12 days ago
Tongyi-ConvAI/OmniCharacter-7B:Would you release the flow matching & Hifi-GAN model weights?
View all activity

Organizations

Tongyi-ConvAI's profile picture MagicBot's profile picture

haonanzhang 's collections 1

Papers
  • MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens

    Paper • 2406.11271 • Published Jun 17, 2024 • 21
  • Diffusion Curriculum: Synthetic-to-Real Generative Curriculum Learning via Image-Guided Diffusion

    Paper • 2410.13674 • Published Oct 17, 2024 • 17
  • Mini-Omni2: Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities

    Paper • 2410.11190 • Published Oct 15, 2024 • 22
Papers
  • MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens

    Paper • 2406.11271 • Published Jun 17, 2024 • 21
  • Diffusion Curriculum: Synthetic-to-Real Generative Curriculum Learning via Image-Guided Diffusion

    Paper • 2410.13674 • Published Oct 17, 2024 • 17
  • Mini-Omni2: Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities

    Paper • 2410.11190 • Published Oct 15, 2024 • 22
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs