jp1924's picture

jp1924

jp1924

·

jp1924

AI & ML interests

Audio, Vision

Recent Activity

new activity 2 days ago

jp1924/DocStruct4M:데이터셋 업로드하신 방법에 대한 문의를 드리고 싶습니다.

updated a dataset 2 days ago

jp1924/DocStruct4M

upvoted a paper 3 days ago

VisionZip: Longer is Better but Not Necessary in Vision Language Models

View all activity

Organizations

jp1924's activity

upvoted a paper 3 days ago

VisionZip: Longer is Better but Not Necessary in Vision Language Models

Paper • 2412.04467 • Published Dec 5, 2024 • 106

upvoted a collection 14 days ago

ShareGPT Datasets

23 items • Updated 3 days ago • 4

upvoted a paper 29 days ago

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Paper • 2501.03895 • Published about 1 month ago • 48