Jaehyun Jun's picture

Jaehyun Jun

btjhjeon

·

https://btjhjeon.github.io/

btjhjeon

AI & ML interests

Multimodal

Recent Activity

updated a collection 3 days ago

Multimodal Alignment

updated a collection 3 days ago

Multimodal Alignment

updated a collection 3 days ago

Multimodal Agent

View all activity

Organizations

upvoted 2 papers 27 days ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26, 2025 • 148

SIMA 2: A Generalist Embodied Agent for Virtual Worlds

Paper • 2512.04797 • Published about 1 month ago • 24

upvoted 4 papers about 1 month ago

Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch

Paper • 2512.02395 • Published Dec 2, 2025 • 47

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 244

UniGame: Turning a Unified Multimodal Model Into Its Own Adversary

Paper • 2511.19413 • Published Nov 24, 2025 • 20

NVIDIA Nemotron Parse 1.1

Paper • 2511.20478 • Published Nov 25, 2025 • 21

upvoted 2 papers about 2 months ago

GUI-360: A Comprehensive Dataset and Benchmark for Computer-Using Agents

Paper • 2511.04307 • Published Nov 6, 2025 • 14

NVIDIA Nemotron Nano V2 VL

Paper • 2511.03929 • Published Nov 6, 2025 • 27

upvoted 2 papers 2 months ago

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Paper • 2510.23538 • Published Oct 27, 2025 • 96

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30, 2025 • 108

upvoted a collection 3 months ago

Qwen3

84 items • Updated 4 days ago • 1.54k

upvoted 2 papers 3 months ago

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Paper • 2509.18154 • Published Sep 16, 2025 • 52

Qwen3-Omni Technical Report

Paper • 2509.17765 • Published Sep 22, 2025 • 143

upvoted 2 collections 3 months ago

Qwen3-VL

37 items • Updated 4 days ago • 555

Qwen3-Omni

6 items • Updated 4 days ago • 177

upvoted 4 papers 4 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 195

Robix: A Unified Model for Robot Interaction, Reasoning and Planning

Paper • 2509.01106 • Published Sep 1, 2025 • 51

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 259

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 211

upvoted a paper 5 months ago

Beyond the Trade-off: Self-Supervised Reinforcement Learning for Reasoning Models' Instruction Following

Paper • 2508.02150 • Published Aug 4, 2025 • 36