hjkim

hojie11

AI & ML interests

Computer Vision, 3D Vision, Anomaly Detection

Recent Activity

liked a model about 20 hours ago

google/gemma-3-27b-it

Organizations

None yet

hojie11's activity

upvoted 2 papers about 22 hours ago

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published 3 days ago • 87

MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice

Paper • 2503.05978 • Published 6 days ago • 29

upvoted 2 papers 2 days ago

Tuning-Free Multi-Event Long Video Generation via Synchronized Coupled Sampling

Paper • 2503.08605 • Published 2 days ago • 20

Video Action Differencing

Paper • 2503.07860 • Published 3 days ago • 26

upvoted 5 papers 3 days ago

DreamRelation: Relation-Centric Video Customization

Paper • 2503.07602 • Published 3 days ago • 12

PE3R: Perception-Efficient 3D Reconstruction

Paper • 2503.07507 • Published 3 days ago • 8

Automated Movie Generation via Multi-Agent CoT Planning

Paper • 2503.07314 • Published 3 days ago • 34

FedRand: Enhancing Privacy in Federated Learning with Randomized LoRA Subparameter Updates

Paper • 2503.07216 • Published 3 days ago • 26

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Paper • 2503.07365 • Published 3 days ago • 51

upvoted 4 papers 4 days ago

Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities

Paper • 2503.03983 • Published 8 days ago • 22

Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching

Paper • 2503.05179 • Published 7 days ago • 42

TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models

Paper • 2503.05638 • Published 6 days ago • 17

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published 7 days ago • 83

upvoted a paper 8 days ago

GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control

Paper • 2503.03751 • Published 8 days ago • 19

upvoted a paper 9 days ago

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published 10 days ago • 72

upvoted 4 papers 10 days ago

Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models

Paper • 2503.01774 • Published 10 days ago • 39

Chain of Draft: Thinking Faster by Writing Less

Paper • 2502.18600 • Published 16 days ago • 44

Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids

Paper • 2502.20396 • Published 14 days ago • 12

DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping

Paper • 2502.20900 • Published 13 days ago • 7

upvoted a paper 15 days ago

Towards an AI co-scientist

Paper • 2502.18864 • Published 16 days ago • 42