SeongWan Kim

idgmatrix

AI & ML interests

None yet

Recent Activity

upvoted a paper 10 minutes ago

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

upvoted a paper about 1 hour ago

Competitive Programming with Large Reasoning Models

upvoted a paper about 4 hours ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

View all activity

Organizations

None yet

idgmatrix's activity

upvoted a paper 10 minutes ago

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Paper • 2502.07316 • Published about 24 hours ago • 11

upvoted a paper about 1 hour ago

Competitive Programming with Large Reasoning Models

Paper • 2502.06807 • Published 8 days ago • 21

upvoted 2 papers about 4 hours ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published 5 days ago • 51

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published 1 day ago • 86

upvoted 3 papers 2 days ago

Fast Video Generation with Sliding Tile Attention

Paper • 2502.04507 • Published 5 days ago • 43

On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices

Paper • 2502.04363 • Published 7 days ago • 9

Goku: Flow Based Video Generative Foundation Models

Paper • 2502.04896 • Published 5 days ago • 61

upvoted 2 papers 6 days ago

Can LLMs Maintain Fundamental Abilities under KV Cache Compression?

Paper • 2502.01941 • Published 8 days ago • 11

Demystifying Long Chain-of-Thought Reasoning in LLMs

Paper • 2502.03373 • Published 7 days ago • 49

upvoted 2 papers 7 days ago

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published 9 days ago • 109

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published 9 days ago • 53

upvoted 3 papers 9 days ago

upvoted 2 papers 13 days ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 15 days ago • 102

Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published 15 days ago • 33

upvoted 2 papers 19 days ago

Autonomy-of-Experts Models

Paper • 2501.13074 • Published 21 days ago • 40

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 21 days ago • 316

upvoted a paper 20 days ago

Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Paper • 2501.12202 • Published 22 days ago • 33

upvoted a paper 22 days ago

GameFactory: Creating New Games with Generative Interactive Videos

Paper • 2501.08325 • Published 29 days ago • 61