3 50 214

Kristoffer Rolf Deinoff

gatepoet

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs

upvoted a paper 3 days ago

LoRACode: LoRA Adapters for Code Embeddings

upvoted a paper 3 days ago

Learning from Failures in Multi-Attempt Reinforcement Learning

View all activity

Organizations

None yet

gatepoet's activity

upvoted a paper 2 days ago

DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs

Paper • 2503.07067 • Published 4 days ago • 27

upvoted 2 papers 3 days ago

LoRACode: LoRA Adapters for Code Embeddings

Paper • 2503.05315 • Published 7 days ago • 8

Learning from Failures in Multi-Attempt Reinforcement Learning

Paper • 2503.04808 • Published 10 days ago • 17

upvoted a paper 4 days ago

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Paper • 2503.04724 • Published 7 days ago • 60

upvoted a paper 6 days ago

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published 8 days ago • 77

upvoted 2 papers 13 days ago

DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks

Paper • 2502.17157 • Published 18 days ago • 51

Thus Spake Long-Context Large Language Model

Paper • 2502.17129 • Published 18 days ago • 68

upvoted a paper 15 days ago

The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve?

Paper • 2502.17535 • Published 18 days ago • 8

upvoted 2 papers 16 days ago

Slamming: Training a Speech Language Model on One GPU in a Day

Paper • 2502.15814 • Published 23 days ago • 66

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

Paper • 2502.16894 • Published 18 days ago • 27

upvoted a collection 28 days ago

Recurrent Models

Collection

These are checkpoints for recurrent LLMs developed to scale test-time compute by recurring in latent space. • 14 items • Updated Feb 10 • 5

upvoted a paper about 1 month ago

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Paper • 2502.04128 • Published Feb 6 • 25

upvoted 3 papers about 2 months ago

Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass

Paper • 2501.13928 • Published Jan 23 • 17

FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces

Paper • 2501.12909 • Published Jan 22 • 68

O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning

Paper • 2501.12570 • Published Jan 22 • 24

upvoted an article about 2 months ago

Article

The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about...

•

Jan 20

• 63

upvoted a paper about 2 months ago

RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation

Paper • 2501.08617 • Published Jan 15 • 10

upvoted 2 papers 4 months ago

DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation

Paper • 2411.16657 • Published Nov 25, 2024 • 19

Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published Nov 26, 2024 • 52

upvoted a paper 8 months ago

SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models

Paper • 2407.15841 • Published Jul 22, 2024 • 40