Yedidia AGNIMO

YedsonUQ

AI & ML interests

[Uncertainty Quantification, "Hallucinations"] in LLMs, Federated Learning

Recent Activity

updated a collection 2 days ago

Retrieval Augmented Generation (RAG)

updated a collection 2 days ago

Retrieval Augmented Generation (RAG)

upvoted a paper 2 days ago

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

View all activity

Organizations

None yet

YedsonUQ's activity

updated a collection 2 days ago

Retrieval Augmented Generation (RAG)

Collection

3 items • Updated 2 days ago

upvoted a paper 2 days ago

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published 4 days ago • 158

updated 2 collections 4 days ago

Findings

Collection

2 items • Updated 4 days ago

Reinforcement Learning (RL)

Collection

2 items • Updated 4 days ago

upvoted 2 papers 4 days ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 10 days ago • 100

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published 9 days ago • 51

updated a collection 9 days ago

Reasoning

Collection

4 items • Updated 9 days ago

upvoted 2 papers 10 days ago

Do generative video models learn physical principles from watching videos?

Paper • 2501.09038 • Published 24 days ago • 32

Reasoning Language Models: A Blueprint

Paper • 2501.11223 • Published 19 days ago • 31

updated a collection 10 days ago

Models Series

Collection

3 items • Updated 10 days ago

upvoted a paper 10 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 16 days ago • 302

updated a collection 10 days ago

Models Series

Collection

3 items • Updated 10 days ago

upvoted 4 papers 17 days ago

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published 22 days ago • 36

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published 24 days ago • 53

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 22 days ago • 105

Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong

Paper • 2501.09775 • Published 22 days ago • 28