BOUKOUFFALLAH Abdallah

iBado

Abdellahbado

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities

upvoted a paper about 1 month ago

A Survey of Context Engineering for Large Language Models

upvoted a paper about 2 months ago

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Organizations

None yet

upvoted 2 papers about 1 month ago

Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities

Paper • 2507.13158 • Published Jul 17 • 24

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 248

upvoted 2 papers about 2 months ago

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published Jul 14 • 88

AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs

Paper • 2507.05687 • Published Jul 8 • 26

upvoted 2 papers 2 months ago

A Survey on Vision-Language-Action Models: An Action Tokenization Perspective

Paper • 2507.01925 • Published Jul 2 • 37

From Bytes to Ideas: Language Modeling with Autoregressive U-Nets

Paper • 2506.14761 • Published Jun 17 • 17

upvoted 8 papers 3 months ago

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Paper • 2506.14245 • Published Jun 17 • 42

Scaling Test-time Compute for LLM Agents

Paper • 2506.12928 • Published Jun 15 • 62

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 120

Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model

Paper • 2505.17894 • Published May 23 • 220

upvoted 6 papers 4 months ago

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17 • 122

AdaptThink: Reasoning Models Can Learn When to Think

Paper • 2505.13417 • Published May 19 • 82

Transformer Interpretability Beyond Attention Visualization

Paper • 2012.09838 • Published Dec 17, 2020 • 1

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published May 15 • 120

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Paper • 2505.04921 • Published May 8 • 186

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 184
