Roman256's picture
2 7

Roman256

Roman12322
ยท

AI & ML interests

None yet

Recent Activity

reacted to Kseniase's post with ๐Ÿ”ฅ 18 days ago
8 New Applications of Test-Time Scaling We've noticed a huge interest in test-time scaling (TTS), so we decided to explore this concept further. Test-time compute (TTC) refers to the amount of computational power used by an AI model when generating a response. Many researchers are now focused on scaling TTC, as it enables slow, deep "thinking" and step-by-step reasoning, which improves overall models' performance. Here are 8 fresh studies on test-time scaling: 1. https://huggingface.co/papers/2502.05171 Introduces an LM that scales TTC by reasoning in latent space instead of generating more tokens with no special training. Here, a recurrent block to processes information iteratively. 2. https://huggingface.co/papers/2502.04728 Shows how TTS is applied to enhance model's Planning Domain Definition Language (PDDL) reasoning capabilities, which can be used to generate a symbolic world model. 3. https://huggingface.co/papers/2502.06703 Analyzes optimal TTS strategies and shows how small models can outperform much larger ones. 4. https://huggingface.co/papers/2502.04128 Shows how TTS improves expressiveness, timbre consistency and accuracy in speech synthesis with Llasa framework. It also dives into benefits of scaling train-time compute. 5. https://huggingface.co/papers/2502.07154 Suggests a modified training loss for better reasoning of LLMs when scaling TTC. 6. https://huggingface.co/papers/2502.05078 Unifies the strengths of chain, tree, and graph paradigms into one framework that expands reasoning only on necessary subproblems. 7. https://huggingface.co/papers/2502.01839 Explores scaling trends of self-verification and how to improve its capabilities with TTC. 8. https://huggingface.co/papers/2501.14723 Explores how scaling serial compute (iterations) and parallel compute (trajectories), can improve accuracy in real-world software engineering issues. Also, explore our article about TTS for more -> https://huggingface.co/blog/Kseniase/testtimecompute
reacted to Kseniase's post with โž• 18 days ago
8 Free Sources about AI Agents: Agents seem to be everywhere and this collection is for a deep dive into the theory and practice: 1. "Agents" Google's whitepaper by Julia Wiesinger, Patrick Marlow and Vladimir Vuskovic -> https://www.kaggle.com/whitepaper-agents Covers agents, their functions, tool use and how they differ from models 2. "Agents in the Long Game of AI. Computational Cognitive Modeling for Trustworthy, Hybrid AI" book by Marjorie McShane, Sergei Nirenburg, and Jesse English -> https://direct.mit.edu/books/oa-monograph/5833/Agents-in-the-Long-Game-of-AIComputational Explores building AI agents, using Hybrid AI, that combines ML with knowledge-based reasoning 3. "AI Engineer Summit 2025: Agent Engineering" 8-hour video -> https://www.youtube.com/watch?v=D7BzTxVVMuw Experts' talks that share insights on the freshest Agent Engineering advancements, such as Google Deep Research, scaling tips and more 4. AI Agents Course from Hugging Face -> https://huggingface.co/learn/agents-course/en/unit0/introduction Agents' theory and practice to learn how to build them using top libraries and tools 5. "Artificial Intelligence: Foundations of Computational Agents", 3rd Edition, book by David L. Poole and Alan K. Mackworth -> https://artint.info/3e/html/ArtInt3e.html Agents' architectures, how they learn, reason, plan and act with certainty and uncertainty 6. "Intelligent Agents: Theory and Practice" book by Michael Wooldridge -> https://www.cs.ox.ac.uk/people/michael.wooldridge/pubs/ker95/ker95-html.html A fascinating option to dive into how agents were seen in 1995 and explore their theory, architectures and agent languages 7. The Turing Post articles "AI Agents and Agentic Workflows" on Hugging Face -> https://huggingface.co/Kseniase We explore agentic workflows in detail and agents' building blocks, such as memory and knowledge 8. Our collection "8 Free Sources to Master Building AI Agents" -> https://www.turingpost.com/p/building-ai-agents-sources
View all activity

Organizations

None yet

models

None public yet

datasets

None public yet