pratikbhavsar
AI & ML interests
LLM agents, evaluation & reasoning
Recent Activity
Organizations
published an article about 2 months ago
Launching Agent Leaderboard v2: The Enterprise-Grade Benchmark for AI Agents
Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios