Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Sunny Panchal's picture

Sunny Panchal

NeuralStorm

AI & ML interests

None yet

Organizations

None yet

Collections 4

Tasks/Benchmarks/Envs

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Paper • 2412.14161 • Published Dec 18, 2024 • 51

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

Paper • 2412.15204 • Published Dec 19, 2024 • 33

models

None public yet

datasets

None public yet

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs