AI2 WildBench Leaderboard (V2)
Display and explore a leaderboard of language models
Display LMArena Leaderboard
Embedding Leaderboard
Track, rank and evaluate open LLMs and chatbots
Calculate GPU requirements for running LLMs
Identify key entities in text
Browse and filter leaderboard of language models
Generate text from document images
Analyze document layout from images
Extract text from documents using images or PDFs
Submit model evaluation results to leaderboard
Generate descriptions and answers about images
Efficient quantized retrieval over Wikipedia
Display and analyze reward model evaluation results
Identify objects in images based on text descriptions
Display image analysis results
VLMEvalKit Evaluation Results Collection
Display interactive web apps
Visualize Open vs. Proprietary LLM Progress
Upload a PDF and ask questions about its content
Submit and evaluate models on GAIA benchmark
Identify and highlight key entities in text
Explore and analyze code completion benchmarks
Create a Hugging Face dataset from text files
Generate speech from text in multiple languages
Generate captions and analyze images with various tasks
Generate React TypeScript App
Video captioning/tracking
Explore visual document retrieval benchmark results
In-browser speech recognition w/ word-level timestamps
Generate insights from charts using text prompts
Need to analyze data? Let a Llama-3.1 agent do it for you!
Display a text analysis tool
View and submit language model evaluations
Detect objects in images using text prompts
VLMEvalKit Eval Results in video understanding benchmark
Extract text from images using various OCR modes
Generate a leaderboard for evaluating language models
Remove background from any image
Vote on AI responses to rank models
What happened in open-source AI this year, and what's next?
Generate interactive React app data visualizations
Detect and estimate human poses in images and videos
Generate interactive Jupyter notebooks with user input
Ranking of LLMs for agentic tasks
OmniParser, turn your LLM into a GUI agent
Enhance low-light images to improve clarity
PDF to Structured Data powered by Google DeepMind Gemini 2.0
Handwritten Signature Detection
Convert images and text into structured documents
Generate text and speech from text, audio, images, and videos
Detect faces in uploaded images
Convert PDFs to Markdown with open-source parsers
Remove background from images
A Unified Framework for Image Customization
Dolphin Demo
Create and enrich datasets using AI
Display OCR model leaderboard and evaluation data
Hand-controlled arpeggiator, drum machine, and visualizer
olmocr / nanonets ocr / qwen2vl ocr / aya vision / rolmocr
Display OCRBench leaderboard for text recognition models
camel doc ocr / core ocr / docscope ocr / monkey ocr
nanonets ocr / smoldocling / monkey ocr / typhoon ocr
Run GGUF directly in your browser!
Extract text from images and XML files using OCR models
AI Image Detection Demo
Kontext image editing on FLUX[dev]
Classify text with zero-shot classification
GLiClass for Reranking Sentence Pairs
High-accuracy vision & reasoning for complex tasks
Run code and analyze data in a Jupyter notebook