"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization Paper • 2411.02355 • Published Nov 4, 2024 • 49
LLM Pruning and Distillation in Practice: The Minitron Approach Paper • 2408.11796 • Published Aug 21, 2024 • 58
NanoFlow: Towards Optimal Large Language Model Serving Throughput Paper • 2408.12757 • Published Aug 22, 2024 • 18
Configurable Foundation Models: Building LLMs from a Modular Perspective Paper • 2409.02877 • Published Sep 4, 2024 • 29
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model Paper • 2409.01704 • Published Sep 3, 2024 • 83