QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge Paper • 2503.16709 • Published 13 days ago • 2
DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network Paper • 2303.02165 • Published Mar 5, 2023
Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the Edge Paper • 2312.05693 • Published Dec 9, 2023 • 1
EdgeQAT: Entropy and Distribution Guided Quantization-Aware Training for the Acceleration of Lightweight LLMs on the Edge Paper • 2402.10787 • Published Feb 16, 2024