LRQ-DiT: Log-Rotation Post-Training Quantization of Diffusion Transformers for Image and Video Generation Paper • 2508.03485 • Published Aug 5, 2025
Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs Paper • 2508.14896 • Published Aug 20, 2025
MoPE-CLIP: Structured Pruning for Efficient Vision-Language Models with Module-wise Pruning Error Metric Paper • 2403.07839 • Published Mar 12, 2024
TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation Paper • 2505.05422 • Published May 8, 2025