DiDisama/eagle_x4_7b_clip_pix2struct_conv_det
9B
•
Updated
•
9
[ICLR 2026] Official Implementation of paper 'Investigating Redundancy in Multimodal Large Language Models with Multiple Vision Encoders'