Replaced get_max_length with get_max_cache_shape
#12 opened 4 months ago by MHGN
The model only contains mamba and no attention
#11 opened 7 months ago by AscendingGrass
Nvidia ChatRTX
#8 opened 9 months ago by monostronomy