Decode AttnGate for Reasoning Models
-
SeerAttention/SeerAttention-Decode-Qwen3-14B-AttnGates
Text Generation • Updated • 1.12k -
SeerAttention/SeerAttention-Decode-Qwen3-4B-AttnGates
Text Generation • Updated • 1.34k • 2 -
SeerAttention/SeerAttention-Decode-Qwen3-8B-AttnGates
Text Generation • Updated • 1.29k -
SeerAttention/SeerAttention-Decode-R1-Distill-Qwen-14B-AttnGates
Text Generation • Updated • 137