michaelbenayoun/deepseekv3-tiny-4kv-heads-4-layers-random • Text Generation • 0.0B • Updated Jul 24 • 16
michaelbenayoun/llama-2-tiny-4kv-heads-2layers-random • Feature Extraction • 0.0B • Updated May 7, 2024 • 4
michaelbenayoun/llama-2-tiny-4kv-heads-8layers-random • Feature Extraction • 0.0B • Updated May 3, 2024 • 2
michaelbenayoun/llama-2-tiny-16layers-32kv-heads-random • Feature Extraction • 0.0B • Updated Jan 4, 2024 • 8
michaelbenayoun/mistral-tiny-4layers-8kv-heads-random • Text Generation • 0.0B • Updated Nov 9, 2023 • 4