Custom GGUF quants of Meta’s Llama-3.2-Instruct's finetunes, where the Output Tensors are quantized to Q8_0 or F32 and the Embeddings are kept @F32
Joseph
Joseph717171
AI & ML interests
None yet
Recent Activity
reacted
to
tegridydev's
post
with 🚀
about 11 hours ago
So, what is #MechanisticInterpretability 🤔
Mechanistic Interpretability (MI) is the discipline of opening the black box of large language models (and other neural networks) to understand the underlying circuits, features and/or mechanisms that give rise to specific behaviours
Instead of treating a model as a monolithic function, we can:
1. Trace how input tokens propagate through attention heads & MLP layers
2. Identify localized “circuit motifs”
3. Develop methods to systematically break down or “edit” these circuits to confirm we understand the causal structure.
Mechanistic Interpretability aims to yield human-understandable explanations of how advanced models represent and manipulate concepts which hopefully leads to
1. Trust & Reliability
2. Safety & Alignment
3. Better Debugging / Development Insights
https://bsky.app/profile/mechanistics.bsky.social/post/3lgvvv72uls2x
reacted
to
tegridydev's
post
with đź‘€
about 11 hours ago
So, what is #MechanisticInterpretability 🤔
Mechanistic Interpretability (MI) is the discipline of opening the black box of large language models (and other neural networks) to understand the underlying circuits, features and/or mechanisms that give rise to specific behaviours
Instead of treating a model as a monolithic function, we can:
1. Trace how input tokens propagate through attention heads & MLP layers
2. Identify localized “circuit motifs”
3. Develop methods to systematically break down or “edit” these circuits to confirm we understand the causal structure.
Mechanistic Interpretability aims to yield human-understandable explanations of how advanced models represent and manipulate concepts which hopefully leads to
1. Trust & Reliability
2. Safety & Alignment
3. Better Debugging / Development Insights
https://bsky.app/profile/mechanistics.bsky.social/post/3lgvvv72uls2x
liked
a model
about 11 hours ago
bartowski/Virtuoso-Lite-GGUF
Organizations
Collections
3
Custom GGUF quants of Llama-3.1-8B-Instruct fine-tunes, where the Output Tensors are quantized to Q8_0 while the Embeddings are kept at F32. 🧠🔥🚀
-
Joseph717171/Hermes-3-Llama-3.1-8B-OQ8_0-F32.EF32.IQ4_K-Q8_0-GGUF
Updated • 646 • 2 -
Joseph717171/Llama-3.1-SuperNova-Lite-8.0B-OQ8_0-F32.EF32.IQ4_K-Q8_0-GGUF
Updated • 366 • 2 -
Joseph717171/Hermes-3-Llama-3.1-8B_TIES_with_base_Embeds_Initialized_dtypeF32-OQ8_0-F32.EF32.IQ4_K-Q8_0-GGUF
Updated • 85 • 1
models
32
Joseph717171/DeepSeek-R1-Distill-Llama-8B-OQ8_0-F32.EF32.IQ4_K-Q8_0-GGUF
Updated
•
1.24k
•
2
Joseph717171/Llama-3.1-SuperNova-Lite-8.0B-OQ8_0-F32.EF32.IQ4_K-Q8_0-GGUF
Updated
•
366
•
2
Joseph717171/Hermes-3-Llama-3.1-8B-OQ8_0-F32.EF32.IQ4_K-Q8_0-GGUF
Updated
•
646
•
2
Joseph717171/Models
Updated
•
869
•
4
Joseph717171/Granite-3.1-8B-instruct-OQ8_0-F32.EF32.IQ4_K-Q8_0-GGUF
Updated
•
37
Joseph717171/Imatrices
Updated
•
3
Joseph717171/Hermes-3-Llama-3.2-3B-OQ8_0-F32.EF32.IQ4_K-Q8_0-GGUF
Updated
•
112
•
1
Joseph717171/Llama-3.1-SuperNova-Lite-14B
Text Generation
•
Updated
•
1
Joseph717171/SuperNova-Lite-Hermes-3-Llama-3.1-8B_TIES_with_base_Embeddings_Pre-Initialized-dtypeF32
Text Generation
•
Updated
•
2
•
2
Joseph717171/Llama-3.1-8B-InitializedEmbeddings_with_Hermes-3
Text Generation
•
Updated
•
106
datasets
None public yet