Joseph's picture

Joseph

Joseph717171

·

AI & ML interests

None yet

Recent Activity

reacted to tegridydev's post with 🚀 about 11 hours ago

So, what is #MechanisticInterpretability 🤔 Mechanistic Interpretability (MI) is the discipline of opening the black box of large language models (and other neural networks) to understand the underlying circuits, features and/or mechanisms that give rise to specific behaviours Instead of treating a model as a monolithic function, we can: 1. Trace how input tokens propagate through attention heads & MLP layers 2. Identify localized “circuit motifs” 3. Develop methods to systematically break down or “edit” these circuits to confirm we understand the causal structure. Mechanistic Interpretability aims to yield human-understandable explanations of how advanced models represent and manipulate concepts which hopefully leads to 1. Trust & Reliability 2. Safety & Alignment 3. Better Debugging / Development Insights https://bsky.app/profile/mechanistics.bsky.social/post/3lgvvv72uls2x

reacted to tegridydev's post with 👀 about 11 hours ago

So, what is #MechanisticInterpretability 🤔 Mechanistic Interpretability (MI) is the discipline of opening the black box of large language models (and other neural networks) to understand the underlying circuits, features and/or mechanisms that give rise to specific behaviours Instead of treating a model as a monolithic function, we can: 1. Trace how input tokens propagate through attention heads & MLP layers 2. Identify localized “circuit motifs” 3. Develop methods to systematically break down or “edit” these circuits to confirm we understand the causal structure. Mechanistic Interpretability aims to yield human-understandable explanations of how advanced models represent and manipulate concepts which hopefully leads to 1. Trust & Reliability 2. Safety & Alignment 3. Better Debugging / Development Insights https://bsky.app/profile/mechanistics.bsky.social/post/3lgvvv72uls2x

liked a model about 11 hours ago

bartowski/Virtuoso-Lite-GGUF

View all activity

Organizations

Collections 3

models 32

Joseph717171/DeepSeek-R1-Distill-Llama-8B-OQ8_0-F32.EF32.IQ4_K-Q8_0-GGUF

Updated 9 days ago • 1.24k • 2

Joseph717171/Llama-3.1-SuperNova-Lite-8.0B-OQ8_0-F32.EF32.IQ4_K-Q8_0-GGUF

Updated 9 days ago • 366 • 2

Joseph717171/Hermes-3-Llama-3.1-8B-OQ8_0-F32.EF32.IQ4_K-Q8_0-GGUF

Updated 9 days ago • 646 • 2

Joseph717171/Models

Updated 9 days ago • 869 • 4

Joseph717171/Granite-3.1-8B-instruct-OQ8_0-F32.EF32.IQ4_K-Q8_0-GGUF

Updated Dec 19, 2024 • 37

Joseph717171/Imatrices

Updated Dec 19, 2024 • 3

Joseph717171/Hermes-3-Llama-3.2-3B-OQ8_0-F32.EF32.IQ4_K-Q8_0-GGUF

Updated Dec 13, 2024 • 112 • 1

Joseph717171/Llama-3.1-SuperNova-Lite-14B

Text Generation • Updated Nov 14, 2024 • 1

Joseph717171/SuperNova-Lite-Hermes-3-Llama-3.1-8B_TIES_with_base_Embeddings_Pre-Initialized-dtypeF32

Text Generation • Updated Oct 29, 2024 • 2 • 2

Joseph717171/Llama-3.1-8B-InitializedEmbeddings_with_Hermes-3

Text Generation • Updated Oct 27, 2024 • 106

datasets

None public yet