This contains the weights of a sparse autoencoder I trained on the residual activations of Mistral-7B-Instruct-v0.1. I used The Pile (uncopyrighted) for the training data. As of right now, I have only trained a single SAE (on layer 16), though I may do more in the future.
The easiest way to use the model is with the SAE Lens library.
Here is the training repo.
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support
HF Inference deployability: The model has no library tag.