Model Description

SpikeGPT-OpenWebText-216M is a L18-D768 SpikeGPT model trained on OpenWebText. See https://github.com/ridgerchu/SpikeGPT for details.

ctx_len = 1024 n_layer = 18 n_embd = 768

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API was unable to determine this model's library.

Dataset used to train ridger/SpikeGPT-OpenWebText-216M