This model is very impressive. Could you create a small version with the same vocabulary for speculative decoding, say, a 0.6B?
· Sign up or log in to comment