This model exists to test the conversion between Megatron-LM and transformers. It is a small GPT-2-like model that was used to debug the conversion script. Use it only for integration tests.
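As a rough illustration of the integration-test use described above, the following minimal sketch loads the checkpoint and counts its parameters. It assumes the `transformers` library (with PyTorch) is installed and that the repository's config resolves through `AutoModelForCausalLM`, which is an assumption and not something this card states. The helper names `count_parameters`, `load_test_model`, and `smoke_test` are hypothetical.

```python
# Repository id as shown on the model page.
MODEL_ID = "bigscience/bigscience-small-testing"


def count_parameters(model):
    """Return the total number of elements across all model parameters."""
    return sum(p.numel() for p in model.parameters())


def load_test_model(model_id=MODEL_ID):
    """Load the checkpoint from the Hub (hypothetical helper).

    Assumption: the repo's config maps to a causal LM class, so
    AutoModelForCausalLM can resolve it.
    """
    # Imported lazily so this module can be imported without transformers.
    from transformers import AutoModelForCausalLM

    return AutoModelForCausalLM.from_pretrained(model_id)


def smoke_test():
    """Download the model and report its parameter count (needs network)."""
    model = load_test_model()
    total = count_parameters(model)
    print(f"{MODEL_ID}: {total:,} parameters")
    return total
```

Calling `smoke_test()` downloads the checkpoint, so it belongs in an integration test and not in a unit test. The printed count may differ from the figure reported on the page, depending on how tied weights are counted.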
Model size: 16.2M params (Safetensors)
Tensor type: BF16
Model tree for bigscience/bigscience-small-testing