Commit History
params are adam_*, not adamw_*
19cf0bd
Fix future deprecation push_to_hub_model_id
e79c8e6
add validation and tests for adamw hyperparam
cb9d3af
Merge branch 'main' into flash-optimum
fd2c981
new validation for mpt w grad checkpoints
14668fa
add streaming dataset support for pretraining datasets
eea2731
more gpt-neox long ctx fixes
ab5cd28
fix bettertransformers save, force it to skip after saving correctly in callback
1a82082
more tweaks to do pre-training with bettertransformers
1210dc8
add validation/warning for bettertransformers and torch version
71a43f8
add support for optimum bettertransformers
1edc30c
Fix failing test
bfd27ba
Validate falcon with fsdp
babf0fd
Update doc for grad_accu and add validation tests for batch size
3c71c8d
add support for gradient accumulation steps
3aad5f3
Black formatting
b832a0a
Lint validation
c2dbf2c
new hf_use_auth_token setting so login to hf isn't required
1c33eb8
Feat: Update validate_config and add tests
52dd92a
Add warning for `trust_remote_code`
2824423
Merge branch 'main' into refactor/rename-4b-to-gptq
147241c
checking for False is not sufficient for NoneType/unset configs
4c500f5
refactor(param): rename load_4bit config param by gptq
dd00657
Thytu
committed on