Commit History
chore(config): clean up old log for Qwen (#1034)
74532dd
unverified
use recommended setting for use_reentrant w gradient checkpointing (#1021)
4d2e842
unverified
Fix: bf16 support for inference (#981)
3678a6c
unverified
Adds chat templates (#1022)
f8ae59b
unverified
[WandB] Push axolotl config to top level wandb files (#1014)
4f4d638
unverified
add ultrachat prompt strategies (#996)
ba043a3
unverified
feat: remove need to add load_in* during merge (#1017)
f6ecf14
unverified
remove landmark attn and xpos rope implementations (#1010)
70b46ca
unverified
add config to model card (#1005)
85dd4d5
unverified
FEAT: add tagging support to axolotl (#1004)
db9094d
unverified
Feat: Warns to add to modules_to_save when adding tokens or switching special_tokens (#787)
1ffa386
unverified
fix mistral prompt assembly (#982)
7bbaac9
unverified
fix: add lr scheduler kwargs to Trainer (#972)
13e9381
unverified
Fix prompt assembly for llama (#952)
5ada140
unverified
fix: switch to using the HuggingFace Transformers NEFT implementation (#941)
ef24342
unverified
kallewoof
commited on
Fix Deepspeed loading (#950)
5ea3aa3
unverified
Flash attn hotfix (#951)
f1f60cb
unverified
fix: remove excessive newlines in system prompt(s) for alpaca (#936)
450e04d
unverified
kallewoof
commited on