Commit History
Disable caching on `--disable_caching` in CLI (#1110)
d66b101
unverified
misc fixes from #943 (#1086) [skip ci]
23495a8
unverified
update sharegpt conversations when chatml chat template is set (#1075) [skip ci]
0ce1a65
unverified
Add: mlflow for experiment tracking (#1059) [skip ci]
090c24d
unverified
feature: better device mapping for large models (#918)
bdfefaf
unverified
set default for merge (#1044)
63fb3eb
unverified
RL/DPO (#935)
f243c21
Fix: bf16 support for inference (#981)
3678a6c
unverified
feat: remove need to add load_in* during merge (#1017)
f6ecf14
unverified
remove landmark attn and xpos rope implementations (#1010)
70b46ca
unverified
Fix Deepspeed loading (#950)
5ea3aa3
unverified
ensure merged model matches the training dtype (#902)
1d21aa6
unverified
include the suffix modified string in ascii art (#852)
614cff4
unverified
Feat: Added Gradio support (#812)
738a057
unverified
Create preprocess CLI (#785)
e50ab07
unverified
improve handling of the prepared ds path and other cfg defaults (#701)
1c412c7
unverified
Save Axolotl config as WandB artifact (#716)
490923f
unverified
Jan Philipp Harries
commited on
prepared dataset caching, other misc fixes (#665)
e50a64e
unverified
Warn users to login to HuggingFace (#645)
85b0be2
unverified
Napuh
commited on