Commit History
RL/DPO (#935)
f243c21
add config to model card (#1005)
85dd4d5
unverified
fix: switch to using the HuggingFace Transformers NEFT implementation (#941)
ef24342
unverified
kallewoof
commited on
Fix Deepspeed loading (#950)
5ea3aa3
unverified
support for mamba (#915)
40a6362
unverified
use accelerate logging for zero/main loggin only
b2430ce
cleanup verbosity a bit
4c834bf
refactor neft patch to be more re-usable similar to trl's impl (#796)
827ec3d
unverified
Implement fused modules (#747)
15d3a65
unverified
Fix DeepSpeed Zero 3 Saving (#709)
e4d1585
unverified
create a model card with axolotl badge (#624)
501958b
unverified
set fsdp state dict (#584)
be75668
unverified
Jan Philipp Harries
Jan Philipp Harries
commited on