Commit History
Phi2 rewrite (#1058)
732851f
unverified
winglian
committed on
streaming multipack for pretraining dataset (#959)
553c80f
unverified
feat: always push checkpoint to hub if set (#1049) [skip ci]
cbdbf9e
unverified
Nanobit
committed on
RL/DPO (#935)
f243c21
winglian
committed on
use recommended setting for use_reentrant w gradient checkpointing (#1021)
4d2e842
unverified
winglian
committed on
remove landmark attn and xpos rope implementations (#1010)
70b46ca
unverified
winglian
committed on
FEAT: add tagging support to axolotl (#1004)
db9094d
unverified
fix: add lr scheduler kwargs to Trainer (#972)
13e9381
unverified
Nanobit
committed on
fix: switch to using the HuggingFace Transformers NEFT implementation (#941)
ef24342
unverified
kallewoof
committed on
support for mamba (#915)
40a6362
unverified
winglian
committed on
Feat(wandb): Refactor to be more flexible (#767)
a1da39c
unverified
Nanobit
committed on
feature: loss watchdog for terminating training runs that are failing (#899)
58ec8b1
unverified
Feat: Add warmup_ratio (#893)
fb12895
unverified
Nanobit
committed on
don't train if eval split is too small (#873)
797f3dd
unverified
winglian
committed on
various bugfixes (#856)
1470650
unverified
winglian
committed on
cleanup the old multipack dataloader (#841)
1a6309c
unverified
winglian
committed on
multipack w batch sampler (#795)
641e6f7
unverified
winglian
committed on
Threaded MultipackDistributedDataloader with prefetched samples (#759)
05bd6f1
unverified
casperhansen
committed on
refactor setup trainer so we can add more hooks (#773)
6c81c61
unverified
winglian
committed on