Commit History
Add training callback to send predictions to WandB table (#521)
5b67ea9
unverified
Glavin001
commited on
document that packaging needs to be installed before flash-attn (#559)
9845c5e
unverified
winglian
commited on
ergonomic update to optimizer config doc (#548)
6d57f2f
unverified
The Objective Dad
commited on
update readme to point to direct link to runpod template, cleanup install instrucitons (#532)
34c0a86
unverified
winglian
commited on
Adding NCCL Timeout Guide (#536)
5e2d8a4
unverified
The Objective Dad
commited on
Fix(doc): Inform Windows users to use WSL/docker (#518)
f51c9c5
unverified
Nanobit
commited on
Added advanced DDP args (#515)
396a7a7
unverified
Jan Philipp Harries
Jan Philipp Harries
commited on
support for datasets with multiple names (#480)
5ac3392
unverified
winglian
commited on
Fix(doc): Clarify no amp to full yaml docs (#496)
48c5647
unverified
Nanobit
commited on
pad_to_worst_case_seq_len boolean, for testing memory limits (#498)
8e197f6
unverified
Feat(doc): Update eval_steps doc (#487)
ad8be43
unverified
Nanobit
commited on
ReLoRA implementation (with quantization) (#322)
bde3c5a
unverified
Fix(doc): Clarify config (#466)
55c23c7
unverified
Nanobit
commited on
feat: add Metharme prompt strategy (#446)
f474650
unverified
feat(docs): improve user customized prompts (#443)
04a42b6
unverified
Nanobit
commited on
feat(doc): add pillow to lambda instructions (#445)
919f4ca
unverified
Nanobit
commited on
support user defined prompters, pretokenized datasets in config, local parquet, local arrow files (#348)
d2e7f27
unverified
winglian
commited on
docs(readme): add `cd axolotl` (#440)
d21318d
unverified
philpax
commited on
use save_strategy from config if available (#434)
b3f5e00
unverified
winglian
commited on
flash attn pip install (#426)
cf66547
unverified
Fix(docs): Remove gptq+lora and fix xformer compat list (#423)
3d1f203
unverified
Nanobit
commited on
hopefully improve the README (#419)
2495909
unverified
winglian
commited on
Merge pull request #413 from mhenrichsen/chore/update-deepseed-config
f806e86
unverified
mhenrichsen
commited on
Feat(doc): Add lr_quadratic_warmup to readme (#412)
2b990eb
unverified
Nanobit
commited on
update path to align with fsdp example
bd8cab4
mhenrichsen
commited on
Fix(config): Update handling of deepspeed config (#404)
c01015f
unverified
Nanobit
commited on
Fix(docs): Update flash attn requirements (#409)
72fe3f8
unverified
Nanobit
commited on
update docs for tokenizer_legacy (#401)
47961fd
unverified
winglian
commited on
Feat(doc): Add how to save by epochs (#396)
be294fd
unverified
Nanobit
commited on
Feat(doc): Add max_steps to readme (#389)
41ecb45
unverified
Nanobit
commited on
Feat(config): Add hub_strategy (#386)
73a0b6e
unverified
Nanobit
commited on
Feat(doc): Improve sharegpt doc (#378)
729c299
unverified
Nanobit
commited on
Attention mask and position id fixes for packing (#285)
2bb0b78
unverified
winglian
commited on
Add wandb_entity to wandb options, update example configs, update README (#361)
7019509
unverified
Feat: Add rope scaling (#343)
b521206
unverified
Nanobit
commited on
Update README.md on pretraining_dataset (#360)
fae6ed8
unverified
Nanobit
commited on
Clarify pre-tokenize before multigpu (#359)
94d03c8
unverified
Nanobit
commited on
note pattern when using groups
b4d1d22
tmm1
commited on
update comment for group_by_length
9f99104
tmm1
commited on
python 3.10 and 3.11 both work fine, as does pytorch 2.1.0.dev
58d6659
tmm1
commited on
there is no configs folder
cc7e800
tmm1
commited on
update README for updated docker images (#328)
41a4d15
unverified
winglian
commited on
Merge pull request #306 from ethanhs/xgen
dcdec44
unverified
winglian
commited on
don't resize embeddings to multiples of 32x by default
1066751
winglian
commited on
Add XGen info to README and example config
3881143
ethanhs
commited on
Fix(readme): Improve wording for push model
165907f
unverified
Nanobit
commited on
fix(readme): remove accelerate config
b64f411
unverified
Nanobit
commited on
Merge pull request #279 from NanoCode012/feat/multi-gpu-readme
469c08c
unverified
winglian
commited on