Commit History
pin xformers >= 0.0.22 (#724)
bfbdba8
Fix(version): Update FA to work with Mistral SWA (#673)
43856c0
Feat: Allow usage of native Mistral FA when no sample_packing (#669)
697c50d
removed duplicate in requirements.txt (#661)
a7e56d8
Napuh committed
add mistral e2e tests (#649)
5b0bc48
Mistral flash attn packing (#646)
b6ab8aa
use fastchat conversations template (#578)
e7d3e2d
Feat: Add support for upstream FA2 (#626)
19a600a
update README w deepspeed info (#605)
c25ba79
Update requirements.txt (#610)
ec0958f
Javier committed
fix wandb so mypy doesn't complain (#562)
bf08044
Update requirements.txt (#543)
c1921c9
dongxiaolong committed
update readme to point to direct link to runpod template, clean up install instructions (#532)
34c0a86
Add support for GPTQ using native transformers/peft (#468)
3355706
add eval benchmark callback (#441)
7657632
customizable ascii art (#506)
548787d
Fix missing 'packaging' wheel (#482)
c500d02
Maxime committed
allow newer deps
c29117a
flash attn pip install (#426)
cf66547
adds color (#425)
0a22847
remove extra accelerate in requirements (#430)
82e111a
Attention mask and position id fixes for packing (#285)
2bb0b78
Merge pull request #355 from tmm1/bitsandbytes-fixes
35c8b90
bump to latest bitsandbytes release with major bug fixes
fce40aa
use newer pynvml package
9c31410
log GPU memory usage
e303d64
pin accelerate so it works with llama2 (#330)
6c9a87c
latest HEAD of accelerate causes 0 loss immediately w FSDP (#321)
9f69c4d
add hf_transfer to requirements for faster hf upload
6dd2e7d
Update requirements.txt
273b3a3
Teknium committed