Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Dovakiins
/
qwerrwe
like
0
Build error
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
d1aed4c
qwerrwe
100 contributors
History:
18 commits
winglian
deepspeed doesn't work with flash-attn, and the gpu savings w flash attn are better than the deepspeed headaches
d1aed4c
almost 2 years ago
configs
deepspeed doesn't work with flash-attn, and the gpu savings w flash attn are better than the deepspeed headaches
almost 2 years ago
data
WIP for axolotl trainer
almost 2 years ago
scripts
deepspeed doesn't work with flash-attn, and the gpu savings w flash attn are better than the deepspeed headaches
almost 2 years ago
src
various bugfixes
almost 2 years ago
.editorconfig
Safe
186 Bytes
WIP for axolotl trainer
almost 2 years ago
.gitattributes
Safe
49 Bytes
make it work with pythia in the cloud
almost 2 years ago
.gitignore
41 Bytes
WIP for axolotl trainer
almost 2 years ago
README.md
1.81 kB
config chooser, update readme instructions, device config, llama flash attention, debug out the labels, fix config key checks, other bugfixes
almost 2 years ago
ds_config.json
844 Bytes
deepspeed doesn't work with flash-attn, and the gpu savings w flash attn are better than the deepspeed headaches
almost 2 years ago
pyproject.toml
Safe
90 Bytes
WIP for axolotl trainer
almost 2 years ago
requirements.txt
209 Bytes
helpful info output
almost 2 years ago
setup.cfg
560 Bytes
various bugfixes
almost 2 years ago