# Axolotl

Go ahead and axolotl questions
## Support Matrix
|          | fp16/fp32 | fp16/fp32 w/ lora | 4bit-quant | 4bit-quant w/flash attention | flash attention | xformers attention |
|----------|:---------:|:-----------------:|:----------:|:----------------------------:|:---------------:|:------------------:|
| llama    | ✅        | ✅                | ✅         | ✅                           | ✅              | ✅                 |
| Pythia   | ✅        | ✅                | ❌         | ❌                           | ❌              | ❌                 |
| cerebras | ✅        | ✅                | ❌         | ❌                           | ❌              | ❌                 |
## Getting Started
- Point the config you are using to a huggingface hub dataset (see `configs/llama_7B_4bit.yml`):

```yaml
datasets:
  - path: vicgalle/alpaca-gpt4
    type: alpaca
```
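Since `datasets` is a list, several sources can be combined into one training run. A sketch of what that might look like (the second path is a hypothetical placeholder, not a real dataset):

```yaml
datasets:
  - path: vicgalle/alpaca-gpt4
    type: alpaca
  # hypothetical second dataset in the same alpaca format
  - path: someuser/another-alpaca-style-dataset
    type: alpaca
```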
- Optionally, download some datasets; see `data/README.md`
- Create a new YAML config or update the existing `configs/pythia_1_2B_alpaca.yml` (a rough sketch of such a config follows)
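For orientation, here is a minimal sketch of the kinds of keys a training config contains. The key names mirror the shipped example configs, but the values below are illustrative assumptions, not tuned defaults:

```yaml
# illustrative sketch -- consult the shipped configs for real values
base_model: EleutherAI/pythia-1.4b-deduped  # assumed base model; substitute your own
model_type: GPTNeoXForCausalLM
tokenizer_type: AutoTokenizer
load_in_8bit: true
datasets:
  - path: vicgalle/alpaca-gpt4
    type: alpaca
val_set_size: 0.05          # fraction of data held out for eval
sequence_len: 2048
lora_r: 8                   # LoRA rank (assumed value)
lora_alpha: 16
lora_dropout: 0.05
micro_batch_size: 4
num_epochs: 3
learning_rate: 0.0003
output_dir: ./lora-out
```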
- Install python dependencies with ONE of the following:
  - `pip3 install -e .[int4]` (recommended)
  - `pip3 install -e .[int4_triton]`
  - `pip3 install -e .`
- If not using `int4` or `int4_triton`, run `pip install "peft @ git+https://github.com/huggingface/peft.git"`
- Configure accelerate: run `accelerate config`, or update `~/.cache/huggingface/accelerate/default_config.yaml` directly:

```yaml
compute_environment: LOCAL_MACHINE
distributed_type: MULTI_GPU
downcast_bf16: 'no'
gpu_ids: all
machine_rank: 0
main_training_function: main
mixed_precision: bf16
num_machines: 1
num_processes: 4
rdzv_backend: static
same_network: true
tpu_env: []
tpu_use_cluster: false
tpu_use_sudo: false
use_cpu: false
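```

`num_processes` should match the number of GPUs you are training on. On a single-GPU machine, the relevant parts of the same file would look roughly like this (a sketch based on standard accelerate behavior, not an axolotl-specific requirement):

```yaml
compute_environment: LOCAL_MACHINE
distributed_type: 'NO'   # accelerate's value for non-distributed runs
downcast_bf16: 'no'
machine_rank: 0
main_training_function: main
mixed_precision: bf16
num_machines: 1
num_processes: 1
use_cpu: false
```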
- Train! `accelerate launch scripts/finetune.py`; make sure to choose the correct YAML config file.
- Alternatively, you can pass in the config file like:

```bash
accelerate launch scripts/finetune.py configs/llama_7B_alpaca.yml
```