Commit History
Merge pull request #189 from OpenAccess-AI-Collective/fixes-20230711
f620706
get rid of some configs, formalize pythia lora config
77762a5
new validation for mpt with grad checkpoints
14668fa
Fix strict and Lint
b565ecf
match up gradient checkpointing when using lora with config
fe0b768
Merge pull request #186 from akj2018/main
e944311
Update FAQS.md
e3e7b52
Fix set mem_id for inference and refactor
974dc00
Set mem cache args on inference
572d114
Clean up landmark patching
a6190c8
Fix undefined LlamaForCausalLM and delete try/except
563b6d8
peft no longer needs device_map
cd0a6f6
Update FAQS.md
dd7d16d
Refactor landmark attention patch
919727b
Update FAQS.md
5ffefee
Merge pull request #183 from OpenAccess-AI-Collective/inference-from-stdin
d9f713e
fix formatting
958da70
pass a prompt in from stdin for inference
c4e4f81
Fix missing cfg.
a808bf9
Angainor Development
Merge pull request #182 from OpenAccess-AI-Collective/fix-llama-ref
0124825
address PR feedback
0c6f928
add streaming dataset support for pretraining datasets
eea2731
linting fix
1db46a9
more gpt-neox long context fixes
ab5cd28
fix bettertransformers save, force it to skip after saving correctly in callback
1a82082
more tweaks to do pre-training with bettertransformers
1210dc8
experimental expansion of context length
488a67d
add validation/warning for bettertransformers and torch version
71a43f8
use pythia-12b, neox-20b is flaky
3961902
add flash attn context for efficient training and attempt setting model to train mode
8792199
add support for optimum bettertransformers
1edc30c
fix for local variable 'LlamaForCausalLM' referenced before assignment
14163c1
Merge pull request #181 from OpenAccess-AI-Collective/xpos-rope
41e4f6c
Merge branch 'main' into patch-1
79e2a6f
Angainor Development
Remove explicit definition of cfg.inference
c250898
Angainor Development