Commit History
c9a149f  add check for attr
b565ecf  Fix strict and Lint
fe0b768  match up gradient checkpointing when using lora w config
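
For context on fe0b768: when gradient checkpointing is enabled on a model wrapped with a LoRA adapter, most parameters are frozen, so checkpointed segments can end up with no inputs that require grad and the backward pass silently breaks. A minimal sketch of the usual matching pattern, assuming a Hugging Face transformers + peft stack (the model id and LoRA values are illustrative, not the repo's actual config):

```python
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

# Hypothetical base model id, for illustration only.
model = AutoModelForCausalLM.from_pretrained("huggyllama/llama-7b")

lora_config = LoraConfig(r=8, lora_alpha=16, target_modules=["q_proj", "v_proj"])
model = get_peft_model(model, lora_config)

# Checkpointing recomputes activations in backward; with the base model frozen,
# inputs must be forced to require grad or there is nothing to backprop through.
model.gradient_checkpointing_enable()
model.enable_input_require_grads()

# The KV cache used for generation is incompatible with checkpointing.
model.config.use_cache = False
```
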
563b6d8  Fix undefined LlamaForCausalLM and del try except
cd0a6f6  peft no longer needs device_map
919727b  Refactor landmark attention patch
a808bf9  Fix missing cfg. (Angainor Development)
0124825  Merge pull request #182 from OpenAccess-AI-Collective/fix-llama-ref
ab5cd28  more gpt-neox long ctx fixes
1210dc8  more tweaks to do pre-training with bettertransformers
1edc30c  add support for optimum bettertransformers
14163c1  fix for local variable 'LlamaForCausalLM' referenced before assignment
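
The 14163c1 fix is the classic Python scoping pitfall: a name bound only inside a conditional branch (or a try/except import) raises UnboundLocalError, "local variable ... referenced before assignment", whenever the binding path is skipped. A simplified sketch of the bug shape and one fix, not the repo's actual loader code:

```python
# Buggy shape: the name is only bound when the branch runs, so the
# later reference raises UnboundLocalError ("referenced before assignment").
def load_model(model_type: str):
    if model_type == "llama":
        try:
            from transformers import LlamaForCausalLM
        except ImportError:
            pass  # name never bound on this path either
    return LlamaForCausalLM  # UnboundLocalError when the branch was skipped


# Fixed shape: bind the name on every path before it is used.
def load_model_fixed(model_type: str):
    model_cls = None
    if model_type == "llama":
        from transformers import LlamaForCausalLM

        model_cls = LlamaForCausalLM
    if model_cls is None:
        raise ValueError(f"unsupported model type: {model_type}")
    return model_cls
```
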
79e2a6f  Merge branch 'main' into patch-1 (Angainor Development)
a03a7d7  add support to extend context with xpos rope
7f09106  fix for max sequence len across different model types
aefb2fc  Fix backward compat for peft
813cfa4  WIP: Rely on cfg.inference (Angainor Development)
e44c9e0  Fix patching via import instead of hijacking
55b8542  Feat: Add landmark attention
df9528f  Fix future deprecation of prepare_model_for_int8_training
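
On df9528f: newer peft releases deprecate prepare_model_for_int8_training in favor of prepare_model_for_kbit_training, which also covers 4-bit models. A sketch of a forward-compatible shim, assuming those helper names; check the installed peft version before relying on this exact fallback:

```python
from transformers import AutoModelForCausalLM

try:
    from peft import prepare_model_for_kbit_training  # newer peft
except ImportError:
    # older peft only ships the int8-specific helper under the old name
    from peft import prepare_model_for_int8_training as prepare_model_for_kbit_training

# Hypothetical 8-bit load, for illustration only.
model = AutoModelForCausalLM.from_pretrained("huggyllama/llama-7b", load_in_8bit=True)
model = prepare_model_for_kbit_training(model)
```
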
193c73b  Fix training over existing lora (Angainor Development)
4ac9e25  new prompters; misc fixes for missing output dir when using FSDP and for changing max seq len
2d0ba3b  Merge pull request #124 from OpenAccess-AI-Collective/xformers-fix
e3c494c  remove unused import and update readme
6cb2310  copy xformers attn from ooba since we removed dep on alpaca_lora_4bit
39a208c  fix up tokenizer config, isort fix
2520ecd  split up llama model loading so config can be loaded from base config and models can be loaded from a path
594e72b  Fix incorrect rebase
cfcc549  fix relative path for fixtures
37293dc  Apply isort then black
e9650d3  Fix mypy typing
f4e5d86  Lint models.py
e65aeed  fix relative path for fixtures
56f9ca5  refactor: fix previous refactors
8bd7a49  Refactor to use DictDefault instead
bdfe7c9  Convert attrdict to addict
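
On bdfe7c9 and 8bd7a49: addict's Dict gives attribute-style access but returns a new empty Dict for a missing key, so `cfg.some_flag is None` checks fail. A DictDefault-style subclass can return None for missing keys instead. A minimal sketch, assuming the subclass overrides __missing__; the repo's actual implementation may differ:

```python
from addict import Dict


class DictDefault(Dict):
    """addict.Dict variant where missing keys resolve to None, not a new Dict."""

    def __missing__(self, key):
        return None


cfg = DictDefault({"lora": {"r": 8}})
print(cfg.lora.r)     # 8, attribute-style access from addict
print(cfg.load_4bit)  # None, so `cfg.load_4bit is None` checks behave
```
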
0d4a7f4  Merge pull request #67 from OpenAccess-AI-Collective/refactor-tokenizer-load
147241c  Merge branch 'main' into refactor/rename-4b-to-gptq
4c90633  fix auto linear modules for lora w/o any set already
dd00657  refactor(param): rename load_4bit config param to gptq (Thytu)