make sure to save the lora adapter at the end of RL/dpo training (#1573) 796a085 unverified winglian committed on May 8, 2024
Pass weakref to model in the SIGINT handler to free up model post train function (#1581) dde02fc unverified chiragjn winglian committed on May 3, 2024
Train parameters exclusively in specific ranges (#1390) 05bcc9e unverified seungduk committed on Mar 14, 2024
workaround for transformers bug requiring do_sample for saving pretrained (#1206) ba944e6 unverified winglian committed on Jan 25, 2024
fix: switch to using the HuggingFace Transformers NEFT implementation (#941) ef24342 unverified kallewoof committed on Dec 13, 2023
refactor neft patch to be more re-usable similar to trl's impl (#796) 827ec3d unverified winglian committed on Oct 29, 2023
set fsdp state dict (#584) be75668 unverified Jan Philipp Harries committed on Sep 15, 2023