Highlighted work

grimjim's Collections

updated 1 day ago

My "greatest hits", sort of

grimjim/SauerHuatuoSkywork-o1-Llama-3.1-8B

Text Generation • Updated 3 days ago • 26 • 2

Note The addition of o1-inspired reasoning lifted the Instruct model's scores on most benchmarks. As of the initial merge release date, this is the highest-scoring Llama 3.x 8B model that I've produced on the current Open LLM Leaderboard.
grimjim/SauerHuatuoSkywork-o1-Llama-3.1-8B-GGUF

Text Generation • Updated 1 day ago • 83
grimjim/HuatuoSkywork-o1-Llama-3.1-8B

Text Generation • Updated 16 days ago • 156

Note This merge of o1 reasoning models achieved an unexpectedly high MATH Level 5 score of 33.99%, which was the highest I saw at the time for Llama 3.x 8B models on the Open LLM Leaderboard.
grimjim/llama-3-Nephilim-v3-8B

Text Generation • Updated Sep 3, 2024 • 178 • 13

Note Proof of concept that a text completion model (based on Instruct in this case) can handle roleplay without any fine-tuning that specifically targets it. All merge components are academic in origin.
grimjim/llama-3-Nephilim-v3-8B-GGUF

Text Generation • Updated Aug 25, 2024 • 102 • 12
grimjim/Llama-3.1-8B-Instruct-abliterated_via_adapter

Text Generation • Updated Sep 18, 2024 • 5.12k • 29

Note Llama 3.1 8B "abliterated" by transferring the feature through a LoRA. The model has probably sustained some damage, a common consequence of abliteration, which additional fine-tuning could repair.
grimjim/Llama-3.1-8B-Instruct-abliterated_via_adapter-GGUF

Text Generation • Updated Sep 4, 2024 • 519 • 23
grimjim/Llama-3-Instruct-abliteration-LoRA-8B

Updated Sep 10, 2024 • 7

Note The LoRA adapter was obtained from Llama 3 and later applied to Llama 3.1.
grimjim/kukulemon-7B

Text Generation • Updated Mar 21, 2024 • 74 • 11

Note One of my first merges, combining two smart models with a roleplay-oriented merge. Someone on YouTube called out this Mistral v0.1 7B architecture model in a video.
grimjim/kukulemon-7B-GGUF

Text Generation • Updated Aug 26, 2024 • 43 • 2
