Sweaterdog/Andy-v3.5-Beta

	---
	base_model:
	- unsloth/Qwen2.5-7B-bnb-4bit
	tags:
	- text-generation-inference
	- transformers
	- unsloth
	- qwen2
	- trl
	license: apache-2.0
	language:
	- en
	datasets:
	- Sweaterdog/Andy-v3.5-Beta
	---

	# Uploaded models

	- Developed by: Sweaterdog
	- License: apache-2.0
	- Finetuned from model: unsloth/Qwen2.5-7B-bnb-4bit

	The MindCraft LLM tuning CSV file can be found here and can be tweaked as needed: [MindCraft-LLM](https://huggingface.co/datasets/Sweaterdog/Andy-v3.5-Beta)
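
If you do want to tweak the CSV, here is a minimal sketch of one way to drop rows you don't want before tuning. It only uses Python's standard `csv` module. The column names `input` and `output` in the demo are made up for illustration, so check the real header of the dataset file before reusing this.

```python
import csv
import io


def filter_csv(src, dst, keep):
    """Copy CSV rows from src to dst, keeping only rows where keep(row) is true.

    src and dst are open text file objects; keep receives each row as a dict.
    Returns the number of rows that were kept.
    """
    reader = csv.DictReader(src)
    writer = csv.DictWriter(dst, fieldnames=reader.fieldnames)
    writer.writeheader()
    kept = 0
    for row in reader:
        if keep(row):
            writer.writerow(row)
            kept += 1
    return kept


# Tiny in-memory demo. The "input"/"output" columns are hypothetical;
# the real dataset's header may be different.
demo_src = io.StringIO("input,output\nhello,hi there\nmine stone,\n")
demo_dst = io.StringIO()
kept = filter_csv(demo_src, demo_dst, lambda row: row["output"].strip() != "")
```

For a real file, open the source and destination with `open(path, newline="", encoding="utf-8")` and pass those file objects instead of the `StringIO` ones.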

	# This is a very, very early access beta model
	This model is NOT a final version. It is a test of how well a model can perform when trained on a small dataset, and of how much smaller models can improve when the data is extremely high quality and as close to real-world scenarios as possible.

	The model listed here (Andy-3.5-beta-10) is NOT the final model; it is a preview of the new training method. It performs well at playing Minecraft and can even play with no instructions other than history.
	That being said, this model was trained on a small dataset, so it doesn't have *every single example* it may need. The final version will have a much larger dataset.
	Also, if you want to use this model, you can test Modelfile or Modelfile 2. I haven't had a chance to dive deep into which performs better. The model is alright: it isn't the best, but it is better than a non-tuned model.

	# Where data came from

	The memory-storing examples are real, taken from in-game interactions.

	The coding examples are artificial and were generated by GPT-o1, with the instruction to include reasoning and thinking in the comments of the code.

	The playing examples are artificial: I wrote them myself, as a human, using prompts that focus on points where some models fail, such as mining.

	This model should not be taken as a reflection of how smaller models play Minecraft. If it performs well, and better than Andy-v2-qwen, then yay! If not, I wasn't expecting it to be better (and neither should you!).

	You are totally allowed to test the beta model.

	I hope this model performs well for you!

	# How you can help make Andy-3.5

	I am one person paying for and working on this model. I want it to be amazing and free for everyone to play with.
	However, that is not financially feasible for one person like me.
	If you would like to help out, you can find me on Discord via the [MindCraft server](https://discord.com/invite/mp73p35dzC). My username there is "Sweaterdog".

	ALSO

	The models are going to change: I am changing the tuning hyperparameters to (hopefully) increase performance and decrease hallucinations.

	BTW, if you want to download this model, I suggest using llama.cpp to make a quantization of it. I would have done it during tuning, but I ran out of GPU time on Google Colab.
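
For anyone who hasn't quantized with llama.cpp before, the usual flow is two steps: convert the Hugging Face checkpoint to a GGUF file, then quantize that file. The sketch below only builds the two command lines so you can inspect them before running anything. It makes several assumptions, none of which come from this repo: the downloaded weights are a merged full-precision checkpoint (a bitsandbytes 4-bit checkpoint would likely need merging first), llama.cpp is cloned and built with the `llama-quantize` binary under `build/bin`, and `Q4_K_M` is the quantization type you want. The output file names are placeholders.

```python
from pathlib import Path


def build_quantize_commands(model_dir, llama_cpp_dir, quant_type="Q4_K_M"):
    """Build the llama.cpp commands for HF checkpoint -> f16 GGUF -> quantized GGUF.

    Returns a list of two argument lists. Nothing is executed here.
    """
    model_dir = Path(model_dir)
    llama_cpp_dir = Path(llama_cpp_dir)

    # Placeholder output names; rename to whatever you prefer.
    f16_path = model_dir / "model-f16.gguf"
    quant_path = model_dir / f"model-{quant_type}.gguf"

    # Step 1: convert the Hugging Face checkpoint to a 16-bit GGUF file.
    convert_cmd = [
        "python", str(llama_cpp_dir / "convert_hf_to_gguf.py"),
        str(model_dir), "--outfile", str(f16_path), "--outtype", "f16",
    ]
    # Step 2: quantize the GGUF file (assumed binary location after a CMake build).
    quantize_cmd = [
        str(llama_cpp_dir / "build" / "bin" / "llama-quantize"),
        str(f16_path), str(quant_path), quant_type,
    ]
    return [convert_cmd, quantize_cmd]


commands = build_quantize_commands("Andy-v3.5-Beta", "llama.cpp")
```

Once the paths look right, each entry in `commands` can be passed to `subprocess.run(cmd, check=True)`. Script and binary names have changed between llama.cpp versions, so check them against your own checkout first.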