---
license: apache-2.0
language:
- en
pipeline_tag: text-generation
tags:
- chat
---

# Qwen2-Wukong-7B



Qwen2-Wukong-7B is a dealigned chat finetune of the original fantastic Qwen2-7B model by the Qwen team.

This model was trained on teknium's OpenHermes-2.5 dataset and some supplementary datasets from Cognitive Computations.

Training ran for 3 epochs using a custom FlashAttention-2 (FA2) implementation for AMD GPUs.

Special thanks to [Tensorwave](https://tensorwave.com/) for providing the compute for this training run on their fantastic Instinct MI300X nodes.

[<img src="https://raw.githubusercontent.com/axolotl-ai-cloud/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/axolotl-ai-cloud/axolotl)
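
As a quick usage sketch, the snippet below loads the model for chat inference with the standard Hugging Face transformers chat-template API; the repository id shown is an assumption and may differ from the actual hosted name.

```python
# Minimal chat-inference sketch using the standard transformers API.
# The repo id below is an assumption; substitute the actual hosted name.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "cognitivecomputations/Qwen2-Wukong-7B"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype="auto", device_map="auto"
)

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Who was Sun Wukong?"},
]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output_ids = model.generate(input_ids, max_new_tokens=256)
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```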
# Example Outputs
TBD