Added RAG and reasoning examples
README.md
@@ -311,6 +311,51 @@ print(llm.create_chat_completion(
 ))
 ```
 
+#### Simple llama-cpp-python RAG code, requires [PR#1440](https://github.com/abetlen/llama-cpp-python/pull/1440)
+
+```python
+from llama_cpp import Llama
+
+llm = Llama(model_path="./granite-3.2-8b-instruct.IQ4_XS.gguf", n_gpu_layers=41, n_ctx=131072)
+
+print(llm.create_chat_completion(
+    messages = [
+        {
+            "role": "user",
+            "content": "Write a short summary of each document please."
+        }
+    ],
+    documents = [
+        {
+            "text": "Lorem ipsum",
+        },
+        {
+            "text": "Dolor sit amet",
+        }
+    ]
+))
+```
+
+#### Simple llama-cpp-python reasoning code, requires [PR#1440](https://github.com/abetlen/llama-cpp-python/pull/1440)
+
+```python
+from llama_cpp import Llama
+
+llm = Llama(model_path="./granite-3.2-8b-instruct.IQ4_XS.gguf", n_gpu_layers=41, n_ctx=131072)
+
+print(llm.create_chat_completion(
+    messages = [
+        {
+            "role": "user",
+            "content": "You have 10 liters of a 30% acid solution. How many liters of a 70% acid solution must be added to achieve a 50% acid mixture?"
+        }
+    ],
+    template_kwargs = {
+        "thinking": True
+    }
+))
+```
+
 <!-- README_GGUF.md-how-to-run end -->
 
 <!-- original-model-card start -->
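
As a follow-up to the RAG example above: `create_chat_completion` also accepts `stream=True`, which turns the return value into an iterator of OpenAI-style chunks rather than a single completion dict, handy when the summaries run long. A minimal sketch, still assuming the PR#1440 branch so that the `documents` keyword is forwarded into the chat template (upstream llama-cpp-python does not accept it):

```python
from llama_cpp import Llama

llm = Llama(model_path="./granite-3.2-8b-instruct.IQ4_XS.gguf", n_gpu_layers=41, n_ctx=131072)

# With stream=True, create_chat_completion yields OpenAI-style chunks
# instead of returning one completion dict.
stream = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Write a short summary of each document please."}],
    documents=[  # assumption: forwarded to the chat template by PR#1440
        {"text": "Lorem ipsum"},
        {"text": "Dolor sit amet"},
    ],
    stream=True,
)

for chunk in stream:
    # Each chunk carries an incremental "delta"; the first one holds the
    # role, later ones hold content fragments.
    delta = chunk["choices"][0]["delta"]
    if "content" in delta:
        print(delta["content"], end="", flush=True)
print()
```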
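
The mixture question in the reasoning example has a closed-form answer you can check the model against: the acid balance 0.30·10 + 0.70·x = 0.50·(10 + x) solves to x = 10 liters. A minimal sketch that captures the completion instead of printing the raw dict, with that check inline (again assuming the PR#1440 branch, where `template_kwargs` is passed through to the chat template; the response layout is llama-cpp-python's standard OpenAI-style dict):

```python
from llama_cpp import Llama

llm = Llama(model_path="./granite-3.2-8b-instruct.IQ4_XS.gguf", n_gpu_layers=41, n_ctx=131072)

response = llm.create_chat_completion(
    messages=[
        {
            "role": "user",
            "content": "You have 10 liters of a 30% acid solution. "
                       "How many liters of a 70% acid solution must be added "
                       "to achieve a 50% acid mixture?",
        }
    ],
    template_kwargs={"thinking": True},  # assumption: forwarded by PR#1440
)

# The model's reasoning and final answer arrive as ordinary message content.
print(response["choices"][0]["message"]["content"])

# Ground truth: 0.30*10 + 0.70*x = 0.50*(10 + x)  =>  0.20*x = 2  =>  x = 10
x = (0.50 - 0.30) * 10 / (0.70 - 0.50)
print(f"Expected answer: {x:g} liters")
```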