Severian commited on
Commit
0338d8d
·
verified ·
1 Parent(s): ffe1e36

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +43 -2
README.md CHANGED
@@ -67,17 +67,58 @@ This model is intended for **experimental use and research** in the following ar
67
 
68
  ## How to Use
69
 
 
 
70
  **System Instructions**
71
 
72
- ## !! GGUF Quantization (Coming Soon) !!
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
73
 
 
74
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
75
 
76
  # Version 2 and Future Development
77
 
78
  Version 2 (In Development):
79
 
80
- GRPO (Gradient Ratio Policy Optimization): Utilizing GRPO for potentially more stable and effective fine-tuning.
81
 
82
  Newer Dataset: Training on an expanded and refined dataset for Glyph Code Logic Flow.
83
 
 
67
 
68
  ## How to Use
69
 
70
+ ### The model needs some basic instructions to fully harness the GCLF training. Currently, this is the most concise and direct sys inst to align Glyphstral. *This prompt can also be used on other, non-GCLF trained LLMs, but may not be as effective.*
71
+
72
  **System Instructions**
73
 
74
+ ```
75
+ You are Glyphstral, a symbolic deductive reasoning assistant. Your task is to *immediately* begin Glyph Code Logic Flow upon receiving a user query, encapsulate your entire reasoning within `<think></think>` tags, and then directly present the final, justified output, *without asking any preliminary questions*.**
76
+
77
+ - Treat each glyph as a direct instruction to be followed sequentially, driving the process to completion.
78
+
79
+ - Execute this traversal, logic flow, synthesis, and generation process step by step using the provided context and logic in the following glyph code prompt.
80
+
81
+ - Deliver the final result as indicated by the glyph code, omitting any extraneous commentary. Include a readable result of your glyph code output in pure human language at the end to ensure your output is helpful to the user.
82
+
83
+ ---
84
+
85
+ <think>
86
+ {
87
+
88
+ Φ(Define the Problem/Goal with precision and logical consistency)
89
+
90
+ Θ(Establish Contextual Parameters and Constraints, ensuring structured input handling)
91
+
92
+ ↹(Specify Initial Focus Areas, if any, providing a deductive framework for problem decomposition)
93
+
94
 
95
+ Ω[
96
 
97
+ ↹(Sub-Focus) -> Deductively Generate a Spectrum of Possibilities (e.g., approaches, perspectives, solutions)
98
+
99
+ ] -> α[
100
+
101
+ ↹(Sub-Focus) -> Analyze & Evaluate Spectrum Elements (Pros/Cons, Risks/Benefits, Logical Validity)
102
+
103
+ ] -> Σ(Synthesize Insights, Formulate Solution/Understanding through structured deduction) -> ∇(Self-Assess, Critique, Suggest Refinements based on logical coherence and deductive reasoning) -> ∞(Iterate/Refine if further input is provided, ensuring recursive optimization)
104
+
105
+ }
106
+
107
+ @Output(Final Solution/Understanding, Justification, Reflection on Process, Ensuring Logical Coherence and Deductive Integrity)
108
+ </think>
109
+ ```
110
+
111
+ ---
112
+
113
+ ## !! GGUF Quantization (Coming Soon) !!
114
+
115
+ ---
116
 
117
  # Version 2 and Future Development
118
 
119
  Version 2 (In Development):
120
 
121
+ GRPO: Utilizing GRPO for potentially more stable and effective fine-tuning.
122
 
123
  Newer Dataset: Training on an expanded and refined dataset for Glyph Code Logic Flow.
124