fyuuki0jp commited on
Commit
8953802
·
verified ·
1 Parent(s): d5bfbf1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +66 -2
README.md CHANGED
@@ -41,16 +41,80 @@ TODO:
41
  ```
42
 
43
  # model response format
 
 
44
  ```
45
  <thinking> model inference </thinking>
46
  <output> model answer </output>
47
  ```
48
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
49
  # RAG Usage
50
- this model in training rag context prompt with custom token
51
  ```
52
  <start_of_turn>system {system_prompt} <end_of_turn>
53
- <start_of_content> {rag content} <end_of_content>
54
  <start_of_turn>user {user_prompt} <end_of_turn>
 
55
  <start_of_turn>model
56
  ```
 
41
  ```
42
 
43
  # model response format
44
+ This model has been fine-tuned to perform CoT within <thinking></thinking> and to provide the final answer within <output></output>.
45
+ (Caution: This model may sometimes produce incorrect CoT, so please be aware of that.)
46
  ```
47
  <thinking> model inference </thinking>
48
  <output> model answer </output>
49
  ```
50
 
51
+ ## example prompts and model output(japanese)
52
+ ```
53
+ <start_of_turn>system
54
+ あなたは親切なアシスタントです。ユーザーの質問に対して親切に回答してください<end_of_turn>
55
+ <start_of_turn>user
56
+ A~Dの中で3人は正直者で、1人が嘘つきです。嘘つきはだれでしょう?
57
+
58
+ A:Dはうそをついている。
59
+
60
+ B:僕はうそをついていない。
61
+
62
+ C:Aはうそをついていない。
63
+
64
+ D:Bはうそをついている。<end_of_turn>
65
+ <start_of_turn>model
66
+ <thinking>
67
+ Alright, I need to determine who among the four individuals (A, B, C, D) is the liar based on the given statements. Let's break down each statement and analyze them step by step.
68
+
69
+ **Statement 1:** A: "D is lying."
70
+
71
+ * This statement implies that someone other than D is telling the truth. Since there are only four people, this means either A is truthful or D is not being honest. However, if D were lying, then according to Statement 1, A would be truthful. But in reality, A states that D is lying, so it seems contradictory unless we consider another interpretation.
72
+
73
+ **Statement 2:** B: "I am not lying."
74
+
75
+ * If B is telling the truth, they mean they are not lying. So, B must be truthful about their honesty. Therefore, B cannot lie.
76
+
77
+ **Statement 3:** C: "A is not lying."
78
+
79
+ * If C is truthful, they mean that A is not lying. In our current scenario, since B is truthful, C can truthfully state that A is not lying. Thus, C also cannot lie.
80
+
81
+ **Statement 4:** D: "B is not lying."
82
+
83
+ * If D is truthful, they mean that B is not lying. Now, let's see how all these statements fit together:
84
+ - **A**: Not lying → True
85
+ - **B**: Lying → False
86
+ - **C**: A is not lying → True
87
+ - **D**: B is not lying → True
88
+
89
+ If B is lying, then from Statements 1, 2, and 3, both A and C are truthful. However, if B is lying, then from Statement 4, D is truthful because he believes B is not lying. But wait a second—if B is lying, then from Statement 1, A should be truthful as well. Yet, from Statement 3, C says A is truthful, which contradicts the idea that A is lying if B is lying. This inconsistency suggests that perhaps the initial assumption that B lies might not hold up under closer scrutiny.
90
+
91
+ Let me try a different approach. Suppose instead of assuming B is lying, what if B tells the truth? Then:
92
+ - **A**: Not lying → True
93
+ - **B**: Telling the truth → True
94
+ - **C**: A is truthful → True
95
+ - **D**: B is lying → False
96
+
97
+ In this case, all statements align correctly with the truths. There's no contradiction here. Therefore, if B is truthful, then everyone else is truthful except for D.
98
+
99
+ But earlier, when considering whether B was lying, the conclusion led us back to an inconsistent situation where A could have been lying while C remained truthful, leading to contradictions. Perhaps there's a flaw in my reasoning process. Maybe I oversimplified the relationships between the statements.
100
+
101
+ Alternatively, maybe I made a mistake in interpreting some of the statements. For instance, if Statement 1 ("D is lying") is taken literally, then D is indeed lying. But if interpreted differently, such as implying that D is untruthful, but not necessarily lying outright, things get more complex.
102
+
103
+ Another angle: If I assume that B's statement is true, then B must be truthful. From Statement 3, C remains truthful regardless of B's statement. Similarly, from Statement 4, D must be truthful. This aligns perfectly with the truthfulness of B and D.
104
+
105
+ Therefore, after analyzing all possible interpretations and cross-verifying the consistency of the statements, it appears that the liar is D.
106
+
107
+ </thinking>
108
+ <Output>
109
+ Dは嘘つきである。
110
+ </Output><end_of_turn>
111
+ ```
112
+
113
  # RAG Usage
114
+ this model in training rag context prompt with custom token (experimental)
115
  ```
116
  <start_of_turn>system {system_prompt} <end_of_turn>
 
117
  <start_of_turn>user {user_prompt} <end_of_turn>
118
+ <start_of_content> {rag content} <end_of_content>
119
  <start_of_turn>model
120
  ```