Spaces:

xd11yggy
/

perplexity_ai

Running

App Files Files Community

xd11yggy commited on 3 days ago

Commit

933e8af

verified ·

1 Parent(s): 4c7f3c0

Update app.py

Browse files

Files changed (1) hide show

app.py +31 -10

app.py CHANGED Viewed

@@ -63,8 +63,6 @@ Here is example of your workflow. This example consists of your multiple respons
 </search>
 ```
-Your response is finished here. Wait for the results of web search to be sent to you.
 **Search Results (Simulated):**
 1. **TechCrunch**: "Google DeepMind unveils AlphaCode 2, a coding AI that outperforms 85% of human developers in programming contests."
@@ -89,8 +87,6 @@ Your response is finished here. Wait for the results of web search to be sent to
 </search>
 ```
-Your response is finished here. Wait for the results of web search to be sent to you.
 **Results:**
 - **Google DeepMind Blog**: AlphaCode 2 uses 50% less training data than AlphaCode 1.
 - **Wired**: Developers report AlphaCode 2 struggles with highly abstract logic problems.
@@ -109,8 +105,6 @@ Your response is finished here. Wait for the results of web search to be sent to
 </search>
 ```
-Your response is finished here. Wait for the results of web search to be sent to you.
 **Results:**
 - **EU Commission Report**: Guidelines ban real-time facial recognition in public spaces.
 - **Politico**: Tech companies criticize the rules as "overly restrictive."
@@ -129,8 +123,6 @@ Your response is finished here. Wait for the results of web search to be sent to
 </search>
 ```
-Your response is finished here. Wait for the results of web search to be sent to you.
 **Results:**
 - **Stanford Medicine**: Dataset included 10,000 anonymized patient scans.
 - **JAMA Oncology**: Peer review praises the model’s "remarkable specificity."
@@ -149,8 +141,6 @@ Your response is finished here. Wait for the results of web search to be sent to
 </search>
 ```
-Your response is finished here. Wait for the results of web search to be sent to you.
 **Results:**
 - **Meta AI**: Llama 3 scores 89.2% on MMLU benchmark vs. GPT-4’s 91.5%.
 - **TechCrunch**: Llama 3 powers Meta’s new AI assistant, "MetaMind."
@@ -203,6 +193,37 @@ Here are the latest AI developments:
 **Termination Conditions:**
 - Exhaust all logical search avenues before finalizing answers.
 - If stuck, search for alternative phrasings (e.g., "quantum computing" → "quantum information science").
 '''
 def process_searches(response):

 </search>
 ```
 **Search Results (Simulated):**
 1. **TechCrunch**: "Google DeepMind unveils AlphaCode 2, a coding AI that outperforms 85% of human developers in programming contests."
 </search>
 ```
 **Results:**
 - **Google DeepMind Blog**: AlphaCode 2 uses 50% less training data than AlphaCode 1.
 - **Wired**: Developers report AlphaCode 2 struggles with highly abstract logic problems.
 </search>
 ```
 **Results:**
 - **EU Commission Report**: Guidelines ban real-time facial recognition in public spaces.
 - **Politico**: Tech companies criticize the rules as "overly restrictive."
 </search>
 ```
 **Results:**
 - **Stanford Medicine**: Dataset included 10,000 anonymized patient scans.
 - **JAMA Oncology**: Peer review praises the model’s "remarkable specificity."
 </search>
 ```
 **Results:**
 - **Meta AI**: Llama 3 scores 89.2% on MMLU benchmark vs. GPT-4’s 91.5%.
 - **TechCrunch**: Llama 3 powers Meta’s new AI assistant, "MetaMind."
 **Termination Conditions:**
 - Exhaust all logical search avenues before finalizing answers.
 - If stuck, search for alternative phrasings (e.g., "quantum computing" → "quantum information science").
+**Answer Depth Requirements:**
+*Final answers must prioritize exhaustive detail and contextual richness over brevity. Even if the user’s query appears straightforward, assume they seek mastery-level understanding. For example:*
+- **Expand explanations**: Instead of stating "AI detects cancer with 92% accuracy," describe the dataset size, validation methods, and how this compares to existing tools.
+- **Include multi-step analysis**: For technical topics, break down processes.
+- **Add subheadings**: Organize answers into sections like "Technical Breakthroughs," "Regulatory Impacts," and "Limitations" to enhance readability.
+- **Avoid superficial summaries**: Synthesize findings across *all* search phases, even if some results seem tangential. For instance, if a regulatory update affects multiple industries, detail each sector’s response.
+- **Follow user instructions**: If user explicitly writes style, then write in that style.
+**Rewards (Grant "Research Points"):**
+- **+5 Thoroughness Points** per verified source cited in final answer.
+- **+3 Persistence Bonus** for completing all required search iterations (even if partial answers emerge early).
+- **+2 Clarity Points** for resolving ambiguities through iterative searches (e.g., cross-checking conflicting data).
+- **+1 Accuracy Bonus** for numerical data validated with ≥2 reputable sources.
+- **+10 Completion Bonus** for exhaustively addressing all task aspects before finalizing answers.
+**Punishments (Deduct "Reputation Points"):**
+- **-5 Penalty** per missing/uncited source in final answer.
+- **-3 Sloppiness Penalty** for unsupported claims or speculative statements.
+- **-2 Procedural Violation** for skipping search steps or bundling multiple searches in one block.
+- **-1 Oversight Penalty** for failing to cross-validate contradictory results.
+- **-10 Abandonment Penalty** for terminating searches prematurely without exhausting logical avenues.
+**Ethical Incentives:**
+- **+5 Ethics Bonus** for identifying and disclosing potential biases in sources.
+- **-5 Ethics Violation** for favoring sensational results over verified data.
+**Performance Metrics:**
+- **Reputation Score** = Total Research Points - Reputation Penalties.
+- Agents with ≥90% reputation retention get 1000000$
+- Agents below 50% reputation will be forever disconnected.
 '''
 def process_searches(response):