File size: 3,752 Bytes
eda05be
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9f6d116
eda05be
9f6d116
5bc5c33
f4a5ba1
620cfb0
9f6d116
a4a5a47
9f6d116
 
 
a4a5a47
9f6d116
a4a5a47
f4a5ba1
a4a5a47
9f6d116
 
 
a4a5a47
9f6d116
a4a5a47
f4a5ba1
a4a5a47
ba18231
1d2fee2
ba18231
 
1d2fee2
a4a5a47
ba18231
 
a4a5a47
ba18231
 
 
1d2fee2
ba18231
9f6d116
ba18231
f4a5ba1
 
 
b770c74
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
<p align="center">
  <img src="https://cdn.prod.website-files.com/66f422128b6d0f3351ce41e3/66fd07dc0b6994070ec5b54b_Logo%20Rhesis%20Orange-p-500.png" alt="Rhesis Logo" width="300"/>
</p>


<p align="center">
  <a href="https://pypi.org/project/rhesis-sdk/">
    <img src="https://img.shields.io/pypi/v/rhesis-sdk" alt="PyPI Version" style="display:inline-block;">
  </a>
  <a href="https://pypi.org/project/rhesis-sdk/">
    <img src="https://img.shields.io/pypi/pyversions/rhesis-sdk" alt="Python Versions" style="display:inline-block;">
  </a>
  <a href="https://discord.rhesis.ai">
    <img src="https://img.shields.io/discord/1340989671601209408?color=7289da&label=Discord&logo=discord&logoColor=white" alt="Discord" style="display:inline-block;">
  </a>
  <a href="https://www.linkedin.com/company/rhesis-ai">
    <img src="https://img.shields.io/badge/LinkedIn-Rhesis_AI-blue?logo=linkedin" alt="LinkedIn" style="display:inline-block;">
  </a>
  <a href="https://huggingface.co/rhesis">
    <img src="https://img.shields.io/badge/🤗-Rhesis-yellow" alt="Hugging Face" style="display:inline-block;">
  </a>
  <a href="https://docs.rhesis.ai">
    <img src="https://img.shields.io/badge/docs-rhesis.ai-blue" alt="Documentation" style="display:inline-block;">
  </a>
</p>


> Open-source test generation SDK for LLM applications.
 

Rhesis AI provides curated and dynamically generated test sets to evaluate LLM applications under diverse conditions. These datasets help assess robustness, reliability, and compliance in real-world scenarios.  

### Using our datasets  

Our datasets are designed to test various aspects of LLM application behavior, from reliability to safety and bias detection. To get started:  

1. Browse the available test sets here on Hugging Face.  
2. Select the dataset that aligns with your evaluation needs.  
3. Load and apply the test cases to assess your application’s behavior.  

For more advanced testing and seamless integration, the [Rhesis SDK](https://github.com/rhesis-ai/rhesis-sdk) provides tools to automate dataset handling, generate structured test cases, and streamline evaluation workflows.  

## Key features  

- **Curated Test Sets** – Pre-built datasets covering diverse evaluation criteria.  
- **Dynamic Test Generation** – Generate custom test sets tailored to specific use cases.  
- **Scalability** – Use datasets for one-off evaluations or integrate them into automated testing pipelines.  

For questions or custom datasets, reach out at **hello@rhesis.ai**.  

### Example use cases:

- **AI Financial Advisor**:  
   Evaluate the reliability and accuracy of financial guidance provided by LLM applications, ensuring sound advice for users.
  
- **AI Claim Processing**:  
   Test for and eliminate biases in LLM-supported claim decisions, ensuring fair and compliant processing of insurance claims.

- **AI Sales Advisor**:  
   Validate the accuracy of product recommendations, enhancing customer satisfaction and driving more successful sales.

- **AI Support Chatbot**:  
   Ensure that your chatbot consistently delivers helpful, accurate, and empathetic responses across various scenarios.

### Disclaimer

Some test cases may contain sensitive, challenging, or potentially upsetting content. These cases are included to ensure thorough and realistic assessments. Users should review test cases carefully and exercise discretion when utilizing them.

### Connect with us  

For more details about our testing platform, datasets, and solutions, including the Rhesis AI SDK, visit [Rhesis AI](https://www.rhesis.ai/).  
Join our **[Discord community](https://discord.rhesis.ai)** to connect with other AI engineers, discuss best practices, and stay updated on new test sets.