OVHcloud AI Endpoints
All supported OVHcloud AI Endpoints models can be found here.
OVHcloud AI Endpoints is a managed AI inference service that provides access to a wide range of state-of-the-art machine learning models. As part of OVHcloud’s Public Cloud offering, AI Endpoints enables developers to easily integrate AI capabilities into their applications with data sovereignty, privacy, and GDPR compliance.
Supported tasks
Chat Completion (LLM)
Find out more about Chat Completion (LLM) here.
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://router.huggingface.co/v1",
    api_key=os.environ["HF_TOKEN"],
)

completion = client.chat.completions.create(
    model="openai/gpt-oss-20b:ovhcloud",
    messages=[
        {
            "role": "user",
            "content": "What is the capital of France?"
        }
    ],
)

print(completion.choices[0].message)
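If you prefer not to use the OpenAI client, the same request can be routed through the huggingface_hub InferenceClient. The sketch below is an assumption-based alternative: it assumes a recent huggingface_hub release whose InferenceClient accepts a provider argument and that "ovhcloud" is an accepted provider name; adjust to what your installed version supports.

import os
from huggingface_hub import InferenceClient

# Assumption: "ovhcloud" is a valid provider identifier for your huggingface_hub version.
client = InferenceClient(
    provider="ovhcloud",
    api_key=os.environ["HF_TOKEN"],
)

completion = client.chat.completions.create(
    model="openai/gpt-oss-20b",
    messages=[
        {"role": "user", "content": "What is the capital of France?"}
    ],
)

print(completion.choices[0].message)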
Chat Completion (VLM)
Find out more about Chat Completion (VLM) here.
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://router.huggingface.co/v1",
    api_key=os.environ["HF_TOKEN"],
)

completion = client.chat.completions.create(
    model="Qwen/Qwen2.5-VL-72B-Instruct:ovhcloud",
    messages=[
        {
            "role": "user",
            "content": [
                {
                    "type": "text",
                    "text": "Describe this image in one sentence."
                },
                {
                    "type": "image_url",
                    "image_url": {
                        "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
                    }
                }
            ]
        }
    ],
)

print(completion.choices[0].message)
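Because the router base URL shown above is OpenAI-compatible, the same vision request can also be sent with plain HTTP. This is a minimal sketch, assuming the standard /chat/completions path and an OpenAI-style JSON response; it is not an official client.

import os
import requests

# Assumption: the router follows the OpenAI-compatible /chat/completions path and response schema.
response = requests.post(
    "https://router.huggingface.co/v1/chat/completions",
    headers={"Authorization": f"Bearer {os.environ['HF_TOKEN']}"},
    json={
        "model": "Qwen/Qwen2.5-VL-72B-Instruct:ovhcloud",
        "messages": [
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": "Describe this image in one sentence."},
                    {
                        "type": "image_url",
                        "image_url": {
                            "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
                        },
                    },
                ],
            }
        ],
    },
)
response.raise_for_status()
print(response.json()["choices"][0]["message"])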
