--- title: LongCePO Chatbot (Sambanova) emoji: 🤖 colorFrom: blue colorTo: green sdk: gradio sdk_version: 5.27.1 app_file: app.py pinned: false --- # LongCePO Chatbot with Sambanova Backend This is a simple chatbot interface demonstrating the LongCePO (Long-Context Planning and Optimization) method using a Sambanova model (`Llama-4-Maverick-17B-128E-Instruct`) as the backend LLM. ## How it works The LongCePO method is designed to handle long contexts (potentially millions of tokens) by: 1. **Planning:** Decomposing the initial query into sub-questions. 2. **MapReduce:** Answering each sub-question by processing chunks of the long context, summarizing relevant information, and aggregating results. This application takes a long text context and a query based on that context. It then uses the modified `longcepo` plugin (originally from the `optillm` repository) to generate an answer using the Sambanova API. ## How to use 1. **(Optional)** Enter a system prompt to guide the chatbot's behavior. 2. Paste the long text document into the **Context** box. 3. Enter your question based on the provided context into the **Query** box. 4. Click **Submit**. The chatbot will process the request using the LongCePO pipeline and display the final answer. **Note:** Processing long contexts can take some time depending on the length of the context and the complexity of the query. ## API Key This application requires a Sambanova API key to function. The key should be stored as a Hugging Face Space Secret named `SAMBANOVA_API_KEY`.