Space: arad1367/Llama-3.1-8b-Chatbot
This PR allows up to 2 minutes per response

by Fabrice-TIERCELIN - opened Oct 1, 2024

Files changed (1)

app.py CHANGED

@@ -47,7 +47,7 @@ model = AutoModelForCausalLM.from_pretrained(
     device_map="auto",
     quantization_config=quantization_config)
-@spaces.GPU()
+@spaces.GPU(duration=120)
 def stream_chat(
     message: str,
     history: list,
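
For context, here is a minimal sketch of how the changed decorator is applied. It assumes the `spaces` package that Hugging Face provides on ZeroGPU Spaces, where `duration` is understood to be the maximum number of seconds of GPU time requested per call. The fallback `gpu` decorator and the placeholder body of `stream_chat` are hypothetical stand-ins so the snippet runs locally; they are not the code from app.py.

```python
try:
    import spaces  # provided on Hugging Face ZeroGPU Spaces
    gpu = spaces.GPU
except ImportError:
    # Hypothetical no-op stand-in so this sketch runs without `spaces`.
    def gpu(func=None, *, duration=None):
        def wrap(f):
            return f
        # Support both @gpu and @gpu(...) usage.
        return wrap(func) if callable(func) else wrap


@gpu(duration=120)  # request up to 120 seconds of GPU time per call
def stream_chat(message: str, history: list):
    # Placeholder body (assumption): the real function streams model tokens.
    for word in f"echo: {message}".split():
        yield word


if __name__ == "__main__":
    print(" ".join(stream_chat("hello", [])))
```

A longer `duration` gives slow generations more time to finish before the GPU allocation ends, which appears to be the intent of this PR.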