Uncomment and activate LangChain and HuggingFace integration 78c6106 Maximofn committed 13 days ago
Upgrade Python base image to 3.13-slim and adjust Dockerfile b4b56e1 Maximofn committed 13 days ago
Switch from local SmolLM2 model to Qwen2.5-72B-Instruct via HuggingFace Inference API e55b892 Maximofn committed 13 days ago
Configure cache directories and add system prompt for local model 4aaa328 Maximofn committed 13 days ago
Switch from HuggingFace InferenceClient to local model loading 34aabf7 Maximofn committed 13 days ago
Add accelerate library to requirements for model optimization 167addd Maximofn committed 13 days ago
Update Dockerfile to handle HuggingFace token secret and adjust port d655e0b Maximofn committed 13 days ago
Refactor app.py with improved model initialization and endpoint structure e4413df Maximofn committed 13 days ago
Re-enable HuggingFace model integration and restore generate endpoint e2f16cb Maximofn committed 13 days ago
Update Dockerfile to change FastAPI server port from 8000 to 7860 27abb6b Maximofn committed 13 days ago
Temporarily disable HuggingFace model integration and simplify FastAPI endpoints b0424cf Maximofn committed 13 days ago
Switch HuggingFace model from Qwen to SmolLM for lightweight inference a824dd2 Maximofn committed 13 days ago
Refactor HuggingFace model integration and simplify token handling 159fcfe Maximofn committed 13 days ago
Add token logging and debug information for HuggingFace authentication 20d2599 Maximofn committed 13 days ago
Update project documentation and dependencies to English, modify HuggingFace model initialization 5f2f118 Maximofn committed 13 days ago