vllm-inference / README.md

Commit History

feat(add-model): always download model during build, it will be cached in the consecutive builds
8679a35

yusufs commited on

feat(download-model): add download model at runtime
fc30f26

yusufs commited on

feat(endpoint): add prefix /api on each endpoint
5f3bf21

yusufs commited on

feat(refactor): move the files to root
7935381

yusufs commited on

feat(first-commit): follow examples and tutorials
ae7cfbb

yusufs commited on

initial commit
1a7087e
verified

yusufs commited on