diff --git a/.gitignore b/.gitignore index 226869e..dc47eed 100644 --- a/.gitignore +++ b/.gitignore @@ -2,4 +2,5 @@ .envrc __pycache__ bin/ -lib64 \ No newline at end of file +lib64 +.venv \ No newline at end of file diff --git a/README.md b/README.md index 117600d..dda0ea2 100644 --- a/README.md +++ b/README.md @@ -39,11 +39,12 @@ reporting these metrics to the autoscaler. If you are using a Vast.ai template that includes PyWorker integration (marked as autoscaler compatible), it should work out of the box. The template will typically start the appropriate PyWorker server automatically. Here's a few: -* **TGI (Text Generation Inference):** [Vast.ai Template](https://cloud.vast.ai?ref_id=140778&template_id=72d8dcb41ea3a58e06c741e2c725bc00) -* **ComfyUI:** [Vast.ai Template](https://cloud.vast.ai?ref_id=140778&template_id=ad72c8bf7cf695c3c9ddf0eaf6da0447) +* **vLLM:** [Vast.ai Template](https://cloud.vast.ai?ref_id=62897&template_id=63ae93902bf3978bea033782592b784d) +* **TGI (Text Generation Inference):** [Vast.ai Template](https://cloud.vast.ai?ref_id=62897&template_id=6fa6bd5bdf5f0df63db80e40b086037d) +* **ComfyUI:** [Vast.ai Template](https://cloud.vast.ai?ref_id=62897&template_id=e6748878ba688e765e3e9fca29541938) Currently available workers: -* `hello_world`: A simple example worker for a basic LLM server. +* `openai`: A simple example worker for a basic vLLM server. * `comfyui`: A worker for the ComfyUI image generation backend. * `tgi`: A worker for the Text Generation Inference backend.