chris mckenzie 905255932b Merge pull request #1 from kristopolous/dependabot/pip/pip-ff0b128437
build(deps): bump transformers from 4.43.2 to 4.48.0 in the pip group across 1 directory

Vast PyWorker

Vast PyWorker is a Python web server designed to run alongside an LLM or image generation model on a Vast instance, enabling autoscaler integration. It serves as the primary entry point for API requests and forwards them to the model's API hosted on the same instance. It also monitors performance metrics and estimates the current workload, based on factors such as the number of tokens processed (for LLMs) or the image resolution and step count (for image generation models), and reports these metrics to the autoscaler.
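
To make the workload estimate concrete, here is a minimal sketch of how such a figure could be derived per request and summed for reporting. It is an illustration under stated assumptions, not PyWorker's actual code: the class names, the `cur_load` and `total_completed` field names, and the 512x512 normalization are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class LLMRequest:
    prompt_tokens: int
    max_new_tokens: int


@dataclass
class ImageRequest:
    width: int
    height: int
    steps: int


def llm_workload(req: LLMRequest) -> float:
    # Assumption: cost scales with the total number of tokens handled.
    return float(req.prompt_tokens + req.max_new_tokens)


def image_workload(req: ImageRequest) -> float:
    # Assumption: cost scales with pixel count times step count,
    # normalized so that one step at 512x512 counts as 1.0.
    return req.width * req.height * req.steps / (512 * 512)


@dataclass
class WorkloadTracker:
    """Tracks estimated workload of in-flight and completed requests."""

    in_flight: dict = field(default_factory=dict)
    completed: float = 0.0

    def start(self, request_id: str, workload: float) -> None:
        # Record the estimate when the request is forwarded to the model.
        self.in_flight[request_id] = workload

    def finish(self, request_id: str) -> None:
        # Move the estimate from in-flight to completed once the model responds.
        self.completed += self.in_flight.pop(request_id)

    def report(self) -> dict:
        # Hypothetical payload shape; the real autoscaler fields may differ.
        return {
            "cur_load": sum(self.in_flight.values()),
            "total_completed": self.completed,
        }
```

A server built this way would call `start` before forwarding a request, `finish` when the response returns, and send `report()` to the autoscaler on a timer.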

How to Use

To use the autoscaler, simply pick one of Vast's autoscaler templates. To implement PyWorker for a template that is not marked as autoscaler-compatible on Vast, refer to workers/hello_world/README.md.
