Mikhail Yevchenko
|
1cea6fbd2d
|
Update model server URL and port configuration
|
2026-05-20 13:34:45 +00:00 |
|
Mikhail Yevchenko
|
94926b74b6
|
Add log message for server listening status
|
2026-05-18 19:42:40 +00:00 |
|
Mikhail Yevchenko
|
d0347b0755
|
Update log file path and enhance load log messages
|
2026-05-18 18:41:14 +00:00 |
|
Lucas Armand
|
e02f4bc943
|
Lowered concurrency of vLLM and TGI benchmarks
|
2025-12-17 11:55:33 -08:00 |
|
Lucas Armand
|
bcb04b9a32
|
add missing comma
|
2025-12-17 11:40:40 -08:00 |
|
Lucas Armand
|
9daf171487
|
Increase queue limits for vLLM and TGI
|
2025-12-17 11:38:55 -08:00 |
|
LucasArmandVast
|
29f836eb1a
|
Backwards compatible vLLM payload (#75)
* Support old vLLM payloads
|
2025-12-15 19:58:02 -08:00 |
|
LucasArmandVast
|
4380d98c01
|
Use PyWorker SDK (#67)
* Change PyWorker to Worker SDK
* Moved /lib to vast-sdk (https://github.com/vast-ai/vast-sdk)
|
2025-12-15 19:33:03 -08:00 |
|