Lucas Armand
|
9bc9ba11c5
|
Increase TGI benchmark tokens to 500
|
2026-04-30 14:04:39 -07:00 |
|
Lucas Armand
|
e02f4bc943
|
Lowered concurrency of vLLM and TGI benchmarks
|
2025-12-17 11:55:33 -08:00 |
|
Lucas Armand
|
9daf171487
|
Increase queue limits for vLLM and TGI
|
2025-12-17 11:38:55 -08:00 |
|
LucasArmandVast
|
4380d98c01
|
Use PyWorker SDK (#67)
* Change PyWorker to Worker SDK
* Moved /lib to vast-sdk (https://github.com/vast-ai/vast-sdk)
|
2025-12-15 19:33:03 -08:00 |
|