Lucas Armand
c05131cd14
explicit not None check
2025-12-15 19:55:43 -08:00
Lucas Armand
ebaf3b6d3a
Add fix
2025-12-15 19:51:27 -08:00
LucasArmandVast
4380d98c01
Use PyWorker SDK ( #67 )
...
* Change PyWorker to Worker SDK
* Moved /lib to vast-sdk (https://github.com/vast-ai/vast-sdk )
2025-12-15 19:33:03 -08:00
Colter Downing
6b5b1341a7
update tgi client
2025-12-03 18:38:42 -08:00
Colter-Downing
8be92c03de
Merge pull request #69 from vast-ai/AUTO-874--fix-openai-worker-client
...
defaults to ENDPOINT_NAME and DEFAULT_MODEL but uses the flag first
2025-12-03 16:59:56 -08:00
Colter Downing
adedb8ba90
defaults to ENDPOINT_NAME and DEFAULT_MODEL but uses the flag first if present
2025-12-03 16:57:28 -08:00
Lucas Armand
0bcd2219ea
Increase model wait time for vLLM
2025-12-03 12:38:52 -08:00
Lucas Armand
e0449cb3c7
add llama log
2025-11-21 10:22:16 -08:00
Lucas Armand
3adec1826d
minor changes
2025-11-11 17:11:38 -08:00
Lucas Armand
b55bfa9611
Updated clients, include vastai-sdk, handle non-UTF-8
2025-11-11 17:09:28 -08:00
Colter Downing
bcecd6df40
Suppress matplot debug logs
2025-10-25 16:18:02 -07:00
Nader Arbabian
0bf3247a34
fix completions and interactive client
2025-08-11 12:37:53 -07:00
Nader Arbabian
8804e17201
download vast.ai's root certificate in order to make pyworker requests ( #25 )
2025-08-08 17:04:16 -07:00
Rob Ballantyne
e0be45f39a
Addresses breaking change in core pyworker ( #22 )
...
* Addresses breaking change in test_utils.py
Endpoint.get_endpoint_api_key() now requires instance
Moves the call to this function out of the APIClient and into main
* Ensure make_benchmark_payload has a value to calculate the workload
---------
Co-authored-by: Nader Arbabian <nader@vast.ai >
2025-07-18 16:11:10 -07:00
Nader Arbabian
be2aafdb1f
fix pyright errors + revert to old way of handling cancelled api requests ( #23 )
2025-07-17 16:59:06 -07:00
Rob Ballantyne
69d9b7455f
OpenAI compatible worker ( #19 )
...
Adds initial support for OpenAI compatible inference servers
Available endpoints:
- `/v1/completions`
- `/v1/chat/completions`
2025-07-16 09:46:26 +01:00