Mikhail Yevchenko
|
e8e04fe8bc
|
Remove commented out info log actions from log configuration
|
2026-05-21 19:55:13 +00:00 |
|
Mikhail Yevchenko
|
6cb3acdd64
|
Add logging import and set logger level to WARNING
|
2026-05-21 19:50:21 +00:00 |
|
Mikhail Yevchenko
|
586ccbff1b
|
Update log action configuration to specify detailed error and info messages
|
2026-05-21 19:47:16 +00:00 |
|
Mikhail Yevchenko
|
bcc6b62277
|
Update log action configuration to enable error and info logging
|
2026-05-21 19:40:47 +00:00 |
|
Mikhail Yevchenko
|
3285d9118f
|
Enhance completions benchmark generator to extract words from a fallback Perl copyright file
|
2026-05-21 19:33:41 +00:00 |
|
Mikhail Yevchenko
|
f77d943d79
|
Refactor log message handling and improve word extraction in completions benchmark
|
2026-05-21 19:25:09 +00:00 |
|
Mikhail Yevchenko
|
976622a594
|
Remove nltk dep
|
2026-05-21 19:11:53 +00:00 |
|
Mikhail Yevchenko
|
3898a8a651
|
Update model server URL to remove port specification
|
2026-05-21 18:50:41 +00:00 |
|
Mikhail Yevchenko
|
170571714f
|
Add log message for model load event in Ollama configuration
|
2026-05-21 15:27:28 +00:00 |
|
Mikhail Yevchenko
|
81347ab8a0
|
Remove placeholder log messages for model load, error, and info
|
2026-05-21 15:11:30 +00:00 |
|
Mikhail Yevchenko
|
6bb0097829
|
Refactor model configuration and update log messages for Ollama
|
2026-05-21 15:11:25 +00:00 |
|
Mikhail Yevchenko
|
1cea6fbd2d
|
Update model server URL and port configuration
|
2026-05-20 13:34:45 +00:00 |
|
Mikhail Yevchenko
|
94926b74b6
|
Add log message for server listening status
|
2026-05-18 19:42:40 +00:00 |
|
Mikhail Yevchenko
|
d0347b0755
|
Update log file path and enhance load log messages
|
2026-05-18 18:41:14 +00:00 |
|
Lucas Armand
|
9bc9ba11c5
|
Increase TGI benchmark tokens to 500
|
2026-04-30 14:04:39 -07:00 |
|
Lucas Armand
|
e02f4bc943
|
Lowered concurrency of vLLM and TGI benchmarks
|
2025-12-17 11:55:33 -08:00 |
|
Lucas Armand
|
bcb04b9a32
|
add missing comma
|
2025-12-17 11:40:40 -08:00 |
|
Lucas Armand
|
9daf171487
|
Increase queue limits for vLLM and TGI
|
2025-12-17 11:38:55 -08:00 |
|
LucasArmandVast
|
29f836eb1a
|
Backwards compatible vLLM payload (#75)
* Support old vLLM payloads
|
2025-12-15 19:58:02 -08:00 |
|
LucasArmandVast
|
4380d98c01
|
Use PyWorker SDK (#67)
* Change PyWorker to Worker SDK
* Moved /lib to vast-sdk (https://github.com/vast-ai/vast-sdk)
|
2025-12-15 19:33:03 -08:00 |
|
Colter Downing
|
222ac2a0dd
|
default endpoint name
|
2025-12-04 10:54:55 -08:00 |
|
Colter Downing
|
40aed9b5f8
|
adding s3 as an option
|
2025-12-04 10:52:57 -08:00 |
|
Colter Downing
|
d4d36bf86e
|
done with comfy updates
|
2025-12-03 20:45:55 -08:00 |
|
Colter Downing
|
e839cfc6e8
|
include view in API wrapper
|
2025-12-03 20:22:45 -08:00 |
|
Colter Downing
|
f04138e13b
|
update to be able to get images
|
2025-12-03 20:16:25 -08:00 |
|
Colter Downing
|
6b5b1341a7
|
update tgi client
|
2025-12-03 18:38:42 -08:00 |
|
Colter-Downing
|
8be92c03de
|
Merge pull request #69 from vast-ai/AUTO-874--fix-openai-worker-client
defaults to ENDPOINT_NAME and DEFAULT_MODEL but uses the flag first
|
2025-12-03 16:59:56 -08:00 |
|
Colter Downing
|
adedb8ba90
|
defaults to ENDPOINT_NAME and DEFAULT_MODEL but uses the flag first if present
|
2025-12-03 16:57:28 -08:00 |
|
Lucas Armand
|
0bcd2219ea
|
Increase model wait time for vLLM
|
2025-12-03 12:38:52 -08:00 |
|
Lucas Armand
|
e0449cb3c7
|
add llama log
|
2025-11-21 10:22:16 -08:00 |
|
Lucas Armand
|
3adec1826d
|
minor changes
|
2025-11-11 17:11:38 -08:00 |
|
Lucas Armand
|
b55bfa9611
|
Updated clients, include vastai-sdk, handle non-UTF-8
|
2025-11-11 17:09:28 -08:00 |
|
Abiola Akinnubi
|
2cde573c56
|
Merge pull request #48 from vast-ai/comfy-request-idx
Added request_idx to comfy auth_data
|
2025-10-30 11:27:35 -07:00 |
|
Abiola Akinnubi
|
944f83fc03
|
Removed extra spaces from operator assignment
|
2025-10-28 21:03:52 +00:00 |
|
LucasArmandVast
|
d6a6e34c6b
|
Merge branch 'main' into new-pyworker
|
2025-10-27 12:43:49 -07:00 |
|
Abiola Akinnubi
|
f56bbc0ebe
|
Added request_idx to comfy auth_data
|
2025-10-27 03:17:06 +00:00 |
|
Colter Downing
|
bcecd6df40
|
Suppress matplot debug logs
|
2025-10-25 16:18:02 -07:00 |
|
Rob Ballantyne
|
f4f7080df1
|
Re-add comment
|
2025-10-23 17:00:28 +01:00 |
|
Rob Ballantyne
|
d51a338e8f
|
log when benchmark file not used
|
2025-10-23 16:41:02 +01:00 |
|
Rob Ballantyne
|
92a04bd7af
|
No silent fail if benchmark file is missing
|
2025-10-23 13:41:03 +01:00 |
|
Rob Ballantyne
|
ec25dda3ad
|
Merge branch 'vast-ai:main' into feat/comfyui-json-benchmark-workflow-from-file
|
2025-10-08 14:49:32 +01:00 |
|
Rob Ballantyne
|
4fdc314fd9
|
Fix healthcheck endpoint URL
|
2025-10-06 22:16:09 +01:00 |
|
Rob Ballantyne
|
3786cf978d
|
Add awareness of errors thrown by the provisioning script
|
2025-10-05 23:14:59 +01:00 |
|
Rob Ballantyne
|
a86d4bcf9c
|
Import json
|
2025-10-05 23:05:33 +01:00 |
|
Rob Ballantyne
|
e9b6a14a5e
|
Import Path
|
2025-10-05 22:59:19 +01:00 |
|
Rob Ballantyne
|
cadac033e1
|
Enables use of custom workflow for benchmarking
Retains existing method if misc/benchmark.json is not present
|
2025-10-05 22:53:22 +01:00 |
|
abiola-vastai
|
38782d89bc
|
undo the fix for comfy yesterday.
|
2025-09-03 17:12:35 +00:00 |
|
abiola-vastai
|
b20d9e714c
|
Blind hotfix to see if comfy UI default is needed. If it does work we would revert back.
|
2025-09-03 01:20:09 +00:00 |
|
Rob Ballantyne
|
b8377c4081
|
Set cost to 100
|
2025-08-28 16:13:17 +01:00 |
|
Rob Ballantyne
|
703435d10e
|
Improve MODEL_SERVER_START_* messages
|
2025-08-26 12:42:04 +01:00 |
|