vllm complete

Options

--url

URL of the running OpenAI-compatible RESTful API server.

Default: http://localhost:8000/v1

--model-name

The model name used for prompt completion. Defaults to the first model returned by the list models API call.

Default: None

--api-key

API key for OpenAI services. If provided, this API key overrides the API key obtained from environment variables.

Default: None
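
The fallback behavior described for --model-name and --api-key can be sketched in Python as follows. This is an illustration of the documented precedence only, not the actual vLLM implementation: the function names are hypothetical, and the environment variable name OPENAI_API_KEY is an assumption, since the text above does not name the variable.

```python
import os
from typing import Mapping, Optional, Sequence


def resolve_api_key(cli_key: Optional[str],
                    env: Mapping[str, str] = os.environ) -> Optional[str]:
    """Return the value of --api-key if given, else the environment value."""
    # OPENAI_API_KEY is an assumed variable name, used only for illustration.
    if cli_key is not None:
        return cli_key
    return env.get("OPENAI_API_KEY")


def resolve_model_name(cli_name: Optional[str],
                       available: Sequence[str]) -> str:
    """Return --model-name if given, else the first model the server lists."""
    if cli_name is not None:
        return cli_name
    if not available:
        raise ValueError("the server reported no models")
    # 'available' stands in for the IDs returned by the list models API call.
    return available[0]
```

With both options left at their default of None, the sketch picks the first listed model and whatever key the environment supplies.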

-q, --quick

Send a single prompt and print the completion output, then exit.

Default: None
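
As a rough usage sketch, the options above can be assembled into a command line from Python. The helper name build_complete_command is hypothetical, the prompt text is a placeholder, and the sketch assumes that --quick takes the prompt as its value. It only builds and prints the command, so running it requires neither vLLM nor a live server.

```python
import shlex
from typing import List, Optional


def build_complete_command(prompt: Optional[str] = None,
                           url: str = "http://localhost:8000/v1",
                           model_name: Optional[str] = None,
                           api_key: Optional[str] = None) -> List[str]:
    """Build an argv list for `vllm complete` from the documented options."""
    cmd = ["vllm", "complete", "--url", url]
    # Options whose default is None are omitted unless a value is supplied.
    if model_name is not None:
        cmd += ["--model-name", model_name]
    if api_key is not None:
        cmd += ["--api-key", api_key]
    if prompt is not None:
        # Assumption: --quick takes the prompt text as its argument.
        cmd += ["--quick", prompt]
    return cmd


cmd = build_complete_command(prompt="The capital of France is")
# Print a shell-quoted form that can be pasted into a terminal.
print(shlex.join(cmd))
```

To actually send the prompt, the resulting list could be passed to subprocess.run(cmd, check=True) on a machine where vllm is installed and the server at the given URL is running.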