vllm complete

Options

--url

URL of the running OpenAI-compatible RESTful API server.

Default: http://localhost:8000/v1

--model-name

The model name to use for prompt completion. Defaults to the first model returned by the list-models API call.

Default: None
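
As a rough illustration of that default, the sketch below picks the first model ID out of a list-models response. It assumes the response body follows the OpenAI-style shape `{"data": [{"id": ...}]}`. `first_model_id` and the sample model names are hypothetical and not part of vLLM.

```python
import json


def first_model_id(models_response: str) -> str:
    """Return the ID of the first model in an OpenAI-style list-models response."""
    payload = json.loads(models_response)
    models = payload.get("data", [])
    if not models:
        raise ValueError("the server reported no models")
    return models[0]["id"]


# Sample body shaped like a list-models response (model names are made up).
sample = '{"object": "list", "data": [{"id": "model-a"}, {"id": "model-b"}]}'
print(first_model_id(sample))
```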

--api-key

API key for OpenAI services. If provided, this key overrides any API key obtained from environment variables.

Default: None
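
The precedence described above can be sketched as follows. This is a minimal illustration, not vLLM's implementation: `resolve_api_key` is a hypothetical helper, and the environment variable name `OPENAI_API_KEY` is an assumption based on the convention used by OpenAI clients.

```python
import os
from typing import Mapping, Optional


def resolve_api_key(
    cli_key: Optional[str],
    env: Mapping[str, str] = os.environ,
) -> Optional[str]:
    """Prefer an explicitly supplied key; otherwise fall back to the environment."""
    if cli_key is not None:
        return cli_key
    # Assumed variable name; adjust if your setup uses a different one.
    return env.get("OPENAI_API_KEY")


# An explicit key wins over the environment value.
print(resolve_api_key("key-from-cli", {"OPENAI_API_KEY": "key-from-env"}))
```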

-q, --quick

Send a single prompt and print the completion output, then exit.

Default: None
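
For comparison, a single-prompt request against an OpenAI-compatible server could be built roughly like this using only the Python standard library. It is a sketch under assumptions: the `/completions` path and the `model`/`prompt` fields follow the OpenAI completions API, and `build_completion_request` and `my-model` are hypothetical names.

```python
import json
import urllib.request
from typing import Optional


def build_completion_request(
    base_url: str,
    model: str,
    prompt: str,
    api_key: Optional[str] = None,
) -> urllib.request.Request:
    """Build (but do not send) a POST request for a single text completion."""
    headers = {"Content-Type": "application/json"}
    if api_key:
        headers["Authorization"] = f"Bearer {api_key}"
    body = json.dumps({"model": model, "prompt": prompt}).encode("utf-8")
    return urllib.request.Request(
        base_url.rstrip("/") + "/completions",
        data=body,
        headers=headers,
        method="POST",
    )


req = build_completion_request("http://localhost:8000/v1", "my-model", "Hello")
print(req.get_method(), req.full_url)

# To send it to a running server and print the first completion:
# with urllib.request.urlopen(req) as resp:
#     print(json.load(resp)["choices"][0]["text"])
```

Building the request separately from sending it keeps the example runnable without a server; uncomment the last lines once a server is listening at the base URL.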