The name of the model to use.
The input prompt to complete.
Maximum number of tokens to generate (0 for unlimited).
Sampling temperature (default: 1.0).
Whether to stream the response (ignored in this implementation).
A JSONValue object in OpenAI-compatible format.
Performs an OpenAI-style text completion.