The name of the model to use.
An array of Message structs representing the chat history.
Maximum number of tokens to generate (0 for unlimited).
Sampling temperature (default: 1.0).
Whether to stream the response (ignored in this implementation).
A JSONValue object in OpenAI-compatible format.
Performs an OpenAI-style chat completion.