Create a completion for the provided prompt and parameters. Compatible with OpenAI’s chat completions API. Supports 40+ models including GPT-4, Claude, Gemini, and CaseMark legal AI models. Includes streaming support, token counting, and usage tracking.
Documentation Index
Fetch the complete documentation index at: https://docs.case.dev/llms.txt
Use this file to discover all available pages before exploring further.
API key starting with sk_case_
List of messages comprising the conversation
Model to use for completion. Defaults to casemark/core-large if not specified
"casemark/core-large"
Maximum number of tokens to generate
1000
Sampling temperature between 0 and 2
0.7
Whether to stream back partial progress
false
CaseMark-only: when true, allows reasoning fields in responses. Defaults to false (reasoning is suppressed).
false
Nucleus sampling parameter
Frequency penalty parameter
Presence penalty parameter