orchid01

orchid01 is Orchid’s finance-native model. Use model id orchid01 in all requests. There are two modes — standard and thinking — controlled via the orchid config object in the request body.

Standard mode (default)

Fast, direct responses. Recommended for most tasks: document extraction, summarisation, Q&A over provided context, structured data conversion.
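# Assumes the OpenAI Python SDK; base URL and API key are read from OPENAI_BASE_URL / OPENAI_API_KEY.
from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
    model="orchid01",
    messages=[{"role": "user", "content": "Extract all covenants from this agreement..."}],
    # thinking is off by default
)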

| Setting | Value |
| --- | --- |
| Default max_tokens | 4096 |
| Default temperature | 0.1 |
| Recommended max_tokens | 4096–8192 |

Thinking mode

Extended reasoning for complex multi-step analysis. The model reasons through the problem before responding. Reasoning is visible in reasoning_content alongside the final answer in content.
response = client.chat.completions.create(
    model="orchid01",
    messages=[{"role": "user", "content": "Build a full covenant analysis..."}],
    extra_body={"orchid": {"thinking": True}},
)

# Access reasoning and answer separately
reasoning = response.choices[0].message.model_extra.get("reasoning_content", "")
answer    = response.choices[0].message.content

Thinking mode requires max_tokens ≥ 16,000. Reasoning tokens count toward the limit, so a smaller budget risks truncating the response before the final answer is complete.

| Setting | Value |
| --- | --- |
| Minimum max_tokens | 16000 |
| Recommended max_tokens | 16384–32768 |
| temperature | Fixed at 1.0 (set automatically) |

If you pass max_tokens below 16,000 with thinking enabled, Orchid automatically raises it to 16,000 and includes max_tokens_adjusted: true in the response metadata.
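
The adjustment rule above can be mirrored on the client side, so the caller knows in advance which limit will apply. This is a minimal sketch: the helper name `resolve_max_tokens` and the constant `THINKING_MIN_TOKENS` are hypothetical and not part of the Orchid API; only the 16,000 threshold comes from this page.

```python
THINKING_MIN_TOKENS = 16_000  # documented minimum for thinking mode


def resolve_max_tokens(requested: int, thinking: bool) -> tuple[int, bool]:
    """Return (max_tokens to send, whether the value was raised).

    Hypothetical client-side helper that mirrors the documented
    server-side adjustment; it does not call the API.
    """
    if thinking and requested < THINKING_MIN_TOKENS:
        return THINKING_MIN_TOKENS, True
    return requested, False


print(resolve_max_tokens(4096, thinking=True))    # (16000, True)
print(resolve_max_tokens(32768, thinking=True))   # (32768, False)
print(resolve_max_tokens(4096, thinking=False))   # (4096, False)
```

Setting the limit explicitly avoids relying on the automatic adjustment and makes the effective budget visible in your own logs.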

The orchid config object

Pass an orchid key alongside standard OpenAI fields to control Orchid-specific behaviour:
{
  "model": "orchid01",
  "messages": [...],
  "orchid": {
    "thinking":      false,
    "dehallucinate": true
  }
}

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| thinking | boolean | false | Enable deep reasoning mode |
| dehallucinate | boolean | true | Run the grounding check and return a grounding score in the response metadata |
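
When calling through an OpenAI-style Python client, the config object travels in `extra_body`, as in the thinking-mode example earlier on this page. The sketch below assembles the keyword arguments for such a call; the helper name `orchid_request_kwargs` is hypothetical, and the defaults simply restate the documented ones.

```python
from typing import Any


def orchid_request_kwargs(
    messages: list[dict[str, str]],
    *,
    thinking: bool = False,
    dehallucinate: bool = True,
) -> dict[str, Any]:
    """Build keyword arguments for client.chat.completions.create().

    Hypothetical helper: it only assembles a dict and sends nothing.
    """
    return {
        "model": "orchid01",
        "messages": messages,
        # Orchid-specific fields go under the "orchid" key of the request body.
        "extra_body": {
            "orchid": {"thinking": thinking, "dehallucinate": dehallucinate}
        },
    }


kwargs = orchid_request_kwargs(
    [{"role": "user", "content": "Summarise the attached term sheet."}]
)
print(kwargs["extra_body"])
# {'orchid': {'thinking': False, 'dehallucinate': True}}

# With a configured client (not created here):
# response = client.chat.completions.create(**kwargs)
```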

Response metadata

Every response includes an orchid field with grounding information:
{
  "choices": [...],
  "orchid": {
    "dehallucinate_requested": true,
    "grounded":                true,
    "score":                   0.97,
    "flagged_spans":           [],
    "checked":                 true
  }
}

| Field | Description |
| --- | --- |
| grounded | Whether all claims are supported by provided context |
| score | Confidence score 0.0–1.0 (1.0 = fully grounded) |
| flagged_spans | Specific claims that could not be verified |
| checked | Whether the grounding check actually ran |

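The check runs only when all three conditions hold: dehallucinate is true (the default), the response contains financial figures, and sufficient context was provided. If the check is skipped, the response reports checked: false and score: 1.0, so read checked before relying on score.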
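
Because a skipped check also reports a score of 1.0, client code may want to look at `checked` before anything else. The following is a minimal sketch operating on the `orchid` object as a plain dict; the function name `grounding_status` and its three return labels are hypothetical, and how the dict is obtained from the SDK response (for example via the response object's `model_extra`) is an assumption not confirmed by this page.

```python
def grounding_status(orchid_meta: dict) -> str:
    """Classify the documented grounding metadata.

    Returns "unchecked" if the check did not run, "grounded" if it ran
    and nothing was flagged, and "flagged" otherwise.
    """
    if not orchid_meta.get("checked", False):
        # score is 1.0 here by convention, not because claims were verified
        return "unchecked"
    if orchid_meta.get("grounded") and not orchid_meta.get("flagged_spans"):
        return "grounded"
    return "flagged"


# The example metadata shown above
ran = {
    "dehallucinate_requested": True,
    "grounded": True,
    "score": 0.97,
    "flagged_spans": [],
    "checked": True,
}
# A skipped check, as described above
skipped = {"checked": False, "score": 1.0}

print(grounding_status(ran))      # grounded
print(grounding_status(skipped))  # unchecked
```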