Error types for cross-.so error reporting.
entropic_error_t
Error codes returned by all C API functions.
Activate model on GPU (WARM → ACTIVE).
Result of a single generation call.
bool time_limited
True if generation was terminated by the time limit rather than by EOS, a stop sequence, or max_tokens.
entropic_error_t error_code
Error code (ENTROPIC_OK if no error).
double swap_ms
Model swap time.
double routing_ms
Router classification time.
bool ok() const
True if generation completed without error.
double generation_time_ms
Wall-clock generation time.
int seq_id
Sequence identifier for multi-sequence backends.
double throughput_tok_s
Measured throughput for this generation (tok/s).
std::string raw_content
Raw model output before adapter processing.
std::string finish_reason
Finish reason: "stop", "length", "error".
int original_max_tokens
Original max_tokens before auto-adaptation reduced it.
std::string content
Generated text (cleaned by adapter).
std::vector< ToolCall > tool_calls
Tool calls parsed from content.
std::string error_message
Error description (empty if no error).
int token_count
Generated token count.
double total_ms
Total end-to-end time.
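The members listed above can be read together as one result record. The following is a minimal sketch of how such a record might be declared and inspected, not the library's actual definition. The struct name GenerationResult, the ToolCall fields, the enumerator ENTROPIC_ERROR_GENERIC, the numeric value of ENTROPIC_OK, the body of ok(), and the helper summarize() are all assumptions made for illustration; only the member names and types come from this reference.

```cpp
#include <string>
#include <vector>

// Hypothetical stand-in: only ENTROPIC_OK is named in the reference; its value
// and the second enumerator are assumptions for this sketch.
enum entropic_error_t { ENTROPIC_OK = 0, ENTROPIC_ERROR_GENERIC = 1 };

// Hypothetical stand-in: the real ToolCall layout is not shown in the reference.
struct ToolCall {
    std::string name;       // assumed field
    std::string arguments;  // assumed field
};

// Sketch of the result record. The type name is an assumption; the members
// mirror the documented names and types.
struct GenerationResult {
    std::string content;               // generated text, cleaned by adapter
    std::string raw_content;           // raw model output before adapter processing
    std::vector<ToolCall> tool_calls;  // tool calls parsed from content
    std::string finish_reason;         // "stop", "length", "error"
    int token_count = 0;               // generated token count
    int original_max_tokens = 0;       // max_tokens before auto-adaptation
    int seq_id = 0;                    // sequence id for multi-sequence backends
    bool time_limited = false;         // terminated by time limit
    double generation_time_ms = 0.0;   // wall-clock generation time
    double routing_ms = 0.0;           // router classification time
    double swap_ms = 0.0;              // model swap time
    double total_ms = 0.0;             // total end-to-end time
    double throughput_tok_s = 0.0;     // measured throughput (tok/s)
    entropic_error_t error_code = ENTROPIC_OK;
    std::string error_message;         // empty if no error

    // Assumed implementation: "no error" is taken to mean error_code == ENTROPIC_OK.
    bool ok() const { return error_code == ENTROPIC_OK; }
};

// Hypothetical helper: builds a one-line summary, checking ok() before
// trusting any of the other fields.
std::string summarize(const GenerationResult& r) {
    if (!r.ok()) {
        return "error: " + r.error_message;
    }
    std::string s = r.finish_reason;
    if (r.time_limited) {
        s += " (time limit)";
    }
    return s + ", " + std::to_string(r.token_count) + " tokens";
}
```

A caller would typically test ok() first and read error_message on failure, and only then look at content, finish_reason, and the timing fields.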