|
Entropic 2.3.8
Local-first agentic inference engine
|
Model configuration for a single tier. More...
#include <entropic/types/config.h>

Public Attributes | |
| std::filesystem::path | path |
| Resolved model file path. | |
| std::string | adapter = "qwen35" |
| Chat adapter name. | |
| int | context_length = 16384 |
| Context window size (512–131072) | |
| int | gpu_layers = -1 |
| GPU offload layers (-1 = all) | |
| bool | keep_warm = false |
| Pre-warm model at startup. | |
| bool | use_mlock = true |
| Lock model in system RAM. | |
| int | reasoning_budget = -1 |
| Think token budget (-1 = unlimited) | |
| std::string | cache_type_k = "f16" |
| KV cache key quantization type. | |
| std::string | cache_type_v = "f16" |
| KV cache value quantization type. | |
| int | n_batch = 512 |
| Batch size for prompt processing. | |
| int | n_threads = 0 |
| CPU threads (0 = auto-detect) | |
| std::string | tensor_split |
| Multi-GPU tensor split ratios (empty = single GPU) | |
| bool | flash_attn = true |
| Enable flash attention. | |
| std::optional< std::vector< std::string > > | allowed_tools |
| Tool whitelist (nullopt = all) | |
| std::filesystem::path | mmproj_path |
| Vision projector GGUF path. | |
| std::string | model_format = "gguf" |
| Expected model format. | |
Model configuration for a single tier.
Contains all parameters needed to load and configure a model, including llama.cpp pass-through fields for KV cache, batching, threading, and attention.
| std::string entropic::ModelConfig::adapter = "qwen35" |
| std::optional<std::vector<std::string> > entropic::ModelConfig::allowed_tools |
| std::string entropic::ModelConfig::cache_type_k = "f16" |
| std::string entropic::ModelConfig::cache_type_v = "f16" |
| int entropic::ModelConfig::context_length = 16384 |
| bool entropic::ModelConfig::flash_attn = true |
| int entropic::ModelConfig::gpu_layers = -1 |
| bool entropic::ModelConfig::keep_warm = false |
| std::filesystem::path entropic::ModelConfig::mmproj_path |
| std::string entropic::ModelConfig::model_format = "gguf" |
Expected model format.
"gguf" (default), "axmodel", "onnx", or empty (auto-detect). The backend validates that the file matches the expected format during load(). Mismatch returns ENTROPIC_ERROR_LOAD_FAILED with a diagnostic message identifying the actual format.
| int entropic::ModelConfig::n_batch = 512 |
| int entropic::ModelConfig::n_threads = 0 |
| std::filesystem::path entropic::ModelConfig::path |
| int entropic::ModelConfig::reasoning_budget = -1 |
| std::string entropic::ModelConfig::tensor_split |
| bool entropic::ModelConfig::use_mlock = true |