|
Entropic 2.3.8
Local-first agentic inference engine
|
LlamaCppBackend — llama.cpp C API integration. More...
#include <entropic/inference/backend.h>#include "prompt_cache.h"#include <llama.h>#include <atomic>#include <chrono>#include <cstdint>#include <functional>#include <memory>#include <mutex>#include <string>#include <vector>

Go to the source code of this file.
Classes | |
| class | entropic::LlamaCppBackend |
| LlamaCppBackend — common llama.cpp patterns (15% layer). More... | |
Namespaces | |
| namespace | entropic |
| Activate model on GPU (WARM → ACTIVE). | |
LlamaCppBackend — llama.cpp C API integration.
Versioned subclass pattern: LlamaCppBackend provides common llama.cpp patterns (decode loop, sampler chain, tokenization). The pinned-commit subclass (LlamaCppBackend_b8420) overrides API-version-specific calls.
Internal to inference .so — not exposed across boundaries.
Definition in file llama_cpp_backend.h.