|
Entropic 2.3.8
Local-first agentic inference engine
|

Directories | |
| adapters | |
Files | |
| adapter_manager.cpp | |
| AdapterManager implementation — LoRA lifecycle and hot-swap. | |
| backend.cpp | |
| InferenceBackend base class implementation. | |
| grammar_registry.cpp | |
| GrammarRegistry implementation — named grammar management. | |
| image_preprocessor.cpp | |
| Image preprocessing implementation. | |
| inference_c_api.cpp | |
| C API wrappers for InferenceBackend. | |
| interface_factory.cpp | |
| InferenceInterface factory — bridges orchestrator to C callbacks. | |
| llama_cpp_backend.cpp | |
| LlamaCppBackend implementation — direct llama.cpp C API. | |
| llama_cpp_backend.h | |
| LlamaCppBackend — llama.cpp C API integration. | |
| orchestrator.cpp | |
| ModelOrchestrator implementation. | |
| profile_registry.cpp | |
| ProfileRegistry implementation – named GPU resource profiles. | |
| prompt_cache.cpp | |
| PromptCache implementation — LRU eviction, thread-safe. | |
| prompt_cache.h | |
| Host-memory KV cache state storage with LRU eviction. | |
| secondary_model_loader.cpp | |
| SecondaryModelLoader implementation. | |
| speculative_compat.cpp | |
| Implementation of the target/draft compatibility check. | |
| throughput_tracker.cpp | |
| ThroughputTracker implementation – EWMA throughput measurement. | |