Entropic 2.3.8
Local-first agentic inference engine
inference Directory Reference
[Figure: directory dependency graph for inference]

Directories

 adapters
 

Files

 adapter_manager.cpp
 AdapterManager implementation — LoRA lifecycle and hot-swap.
 
 backend.cpp
 InferenceBackend base class implementation.
 
 grammar_registry.cpp
 GrammarRegistry implementation — named grammar management.
 
 image_preprocessor.cpp
 Image preprocessing implementation.
 
 inference_c_api.cpp
 C API wrappers for InferenceBackend.
 
 interface_factory.cpp
 InferenceInterface factory — bridges orchestrator to C callbacks.
 
 llama_cpp_backend.cpp
 LlamaCppBackend implementation — direct llama.cpp C API.
 
 llama_cpp_backend.h
 LlamaCppBackend — llama.cpp C API integration.
 
 orchestrator.cpp
 ModelOrchestrator implementation.
 
 profile_registry.cpp
 ProfileRegistry implementation – named GPU resource profiles.
 
 prompt_cache.cpp
 PromptCache implementation — LRU eviction, thread-safe.
 
 prompt_cache.h
 Host-memory KV cache state storage with LRU eviction.
 
 secondary_model_loader.cpp
 SecondaryModelLoader implementation.
 
 speculative_compat.cpp
 Implementation of the target/draft model compatibility check for speculative decoding.
 
 throughput_tracker.cpp
 ThroughputTracker implementation – EWMA throughput measurement.
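
The EWMA (exponentially weighted moving average) measurement named for throughput_tracker.cpp can be illustrated with a short sketch. This is not the actual ThroughputTracker code or API: the class name `EwmaThroughput`, its methods, and the choice to seed the average with the first sample are all assumptions made for illustration. The idea is that each new tokens-per-second sample is blended into a running value with weight `alpha`, so recent runs count more than older ones.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical sketch of EWMA throughput tracking.
// Names and behavior are illustrative, not the Entropic API.
class EwmaThroughput {
public:
    // alpha in (0, 1]: higher values weight recent samples more heavily.
    explicit EwmaThroughput(double alpha) : alpha_(alpha) {
        assert(alpha > 0.0 && alpha <= 1.0);
    }

    // Record one generation run: tokens produced and elapsed seconds.
    void record(std::size_t tokens, double seconds) {
        if (seconds <= 0.0) {
            return;  // skip degenerate samples rather than divide by zero
        }
        const double sample = static_cast<double>(tokens) / seconds;
        if (!seeded_) {
            // Assumption: the first sample seeds the average directly.
            value_ = sample;
            seeded_ = true;
        } else {
            value_ = alpha_ * sample + (1.0 - alpha_) * value_;
        }
    }

    bool has_value() const { return seeded_; }
    double tokens_per_second() const { return value_; }

private:
    double alpha_;
    double value_ = 0.0;
    bool seeded_ = false;
};
```

With `alpha = 0.5`, a first run of 100 tokens in 1 s followed by a run of 400 tokens in 2 s (a 200 tokens/s sample) yields an estimate of 0.5 * 200 + 0.5 * 100 = 150 tokens/s. A real tracker shared across threads would also need synchronization, which this sketch omits.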