|
Entropic 2.3.8
Local-first agentic inference engine
|
Implementation of the target/draft compatibility check. More...
#include <entropic/inference/speculative_compat.h>#include <llama.h>#include <algorithm>#include <cstring>#include <optional>#include <string>
Go to the source code of this file.
Namespaces | |
| namespace | entropic |
| Activate model on GPU (WARM → ACTIVE). | |
Functions | |
| CompatResult | entropic::speculative::check_compat (const llama_model *target, const llama_model *draft) |
| Check whether a draft model can pair with a target for sequential speculative decoding. | |
Implementation of the target/draft compatibility check.
Mirrors the file-private common_speculative_are_compatible function from extern/llama.cpp/common/speculative.cpp plus an entropic-side recurrent-architecture gate. See the header for the full rationale.
Definition in file speculative_compat.cpp.
| CompatResult entropic::speculative::check_compat | ( | const llama_model * | target, |
| const llama_model * | draft | ||
| ) |
Check whether a draft model can pair with a target for sequential speculative decoding.
Compatibility orchestrator.
Mirrors the logic of llama.cpp's file-private common_speculative_are_compatible (in common/speculative.cpp) and additionally enforces entropic's architecture gate:
SPEC_VOCAB_MAX_SIZE_DIFFERENCE in llama.cpp).[SPEC_VOCAB_CHECK_START_TOKEN_ID=5, min(n_vocab_tgt, n_vocab_dft)).| target | Target (verifier) llama_model handle. Must be non-null. |
| draft | Draft (proposer) llama_model handle. Must be non-null. |
compatible=true and empty diagnostic on success; compatible=false with a specific diagnostic string identifying the first failed rule on failure. @utility | target | Target (verifier) model. |
| draft | Draft (proposer) model. |
Definition at line 271 of file speculative_compat.cpp.