Prime-Silo is the canonical Sovereign AI Agent OS and runtime harness where deterministic Pypes transformation algebra meets institutional cognition. Power autonomous agents with the open-standard Model Context Protocol (MCP) for tool connectivity and Agent-to-Agent (A2A) collaborative swarms—delivering auditable lineage without cloud gatekeepers or burning capital on context dumps.
Prime-Silo forks the execution shell into two strict boundaries: a read-only deterministic execution zone governed by cryptographic HMAC SHA-256 signatures, and an exploratory read-write agent sandbox (agent_sandbox/views/). Zero silent background mutation or scope creep.
Replaces flat vector chunking and probabilistic similarity search with deterministic Docling hierarchical tree extraction. Queries resolve precise section nodes via the Denodo pattern, eliminating vector hallucinations and enabling Triple fan-out cross-document reasoning.
Unifies multi-modal document understanding (PDFs, architectural diagrams, tables) with cryptographic capability tokens. Every subagent is bound to a least-privilege credential scope, preventing unauthorized cross-workspace access or key leakage.
Enforces the Green/Yellow/Red risk tier matrix and connects directly to the Bridge Cockpit Telemetry HUD. Frontend UI control elements (the 6 Lenses: Pulse, Memory, Documents, Code, Flows, Runs) interface with the local LAN pool (BENNY_LEMONADE_ENDPOINTS) via prime-silo-nexus MCP without raw code dumping or cloud context tax.
Addresses the "unread history problem" across 111+ agent session directories. Longview runs continuously on local background models to synthesize past transcripts, implementation plans, and walkthroughs into a unified, queryable organizational memory graph.
Establishes 6σ-safe operational invariants (≤ 3.4 defects per million ops). Before any Human-In-The-Loop decision or experimental analysis, operators stamp named restore points in their session—guaranteeing zero context contamination and instant rollback capability.
benny checkpoint stamp)<SSD_ROOT>) with zero machine-dependent driftThe comprehensive operating manual and governance standard for Prime-Silo. Enforces Chronological Lineage Protocol (CLP) audit logs, automated redaction of PII, and full compliance reporting for enterprise engineering teams.
At the center of Prime-Silo lies the immutable manifest engine. No agent can execute code or mutate system state without explicit cryptographic authorization. The inner cylinder glows as a permanent read-only anchor against agentic hallucination and drift.
Surrounding the core are our hexagonal data transformation prisms. Rather than unstructured scripts, data flows through rigorous staged pipelines: raw Bronze ingestion, validated Silver schema enforcement, and aggregated Gold institutional intelligence.
As we expand further outward, the concentric rings tilt and separate. This is our Tri-Graph Context-Augmented Generation mesh. Instead of copying data or dumping massive file trees into prompt windows, Prime-Silo virtualizes data in place—querying only atomic nodes across Knowledge, Code ASTs, and Memo-Ray memory graphs.
In full explosion mode, floating geometric nodes orbit the outer perimeter like weightless satellites. This represents our distributed Swarm Execution Layer. When complex tasks arise, Benny fans out parallel LAN worker nodes (`BENNY_LEMONADE_ENDPOINTS`) to synthesize code and analyze logs simultaneously, condensing results into single verified deliverables.
Traditional agents dump massive manifests and ASTs into every prompt, bleeding capital and latency. Prime-Silo virtualizes data in place (The Denodo Pattern), combining Knowledge RAG, Code ASTs, and Memo-Ray memory graphs to eliminate context bloat by up to 99%.
BENNY_LEMONADE_ENDPOINTS processes ~80% of queries locally at $0.00 cloud cost.
[STATUS] All local systems nominal. 2 active LAN workers fanned out. --- Host: ryzen.local:13305/api/v1 [ACTIVE] Model: Qwen-2.5-Coder-32B Latency: 12ms GPU: 18.4GB/24GB Host: t480.local:13305/api/v1 [ACTIVE] Model: Llama-3-8B-Instruct Latency: 28ms CPU: 16 threads --- [TELEMETRY] Local Cache Hit Rate: 88.4% | Tokens Saved This Session: 412,890
Real-time telemetry monitoring distributed Benny Lemonade endpoints, local RAM watchdogs, and zero-token-tax cache hit rates across your institutional mesh. Notice how scrolling automatically transitions the live background HUD!
Navigate institutional cognition through time. Every session, reasoning trace, and compiled artifact is structured into an auditable tree without external vector database costs.
Ingest PDFs, markdown, and compliance guidelines locally. Tree-Sitter parsing indexes document structure with zero external data leakage or privacy risks.
Real-time structural indexing of your Python, Rust, and TypeScript repositories. Queries resolve exact AST nodes rather than dumping raw files into prompt windows.
Design and monitor staged Bronze ➔ Silver ➔ Gold data pipelines. High-performance Polars execution with step-level checkpointing and outlier rejection.
Track parallel agentic swarms across local and remote clusters. The Chronological Lineage Protocol (CLP) logs every prompt, tool call, and result.
Stop bleeding capital on context dumps and non-deterministic agent loops. Schedule an architectural deep dive or request access to the Canonical Prime-Silo substrate for your enterprise.