🏛️ SOVEREIGN AI AGENT OS & HARNESS // MCP & A2A NATIVE // SUPPORTED BY AGENT BENNY // ZERO TOKEN TAX

The Sovereign AI Agent OS.
Deterministic Harness for MCP, A2A & Institutional Reasoning.

Prime-Silo is the canonical Sovereign AI Agent OS and runtime harness where deterministic Pypes transformation algebra meets institutional cognition. Power autonomous agents with the open-standard Model Context Protocol (MCP) for tool connectivity and Agent-to-Agent (A2A) collaborative swarms—delivering auditable lineage without cloud gatekeepers or burning capital on context dumps.

╭─ 🕸️ TRI-GRAPH CAG & MEMO-RAY // LIVE INSTITUTIONAL NETWORK
● GRAPH ACTIVE // PULSES NOMINAL ─╮
│ 🔗 GRAPH LAYERS │
▶ 01. BOUNDARY GATE
02. DOCLING SPINE
03. VISION & SCOPE
04. BRIDGE COCKPIT UI
05. MEMO-RAY GRAPH
06. 6σ CHECKPOINTS
07. CLP LINEAGE
// L1 DETERMINISM BOUNDARY GATE
Active Gate: HMAC SHA-256 | Drift: 0.00% | Manifests Verified: 1,420
Cryptographic payload verification gate preventing silent background mutations before pipeline execution.
Document 01 // Determinism Boundary

ADR-001 // Split-Brain Execution Substrate

Prime-Silo forks the execution shell into two strict boundaries: a read-only deterministic execution zone governed by cryptographic HMAC SHA-256 signatures, and an exploratory read-write agent sandbox (agent_sandbox/views/). Zero silent background mutation or scope creep.

  • Cryptographic HMAC SHA-256 payload verification before execution gate
  • Strict separation between signed production pypes and sandbox exploration
  • Eliminates agentic hallucination drift by anchoring to immutable manifests
Document 02 // Vectorless Document Backbone

ADR-002 // PageIndex Vectorless RAG Spine

Replaces flat vector chunking and probabilistic similarity search with deterministic Docling hierarchical tree extraction. Queries resolve precise section nodes via the Denodo pattern, eliminating vector hallucinations and enabling Triple fan-out cross-document reasoning.

  • Hierarchical tree extraction preserves heading structure and table geometry
  • Zero-vector retrieval invariants guarantee 100% deterministic citations
  • Triple fan-out graph generation connects specifications to code symbols
Document 03 // Secure Ingestion & Credential Binding

ADR-003 // Vision Ingestion & Least-Privilege Scope

Unifies multi-modal document understanding (PDFs, architectural diagrams, tables) with cryptographic capability tokens. Every subagent is bound to a least-privilege credential scope, preventing unauthorized cross-workspace access or key leakage.

  • Vision-augmented parsing extracts high-fidelity structural data from complex layouts
  • Cryptographic capability tokens bind subagent permissions to specific workspaces
  • Zero credential leakage across institutional boundary walls
Document 04 // Token Economics & Local Routing

ADR-004 // Local Offload Orchestrator & Bridge Cockpit UI

Enforces the Green/Yellow/Red risk tier matrix and connects directly to the Bridge Cockpit Telemetry HUD. Frontend UI control elements (the 6 Lenses: Pulse, Memory, Documents, Code, Flows, Runs) interface with the local LAN pool (BENNY_LEMONADE_ENDPOINTS) via prime-silo-nexus MCP without raw code dumping or cloud context tax.

  • Bridge Cockpit UI control elements (6 Lenses) interface with backend graph via MCP
  • Green/Yellow/Red risk tier matrix automates local vs. cloud task routing
  • Digest Discipline: local workers return compact verification summaries
Document 05 // Memory Consolidation

ADR-005 // Longview Session Synthesis

Addresses the "unread history problem" across 111+ agent session directories. Longview runs continuously on local background models to synthesize past transcripts, implementation plans, and walkthroughs into a unified, queryable organizational memory graph.

  • Synthesizes multi-agent conversation histories without burning cloud planner tokens
  • Extracts reusable skills and architectural patterns from historical trajectories
  • Maintains continuous organizational awareness across development cycles
Requirements // Phase H & PBR-001

PBR-001 & REQ-H // Session Checkpoints & Portability

Establishes 6σ-safe operational invariants (≤ 3.4 defects per million ops). Before any Human-In-The-Loop decision or experimental analysis, operators stamp named restore points in their session—guaranteeing zero context contamination and instant rollback capability.

  • 6σ-safe quality target enforced across data integrity and path portability
  • Stamps named session checkpoints before HITL gates (benny checkpoint stamp)
  • Full external SSD portability (<SSD_ROOT>) with zero machine-dependent drift
Manual // Canonical Governance

GUIDE // Institutional Governance & Audit Lineage

The comprehensive operating manual and governance standard for Prime-Silo. Enforces Chronological Lineage Protocol (CLP) audit logs, automated redaction of PII, and full compliance reporting for enterprise engineering teams.

  • Chronological Lineage Protocol (CLP) records every prompt, tool call, and result
  • Automated PII redaction and compliance reporting by default
  • Plain-English governance workflows designed for institutional reliability
[ACT I: 3D WIREFRAME TRI-GRAPH SUBSTRATE]
SCROLL DOWN TO EXPLODE LAYERS ↴
Chapter I // Core Gate

The Deterministic HITL Signature Core

At the center of Prime-Silo lies the immutable manifest engine. No agent can execute code or mutate system state without explicit cryptographic authorization. The inner cylinder glows as a permanent read-only anchor against agentic hallucination and drift.

  • Cryptographic HMAC SHA-256 payload verification
  • Strict ADR-001 boundary separation (Signed vs Sandbox)
  • Zero silent scope creep or background mutation
Chapter II // Layer 0 Algebra

Pypes Transformation Prisms

Surrounding the core are our hexagonal data transformation prisms. Rather than unstructured scripts, data flows through rigorous staged pipelines: raw Bronze ingestion, validated Silver schema enforcement, and aggregated Gold institutional intelligence.

  • High-performance vectorized Polars dataframe engine
  • Automated outlier rejection and strict schema checking
  • Instant step-level checkpoint resumption (--resume <id>)
Chapter III // Denodo Pattern

Tri-Graph CAG & Data Virtualization

As we expand further outward, the concentric rings tilt and separate. This is our Tri-Graph Context-Augmented Generation mesh. Instead of copying data or dumping massive file trees into prompt windows, Prime-Silo virtualizes data in place—querying only atomic nodes across Knowledge, Code ASTs, and Memo-Ray memory graphs.

  • Up to 98% elimination of prompt context token tax
  • Tree-Sitter AST real-time function boundary indexing
  • Memo-Ray chronological entity mapping (Session ➔ Thought ➔ Artifact)
Chapter IV // Swarm Layer

Longview Swarm Worker Constellation

In full explosion mode, floating geometric nodes orbit the outer perimeter like weightless satellites. This represents our distributed Swarm Execution Layer. When complex tasks arise, Benny fans out parallel LAN worker nodes (`BENNY_LEMONADE_ENDPOINTS`) to synthesize code and analyze logs simultaneously, condensing results into single verified deliverables.

  • LAN worker auto-discovery across local Ryzen & ThinkPad hardware
  • Local CPU/GPU watchdog with thermal throttling guards
  • Unbroken chronological CLP audit ledger for 100% traceability

Zero Token Tax via Tri-Graph CAG

Traditional agents dump massive manifests and ASTs into every prompt, bleeding capital and latency. Prime-Silo virtualizes data in place (The Denodo Pattern), combining Knowledge RAG, Code ASTs, and Memo-Ray memory graphs to eliminate context bloat by up to 99%.

⚡ Institutional Feature Savings Toggles EMPIRICAL AUDIT PROOF
💡 Architecture Win: Local model pool LAN offloading via BENNY_LEMONADE_ENDPOINTS processes ~80% of queries locally at $0.00 cloud cost.
Legacy Cloud Agent Monthly Burn
$2,363/mo
~35k tokens/query (Full Manifest & AST Context Dumps)
Prime-Silo Local-First Tri-Graph Burn
$34/mo
~2.3k tokens/query + 80% Local LAN Offloading
Total Monthly Savings via Tri-Graph CAG
$2,329/mo
98% Token Tax Elimination
⚡ BRIDGE COCKPIT // INSTITUTIONAL TELEMETRY HUD LENS 01 // PULSE
[STATUS] All local systems nominal. 2 active LAN workers fanned out.
---
Host: ryzen.local:13305/api/v1   [ACTIVE] Model: Qwen-2.5-Coder-32B  Latency: 12ms  GPU: 18.4GB/24GB
Host: t480.local:13305/api/v1    [ACTIVE] Model: Llama-3-8B-Instruct  Latency: 28ms  CPU: 16 threads
---
[TELEMETRY] Local Cache Hit Rate: 88.4% | Tokens Saved This Session: 412,890
Lens 01 // Live Health

Pulse // LAN Cluster Telemetry

Real-time telemetry monitoring distributed Benny Lemonade endpoints, local RAM watchdogs, and zero-token-tax cache hit rates across your institutional mesh. Notice how scrolling automatically transitions the live background HUD!

  • BENNY_LEMONADE_ENDPOINTS LAN pool auto-discovery
  • Local CPU/GPU memory watchdog and thermal throttling guard
  • Cryptographic HMAC signing status monitor
Lens 02 // Chronology

Memory // Memo-Ray Entity Tree

Navigate institutional cognition through time. Every session, reasoning trace, and compiled artifact is structured into an auditable tree without external vector database costs.

  • Session ➔ Thought ➔ Artifact hierarchical lineage
  • Cross-session semantic similarity matching locally
  • Instant rollback to historical reasoning states
Lens 03 // Knowledge

Documents // TOGAF SAD RAG

Ingest PDFs, markdown, and compliance guidelines locally. Tree-Sitter parsing indexes document structure with zero external data leakage or privacy risks.

  • Local-first embedding generation via Quantized ONNX models
  • Section-level citation anchors with direct diff overlays
  • Automatic redaction of PII before intermediate synthesis
Lens 04 // AST Index

Code // Tree-Sitter Graph

Real-time structural indexing of your Python, Rust, and TypeScript repositories. Queries resolve exact AST nodes rather than dumping raw files into prompt windows.

  • Sub-millisecond AST boundary resolution for functions and classes
  • Automated impact analysis before code mutation
  • Seamless integration with local Benny Pypes execution engine
Lens 05 // Pipelines

Flows // Pypes Layer 0 Algebra

Design and monitor staged Bronze ➔ Silver ➔ Gold data pipelines. High-performance Polars execution with step-level checkpointing and outlier rejection.

  • Vectorized dataframe processing outperforming pandas 10x
  • Automatic schema enforcement and outlier rejection gates
  • Step-level checkpoint resume (--resume <id>)
Lens 06 // Ledgers

Runs // CLP Audit Ledger

Track parallel agentic swarms across local and remote clusters. The Chronological Lineage Protocol (CLP) logs every prompt, tool call, and result.

  • 100% reproducible execution logs with cryptographic checksums
  • Swarm worker fan-out monitoring with automatic retry logic
  • One-click export for regulatory compliance audits
╭─ 🧠 BENNY SUBSTRATE // RICH TUI TASK OFFLOADING CONSOLE
● SCROLL TO OFFLOAD TASKS ─╮
│ ⚡ TASKS OFFLOADED │
▶ 01. AST REFACTOR
02. REGULATORY AUDIT
03. LAN SWARM RESEARCH
04. SAD RAG INDEXING
05. CI/CD BLAST RADIUS
06. CRYPTO SEAL & PIN
│ // Welcome to Benny Execution Substrate v1.4.0 (Institutional Task Offloading Engine) │
│ // Scroll down to explore how enterprises offload complex engineering & compliance tasks... │
✉️ INSTITUTIONAL ONBOARDING // LIVE DEMO

Ready to Deploy Deterministic Institutional Cognition?

Stop bleeding capital on context dumps and non-deterministic agent loops. Schedule an architectural deep dive or request access to the Canonical Prime-Silo substrate for your enterprise.

✉️ Book a Live Demo // binary16.primesilo@gmail.com ⭐ Explore Code on GitHub
Neuro-Assist Ergonomics ADHD / DYSLEXIC
OpenDyslexic / Lexend Font
Bionic Reading Anchors
Expanded Line Spacing
💡 Tip: Combine with Audio Zen Mode in top navigation for maximum cognitive flow.