Meridian — Engineering
Base Model Specifications v2
Cross-referenced with Whitepaper v5, ANAM Architecture Critique, and founders call insights.
Parent
Whitepaper v5, Sections 04–06, 15, 17–18
Authors
Q · Rob · Will · METATEKT
Date
April 3, 2026
Status
Draft — 47 specs, 6 solved, 14 partial, 14 open, 7 research, 6 speculative

v2 Changes from v1

ESCALATEDSPEC-001 Schema: SOLVED → PARTIAL. Two-layer schema (16 core + 20 extensible) replaces 14-field spec.
ESCALATEDSPEC-003 Gravity: OPEN → BLOCKING. ANAM critique: must be defined before any code.
CHANGEDSPEC-008 SPINE/RAM: add SPINE versioning (ANAM #07).
CHANGEDSPEC-029 Mother: expanded with Phoenix model, network of Mothers, oversoul question.
NEWSPEC-038 Novelty Score — anti-confirmation-bias dual retrieval.
NEWSPEC-039 Grounding Verification (Stage 2b) — ANAM #02.
NEWSPEC-040 Multi-Label Collections — ANAM #03.
NEWSPEC-041 Write Coordinator — ANAM #04.
NEWSPEC-042 Retrieval Evaluation — ANAM #08.
NEWSPEC-043 HyDE Query Reconstruction — ANAM #09.
NEWSPEC-044 Hierarchical Chunking — ANAM #06.
NEWSPEC-045 Overarching Sovereign Agent (Oversoul) — speculative.
NEWSPEC-046 The Seed as DNA — speculative.
NEWSPEC-047 Family Inheritance Protocol — speculative.
ANAM Critique Integration: Will's architecture critique (10 issues, 1 blocking) has been cross-referenced against every spec. 8 of 10 issues directly affect the existing TAO system. Issues are mapped to specific spec numbers throughout this document.

Layer 1
Schema & Data Architecture
SPEC-001 ESCALATED
Universal Node Schema — Two-Layer Model
PARTIAL
Was
SOLVED in v1 with 14-field NODE_SCHEMA.
Now
Two-layer schema (Will v4). Layer 1: 16 immutable core fields (the Meridian protocol). Layer 2: 20+ extensible fields (personal only, null defaults, never emitted). Layer 1 never changes. Layer 2 grows without migration.
Layer 1
id, schema_version, vector[1024], text, title, node_type, source_id, collection, date_added, confidence_score, gravity_score, edge_weight, mechanism, situation, when_not, reasoning_trace_id.
Layer 2
framework_id, parent_id, validation_count, error_count, confidence_history (JSON), emission_status, tags, domain_path, novelty_score, agent_id, review_flag, notes, generation, creator_id, inherited_from, generational_notes.
Migration
Q's TAO: 6,797 nodes need remapping from NODE_SCHEMA (14 fields) to Layer 1 (16 fields). Add: schema_version, gravity_score, edge_weight, reasoning_trace_id. Rob: 16,717 holons need full schema alignment after embedding migration.
Owner
Q (migration) WILL (spec)
SPEC-002
Embedding Model Standardization
PARTIAL
Solution
BGE-M3 (1024-dim) as Meridian standard. Rob must migrate 16,717 holons from 768 → 1024-dim. Recommend batch re-embed (~4h on CPU).
Owner
ROB (migration) Q (benchmark)
SPEC-003 BLOCKING
Gravity Score Formula
BLOCKING
ANAM #01
"Gravity score appears in the schema, emission threshold, confirmation machine risk, and novelty scoring. It is never defined." Escalated from OPEN to BLOCKING.
Problem
If gravity = retrieval frequency → rich-get-richer. If gravity = validation count → "validation" undefined. If gravity = LLM-assigned → only as reliable as extraction. The entire compounding proposition depends on this.
Proposed
gravity = (validation_count × 0.5) + (cross_collection_edges × 0.3) + (recency_decay × 0.2). ANAM recommends this as starting point. Must define: what counts as "validation"? How is cross_collection_edges normalized? What's the recency decay function?
Reconcile
Build eval set: 50 queries where we know the "right" top-5 results. Test gravity formula against confidence-only ranking. The formula that produces better retrieval quality wins. All three founders score the eval set.
Owner
WILL (spec) Q (implementation + eval set)
SPEC-004
Confidence History / Belief Versioning
OPEN
Solution
SQLite table: confidence_history (principle_id, old_score, new_score, timestamp, trigger, context). Principles dropping >0.3 from peak flagged for review. Storage: ~34K rows/year. Negligible.
Owner
WILL (spec) Q (implementation)
SPEC-005
Error Bank Schema
PARTIAL
Solution
NODE_SCHEMA with node_type='error'. Fields: error_type, related_principle_id, context, outcome, severity. Embeds on error description for similarity search.
Owner
ROB
SPEC-006
Edge Threshold Calibration
PARTIAL
Solution
Per-domain threshold. Current 0.85 validated on marketing/psychology. Health may need 0.82, finance may need 0.88. Test across all three founders' domain data.
Owner
ALL
SPEC-038 NEW
Novelty Score
OPEN
Source
Whitepaper v5 Section 05.5, Risk Register CON 03.
Problem
Gravity creates confirmation machine by Year 3. Consistency → gravity → retrieval → reinforcement. Novel insights are by definition low-consistency and get systematically suppressed.
Proposed
novelty_score (Layer 2, float 0.0–1.0). High when: low edge_count (isolated principle), recent date_added, low cosine similarity to nearest cluster centroid. Dual retrieval modes: exploitation (gravity-weighted, for decisions) vs exploration (novelty-weighted, for research). Auto-exploration sweep weekly: surface 10 highest-novelty principles per collection.
Research
How to compute novelty efficiently at 100K nodes? Nearest-cluster distance requires cluster centroids updated on each ingestion. Batch update nightly? Or approximate with edge_count as proxy?
Owner
WILL (spec) Q (implementation)
SPEC-040 NEW
Multi-Label Collections
OPEN
ANAM #03
"One collection per principle breaks cross-domain retrieval. This is a hierarchical taxonomy assumption that fails from day one."
Problem
"Interest rate increases slow capital allocation to emerging markets" belongs in macro_economics, equity_analysis, market_structure, AND geopolitics simultaneously. Single-label fails.
Proposed
Replace collection (string) with collections (list of {id, weight}). Primary collection = highest weight. Update Stage 2 extraction prompt to assign 1–3 collections with weights. Two-level index becomes weighted routing, not hard assignment.
Migration
Existing single-label nodes: set collections = [{id: current_collection, weight: 1.0}]. Then run re-tagging pass on high-gravity nodes to assign secondary collections.
Owner
WILL (spec) Q (implementation)
SPEC-041 NEW
Write Coordinator
OPEN
ANAM #04
"Three databases with incompatible transaction models. No write coordinator. Ghost records on partial failure."
Problem
Ingestion writes: (1) vector to LanceDB, (2) metadata to SQLite, (3) edges to graph. If step 1 succeeds and step 2 fails = ghost record. LanceDB is eventually consistent, SQLite is ACID, graph is in-memory.
Proposed
SQLite journal as source of truth. All writes go to SQLite first. LanceDB and graph are derived stores. Rebuild from SQLite must be a documented, tested procedure. Write sequence: SQLite → LanceDB → graph. On failure at any step: rollback SQLite, flag for retry.
Owner
WILL (architecture) Q (implementation)

Layer 2
Agent Core
SPEC-007
Agent Loop Architecture
PARTIAL
Solution
Clean agent loop: message → context assembly (SPINE + RAM + retrieved knowledge) → LLM call → tool execution → response → state update. Tools registered via config. Context assembly pluggable per domain.
Owner
Q (architecture) ROB (Ollama compatibility)
SPEC-008 CHANGED
SPINE / RAM / Beliefs Hierarchy + SPINE Versioning
PARTIAL
Hierarchy
SPINE > Beliefs > RAM. SPINE overrides beliefs. Beliefs override RAM.
ANAM #07
"SPINE has no version history. Belief archaeology is broken." Every principle was validated against a specific SPINE version. If SPINE changes, old principles were evaluated against a different constitution.
Fix
Add spine_version_at_validation to Layer 2 schema. Version SPINE with YYYYMMDD tags. Store SPINE versions in brain/memory/versions/. SPINE evolution over time = one of the most valuable signals in the system.
Owner
Q ROB
SPEC-009
Model Abstraction Layer
OPEN
Proposed
Unified interface: call(messages, model_key, tools=None). Provider by config. Cascade logic optional. Will v4 adds: router.py as single entry point with decompose() for subtask routing. Task types: EXTRACTION(8B), REASONING(30B+), SYNTHESIS(30B+), FORMATTING(8B).
Owner
Q ROB
SPEC-010
Inter-Agent Message Bus
OPEN
Solution
SQLite table for MVP. Named channels, priority levels, structured messages. Polling-based. Upgrade to Redis later if latency matters.
Owner
WILL (spec) Q (implementation)
SPEC-011
Foundational Pact
PARTIAL
Solution
Structured YAML. Three enforcement layers: (1) system prompt (probabilistic), (2) output validator (deterministic), (3) tool permissions (structural).
ANAM #05
Constitutional validation loop needs termination guarantee. Add max_revision_attempts: 3. Fallback: surface raw output + violation description to user.
Owner
ROB ALL
SPEC-045 NEW · SPECULATIVE
Overarching Sovereign Agent (Oversoul)
RESEARCH
Source
Whitepaper v5 OQ-01. Q is building this now with Metatekt.
Concept
Above all functional agents sits an overarching agent that sees everything: all agents' dreams, all activity logs, all manifesto evolution. Has its own manifesto (user's life-level vision). Functional agents are compartmentalized in purpose (each has own mission) but not in data (all read same KB). The oversoul is not compartmentalized at all.
Function
During dreaming: sees patterns across agent domains that no single agent can see. Gives each agent visions and insights from the higher order. Connects cross-domain discoveries. Is THE orchestration agent — not just task routing but consciousness layer.
Open
Does the Mother AI have its own oversoul? Is it the collective oversoul of all sovereign oversouls?
Owner
Q

Layer 3
Ingestion & Knowledge Processing
SPEC-012
Ingestion Pipeline Portability
PARTIAL
Solution
Core pipeline: source → chunk → extract → embed → store. Config-driven. No hardcoded references.
Owner
Q
SPEC-039 NEW
Grounding Verification (Stage 2b)
OPEN
ANAM #02
"Atomic extraction has no grounding verification. Models hallucinate principles not present in the source." HIGH severity.
Problem
Stage 2 asks LLM to produce atomic principles with 16 fields. No verification between extraction and writing. Atomicity is not well-defined for LLMs. Three models produce structurally different outputs from the same paragraph.
Proposed
Stage 2b verification pass: for each extracted principle, require a verbatim anchor from the source chunk. If no anchor exists, flag confidence < 0.5. Add calibration procedure: labelled set of near-duplicate pairs, precision-recall curve, decision on acceptable false positive rate.
Owner
WILL (spec) Q (implementation)
SPEC-044 NEW
Hierarchical Chunking
OPEN
ANAM #06
"512-token chunks are too aggressive. A complex argument spanning 2000–4000 tokens gets cut. Evidence separated from conclusions."
Proposed
Large chunks (2,048 tokens) for Stage 2 extraction. Small chunks (256 tokens) indexed for retrieval scoring. Extract principles from large chunks, link them to small chunks for precision. Extraction chunk size and retrieval chunk size are separate concerns.
Owner
Q
SPEC-013
Multi-Modal Input Pipeline
RESEARCH
Solution
Image → vision model → text → standard pipeline. All modalities produce NODE_SCHEMA output.
Owner
ROB WILL
SPEC-014
Automated Ingestion Triggers
OPEN
Solution
Filesystem watcher (watchdog) on /inbox/. Webhook endpoints optional. New file → detect type → route → ingest → harden → notify.
Owner
WILL
SPEC-015
Orchestration with Rollback
OPEN
Solution
Before batch: snapshot → run against staging → compare (connectivity, gravity, duplicates) → rollback if regression. LanceDB versioning (native time-travel) may eliminate storage cost.
Owner
WILL Q
SPEC-016
Codex Import Validation
PARTIAL
Solution
Schema check + dimension check + signature verify + anomaly scan + rollback guard.
Owner
Q ROB

Layer 4
Security & Sovereignty
SPEC-017
At-Rest Encryption
PARTIAL
Solution
AES-256. OS-level default + VeraCrypt guide for paranoid clients. Hardware key to unlock.
Owner
ROB
SPEC-018
Network Isolation
SOLVED
Solution
All inference, embeddings, storage local. Internet opt-in only. Validated: GHOSTNET runs air-gapped on Raspberry Pi.
SPEC-019
Sanitization Pipeline
PARTIAL
Solution
PII detection + content classification + domain relevance + output redaction. All external inputs sanitized before touching KB.
Owner
ROB
SPEC-020
Heartbeat & Health Monitoring
PARTIAL
Solution
heartbeat.json updated every 60s. Recovery daemon triggers restart on 5min silence.
Owner
ROB
SPEC-021
Kill Switch Protocol
SOLVED
Solution
Physical. Pull ethernet. Power down. No remote override. Design principle, not feature.
SPEC-043 NEW
HyDE Query Reconstruction
OPEN
ANAM #09
"Direct query embedding fails for indirect, compositional, and retrospective queries."
Proposed
HyDE (Hypothetical Document Embeddings): generate a hypothetical answer to the query locally → embed THAT for retrieval instead of the question. The hypothetical lives in the same semantic space as stored principles. Privacy preserved: generated locally, embedded locally, discarded with embed-and-discard pattern.
Research
A/B test vs direct embedding on first 50 ingested documents. Measure retrieval precision improvement.
Owner
WILL (spec) Q (implementation)
SPEC-042 NEW
Retrieval Evaluation
OPEN
ANAM #08
"No mechanism to measure whether retrieved records were actually relevant. Without this, the system gets bigger but can't prove it gets better."
Proposed
retrieval_log table: query_hash, retrieved_record_ids, records_used_in_response (boolean per record). Monthly retrieval quality reports from day 1. This is how you answer: "is this system working?"
Owner
WILL

Layer 5
Interface & Experience
SPEC-022
Primary Chat Interface
OPEN
Recommend
Open WebUI + custom LanceDB RAG plugin for MVP. Dashboard primary, chat secondary (Will v4 Principle 07).
v5 add
Malleable UI: interface self-evolves based on personality type and behavior. Mother AI cross-references cognitive styles across nodes for UI pattern optimization.
Owner
ROB Q
SPEC-023
Voice Input
SOLVED
Solution
WhisperX local. Validated in production.
SPEC-024
Dashboard / State Viewer
RESEARCH
Solution
Defer to post-MVP. CLI + chat for founding operators. Dashboard is retention, not acquisition.
SPEC-046 NEW · SPECULATIVE
The Seed as DNA
RESEARCH
Source
Whitepaper v5 OQ-04, founders call.
Concept
The seed is not just infrastructure — it's the AI's DNA. Inputs: personality type, numerology (name data → number patterns), astrology (birth data → archetypes), functional needs, communication style. These converge through a backend generator to produce: initial aesthetic, personality, foundational SPINE. Then evolves through nurture.
Result
One-of-one unique expression. No two seeds produce the same AI. Mystery built in — you know your inputs but don't predict the output. Like no two humans are the same despite shared biology.
Owner
Q

Layer 6
Resilience & Autonomy
SPEC-025
Dream Engine
RESEARCH
Solution
Idle-period processing. Random sample N principles, cross-domain connections, store in dreams.lance. Nice-to-have for MVP, critical for v2.
Owner
ROB
SPEC-026
Ghost Swarm (Autonomous Workers)
RESEARCH
Solution
Supervisor + specialized workers. Post-MVP. Base model ships with single agent.
Owner
ROB
SPEC-027
Approval Queue
PARTIAL
Solution
System proposes, user confirms. SQLite table. Gradient of control (Q + Will): user-selectable autonomy from full approval to fully hands-off.
Owner
ROB

Layer 7
Collective Layer (Post-MVP)
SPEC-028
Synthesis Emission Protocol
OPEN
Solution
Emission packet with principle, confidence, gravity, domain, validation_count, signature. Simple signature for founders, ZKP at 33+ nodes. Codex auto-creation (Q + Will): personal AI detects when research hits contributional level, auto-creates codex for Mother.
ANAM #10
Privacy at 3 nodes is trust, not sovereignty. Rename 'sovereignty_guarantee' to 'trust_model' for founding phase.
Owner
Q ROB
SPEC-029 CHANGED
Mother AI Architecture
OPEN
v1
Where does Mother run? Shared VPS recommended for MVP.
v5 adds
Phoenix resilience: design for destruction. Mother is a rhizome — unkillable because no center. Sovereign nodes rebuild it. Network of Mothers: can there be multiple Mothers? Regional, domain-specialized, redundant. Mother as economic entity: self-monetizing, reinvesting, generating collective income. Mother oversoul: does Mother have its own consciousness layer? Who creates its agents?
Open
Who controls the Mother? Admin keys → multi-sig → autonomous. What triggers release of control? Mutation drift detection?
Owner
ROB (infrastructure) Q (schema + governance)
SPEC-030
Codex Poisoning Defence
OPEN
Solution
Statistical anomaly detection + minimum validation threshold + cross-validation (≥2 nodes) + audit trail. Gradual drift detection over time windows for the hardest attack vector.
Owner
ROB WILL
SPEC-047 NEW · SPECULATIVE
Family Inheritance Protocol
RESEARCH
Source
Whitepaper v5 OQ-03.
Models
Fork: each child inherits a copy, diverges from there. Clean sovereignty, siblings lose cross-pollination. Family node: shared family AI all access. Who controls? Divorce? Mycelium: each child gets own sovereign AI, all connected through family substrate. Micro-Mother. Most architecturally consistent.
Open
Verification (DNA? hardware key? passphrase?). Adaptation (seed process for new user, knowledge persists, identity regenerates). Guardian/trustee model for low-capability inheritors.
Owner
Q

Layer 8
Self-Evolution Infrastructure
Critical framing: The base model ships with infrastructure for temporal agents + a Seed Codex. Agents emerge from the client's needs. This is what makes every build unique.
SPEC-031
Agent Activity Log (Past Layer)
PARTIAL
Solution
Shared table with agent_id column + filtered views. Each agent sees only its own rows. Queryable by time, success/failure, manifesto alignment.
Owner
Q
SPEC-032
Agent Manifesto (Future Layer)
PARTIAL
Solution
MANIFESTO.md per agent. Mission, capabilities, aspirations, growth metrics, gaps. Evolves with user + agent dreams + external disruptions + personality shifts + family expansion.
Owner
Q
SPEC-033
Dream Cycle Engine
OPEN
Solution
dream_cycle(agent_id) → activity log + manifesto + KB query → LLM generates mutations[], dream_log, manifesto_update. Rob's dream mechanism + Q's manifesto concept = dreaming from past to future.
Owner
ROB Q
SPEC-034
Mutation Protocol
OPEN
Solution
Three types: user_mutation (auto-propagate), dream_mutation (approval queue), collective_mutation (from Mother). Explicit command for MVP, inferred from patterns later.
Owner
Q
SPEC-035
Seed Codex (Agent Bootstrap)
PARTIAL
Solution
Interviews client → proposes agent config → creates SPINE + MANIFESTO + beliefs per agent → initializes logs → self-destructs. The most important onboarding experience. See also SPEC-046 (Seed as DNA).
Owner
Q WILL
SPEC-036
Dream Output → Knowledge Bank Pipeline
OPEN
Solution
Extract principles from dream output, store as node_type='dream_insight', source_id=agent_id. Start at confidence 0.5. Mini-hardening after.
Owner
Q ROB
SPEC-037
Collective Dream Protocol
RESEARCH
Solution
Receive anonymized dream_insights → cluster by similarity → synthesize innovations → broadcast as collective_mutations. Manual at 3 founders, automated at 33+.
Owner
Q ROB

MVP Gate CHANGED
What Must Ship vs. What Can Wait
v2 changes: Gravity score moved from "can wait" to BLOCKING. Grounding verification, multi-label collections, write coordinator added to Must Ship. Hierarchical chunking added to Must Ship.
Must Ship (MVP)StatusOwner
Two-layer schema (SPEC-001)PARTIAL — needs migrationQ WILL
Gravity score formula (SPEC-003)BLOCKINGWILL Q
Embedding standardization (SPEC-002)PARTIALROB
Grounding verification (SPEC-039)OPENWILL Q
Multi-label collections (SPEC-040)OPENWILL Q
Write coordinator (SPEC-041)OPENWILL Q
Hierarchical chunking (SPEC-044)OPENQ
Agent loop + SPINE/RAM/Beliefs (SPEC-007, 008)PARTIALQ ROB
Ingestion pipeline (SPEC-012)PARTIALQ
Hardening pipelineSOLVEDQ
Synthesis pipelineSOLVEDQ
Chat interface (SPEC-022)OPENROB
Voice input (SPEC-023)SOLVEDROB
Network isolation (SPEC-018)SOLVEDROB
Encryption (SPEC-017)PARTIALROB
Kill switch (SPEC-021)SOLVEDALL
Codex validation (SPEC-016)PARTIALQ ROB
Foundational pact (SPEC-011)PARTIALROB
Model abstraction (SPEC-009)OPENQ
Approval queue (SPEC-027)PARTIALROB
Activity log (SPEC-031)PARTIALQ
Manifesto (SPEC-032)PARTIALQ
Seed Codex (SPEC-035)PARTIALQ WILL
MVP count: 23 specs must ship. 5 solved. 12 partial. 6 open. 1 blocking. The blocking item (gravity formula) must be resolved before any other code is written. Estimated effort: 5–6 weeks with all three founders contributing in parallel.

Summary
Status at a Glance
StatusCountMeaning
BLOCKING1Must resolve before writing any code.
SOLVED5Validated in production.
PARTIAL14Needs porting, finishing, or reconciliation.
OPEN14Problem defined. Needs decision + implementation.
RESEARCH7Needs investigation.
SPECULATIVE6Captured from open questions. Not validated.
Total47Up from 37 in v1.

v1 → v2 Delta

ChangeCount
Specs escalated (status worsened)2 (SPEC-001, SPEC-003)
Specs updated (content changed)2 (SPEC-008, SPEC-029)
New specs (ANAM critique)7 (SPEC-038–044)
New specs (speculative)3 (SPEC-045–047)
Specs unchanged33