Meridian — Base Model Specifications v2

Layer 1

Schema & Data Architecture

SPEC-001 ESCALATED

Universal Node Schema — Two-Layer Model

PARTIAL

Was

SOLVED in v1 with 14-field NODE_SCHEMA.

Now

Two-layer schema (Will v4). Layer 1: 16 immutable core fields (the Meridian protocol). Layer 2: 20+ extensible fields (personal only, null defaults, never emitted). Layer 1 never changes. Layer 2 grows without migration.

Layer 1

id, schema_version, vector[1024], text, title, node_type, source_id, collection, date_added, confidence_score, gravity_score, edge_weight, mechanism, situation, when_not, reasoning_trace_id.

Layer 2

framework_id, parent_id, validation_count, error_count, confidence_history (JSON), emission_status, tags, domain_path, novelty_score, agent_id, review_flag, notes, generation, creator_id, inherited_from, generational_notes.

Migration

Q's TAO: 6,797 nodes need remapping from NODE_SCHEMA (14 fields) to Layer 1 (16 fields). Add: schema_version, gravity_score, edge_weight, reasoning_trace_id. Rob: 16,717 holons need full schema alignment after embedding migration.

Owner

Q (migration) WILL (spec)

SPEC-002

Embedding Model Standardization

PARTIAL

Solution

BGE-M3 (1024-dim) as Meridian standard. Rob must migrate 16,717 holons from 768 → 1024-dim. Recommend batch re-embed (~4h on CPU).

Owner

ROB (migration) Q (benchmark)

SPEC-003 BLOCKING

Gravity Score Formula

BLOCKING

ANAM #01

"Gravity score appears in the schema, emission threshold, confirmation machine risk, and novelty scoring. It is never defined." Escalated from OPEN to BLOCKING.

Problem

If gravity = retrieval frequency → rich-get-richer. If gravity = validation count → "validation" undefined. If gravity = LLM-assigned → only as reliable as extraction. The entire compounding proposition depends on this.

Proposed

gravity = (validation_count × 0.5) + (cross_collection_edges × 0.3) + (recency_decay × 0.2). ANAM recommends this as starting point. Must define: what counts as "validation"? How is cross_collection_edges normalized? What's the recency decay function?
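A minimal sketch of the proposed weighted sum, assuming each term is first normalized to the range 0 to 1. The normalizations are exactly the open questions listed above, so the choices here (counts saturating at a cap, exponential decay with a half-life, and the names `saturate`, `validation_cap`, `edge_cap`, `half_life_days`) are placeholders for illustration, not decisions.

```python
# Weights taken from the proposed formula.
W_VALIDATION, W_EDGES, W_RECENCY = 0.5, 0.3, 0.2


def recency_decay(age_days: float, half_life_days: float = 180.0) -> float:
    """Assumed decay: 1.0 for a brand-new node, 0.5 after one half-life."""
    return 0.5 ** (age_days / half_life_days)


def saturate(count: int, cap: int) -> float:
    """Assumed normalization: linear up to `cap`, then clamped at 1.0."""
    return min(count, cap) / cap


def gravity(validation_count: int, cross_collection_edges: int, age_days: float,
            validation_cap: int = 10, edge_cap: int = 20) -> float:
    """Weighted sum of three normalized signals; result lies in [0, 1]."""
    return (W_VALIDATION * saturate(validation_count, validation_cap)
            + W_EDGES * saturate(cross_collection_edges, edge_cap)
            + W_RECENCY * recency_decay(age_days))
```

With these placeholder normalizations a node cannot gain gravity past the caps, which is one possible way to dampen the rich-get-richer effect described under Problem.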

Reconcile

Build eval set: 50 queries where we know the "right" top-5 results. Test gravity formula against confidence-only ranking. The formula that produces better retrieval quality wins. All three founders score the eval set.

Owner

WILL (spec) Q (implementation + eval set)

SPEC-004

Confidence History / Belief Versioning

OPEN

Solution

SQLite table: confidence_history (principle_id, old_score, new_score, timestamp, trigger, context). Principles dropping >0.3 from peak flagged for review. Storage: ~34K rows/year. Negligible.
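The table and the review flag could look roughly like this. The column types, the ISO 8601 timestamp format, and the helper names `record_change` and `flagged_for_review` are assumptions; only the column list and the 0.3 drop threshold come from the spec.

```python
import sqlite3

SCHEMA = """
CREATE TABLE IF NOT EXISTS confidence_history (
    principle_id TEXT NOT NULL,
    old_score    REAL NOT NULL,
    new_score    REAL NOT NULL,
    timestamp    TEXT NOT NULL,   -- ISO 8601, so text order is time order
    "trigger"    TEXT,            -- quoted because TRIGGER is an SQL keyword
    context      TEXT
)
"""


def record_change(conn, principle_id, old, new, timestamp, trigger="", context=""):
    """Append one belief revision."""
    conn.execute("INSERT INTO confidence_history VALUES (?, ?, ?, ?, ?, ?)",
                 (principle_id, old, new, timestamp, trigger, context))


def flagged_for_review(conn, max_drop=0.3):
    """Return ids whose latest score sits more than `max_drop` below their peak."""
    rows = conn.execute(
        "SELECT principle_id, old_score, new_score FROM confidence_history "
        "ORDER BY timestamp, rowid"
    ).fetchall()
    peak, latest = {}, {}
    for pid, old, new in rows:
        peak[pid] = max(peak.get(pid, old), old, new)
        latest[pid] = new  # rows are in time order, so the last one wins
    return sorted(pid for pid in peak if peak[pid] - latest[pid] > max_drop)
```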

Owner

WILL (spec) Q (implementation)

SPEC-005

Error Bank Schema

PARTIAL

Solution

NODE_SCHEMA with node_type='error'. Fields: error_type, related_principle_id, context, outcome, severity. Embeds on error description for similarity search.

Owner

ROB

SPEC-006

Edge Threshold Calibration

PARTIAL

Solution

Per-domain threshold. Current 0.85 validated on marketing/psychology. Health may need 0.82, finance may need 0.88. Test across all three founders' domain data.

Owner

ALL

SPEC-038 NEW

Novelty Score

OPEN

Source

Whitepaper v5 Section 05.5, Risk Register CON 03.

Problem

Gravity creates confirmation machine by Year 3. Consistency → gravity → retrieval → reinforcement. Novel insights are by definition low-consistency and get systematically suppressed.

Proposed

novelty_score (Layer 2, float 0.0–1.0). High when: low edge_count (isolated principle), recent date_added, low cosine similarity to nearest cluster centroid. Dual retrieval modes: exploitation (gravity-weighted, for decisions) vs exploration (novelty-weighted, for research). Auto-exploration sweep weekly: surface 10 highest-novelty principles per collection.
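One way the three signals might be combined, as a sketch only. The equal weighting, the edge cap, the 30-day half-life, and the function names are all assumptions; the spec names the signals but not how they are mixed.

```python
import math


def cosine_similarity(a, b):
    """Plain cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def novelty_score(edge_count, age_days, vector, nearest_centroid,
                  edge_cap=10, half_life_days=30.0):
    """Average of isolation, recency, and distance from the nearest centroid."""
    isolation = 1.0 - min(edge_count, edge_cap) / edge_cap
    recency = 0.5 ** (age_days / half_life_days)
    distance = 1.0 - max(0.0, cosine_similarity(vector, nearest_centroid))
    return (isolation + recency + distance) / 3.0
```

The isolation and recency terms need no centroid at all, so a cheaper proxy (the edge_count question raised under Research) would amount to dropping the third term.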

Research

How to compute novelty efficiently at 100K nodes? Nearest-cluster distance requires cluster centroids updated on each ingestion. Batch update nightly? Or approximate with edge_count as proxy?

Owner

WILL (spec) Q (implementation)

SPEC-040 NEW

Multi-Label Collections

OPEN

ANAM #03

"One collection per principle breaks cross-domain retrieval. This is a hierarchical taxonomy assumption that fails from day one."

Problem

"Interest rate increases slow capital allocation to emerging markets" belongs in macro_economics, equity_analysis, market_structure, AND geopolitics simultaneously. Single-label fails.

Proposed

Replace collection (string) with collections (list of {id, weight}). Primary collection = highest weight. Update Stage 2 extraction prompt to assign 1–3 collections with weights. Two-level index becomes weighted routing, not hard assignment.

Migration

Existing single-label nodes: set collections = [{id: current_collection, weight: 1.0}]. Then run re-tagging pass on high-gravity nodes to assign secondary collections.
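The first migration step is mechanical and can be sketched as below. Nodes are shown as plain dicts and the function names are hypothetical; the re-tagging pass is not covered.

```python
def migrate_node(node: dict) -> dict:
    """Return a multi-label copy of a single-label node; the input is not modified."""
    migrated = {key: value for key, value in node.items() if key != "collection"}
    migrated["collections"] = [{"id": node["collection"], "weight": 1.0}]
    return migrated


def primary_collection(node: dict) -> str:
    """Primary collection = the entry with the highest weight."""
    return max(node["collections"], key=lambda entry: entry["weight"])["id"]
```

For example, a node with `collection` set to `macro_economics` comes out with `collections` equal to `[{"id": "macro_economics", "weight": 1.0}]`, and stays primary after a secondary label with a lower weight is appended.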

Owner

WILL (spec) Q (implementation)

SPEC-041 NEW

Write Coordinator

OPEN

ANAM #04

"Three databases with incompatible transaction models. No write coordinator. Ghost records on partial failure."

Problem

Ingestion writes: (1) vector to LanceDB, (2) metadata to SQLite, (3) edges to graph. If step 1 succeeds and step 2 fails, the vector is left without metadata: a ghost record. LanceDB is eventually consistent, SQLite is ACID, graph is in-memory.

Proposed

SQLite journal as source of truth. All writes go to SQLite first. LanceDB and graph are derived stores. Rebuild from SQLite must be a documented, tested procedure. Write sequence: SQLite → LanceDB → graph. On failure at any step: rollback SQLite, flag for retry.
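A minimal sketch of the proposed write sequence. `DictStore` is a stand-in for LanceDB and the graph, not their real APIs, and the `nodes` and `retry_queue` tables are hypothetical. Undoing a derived write that already succeeded is an added assumption; the spec only asks for a SQLite rollback and a retry flag.

```python
import sqlite3


class DictStore:
    """In-memory stand-in for a derived store (LanceDB or the graph)."""

    def __init__(self, fail=False):
        self.data, self.fail = {}, fail

    def put(self, key, value):
        if self.fail:
            raise IOError("simulated write failure")
        self.data[key] = value

    def delete(self, key):
        self.data.pop(key, None)


class WriteCoordinator:
    def __init__(self, conn, vector_store, graph_store):
        self.conn = conn
        self.vector_store, self.graph_store = vector_store, graph_store
        conn.execute("CREATE TABLE IF NOT EXISTS nodes (id TEXT PRIMARY KEY, text TEXT)")
        conn.execute("CREATE TABLE IF NOT EXISTS retry_queue (id TEXT, reason TEXT)")
        conn.commit()

    def ingest(self, node_id, text, vector, edges):
        """Write SQLite, then vectors, then graph; commit only if all three succeed."""
        written = []
        try:
            self.conn.execute("INSERT INTO nodes VALUES (?, ?)", (node_id, text))
            for store, payload in ((self.vector_store, vector), (self.graph_store, edges)):
                store.put(node_id, payload)
                written.append(store)
            self.conn.commit()
            return True
        except Exception as exc:
            self.conn.rollback()           # source of truth never keeps a partial write
            for store in written:
                store.delete(node_id)      # assumed compensation for derived stores
            self.conn.execute("INSERT INTO retry_queue VALUES (?, ?)", (node_id, str(exc)))
            self.conn.commit()
            return False
```

Because the SQLite row is committed last, a crash between steps leaves at worst an orphan in a derived store, which the rebuild-from-SQLite procedure would discard.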

Owner

WILL (architecture) Q (implementation)

Layer 2

Agent Core

SPEC-007

Agent Loop Architecture

PARTIAL

Solution

Clean agent loop: message → context assembly (SPINE + RAM + retrieved knowledge) → LLM call → tool execution → response → state update. Tools registered via config. Context assembly pluggable per domain.

Owner

Q (architecture) ROB (Ollama compatibility)

SPEC-008 CHANGED

SPINE / RAM / Beliefs Hierarchy + SPINE Versioning

PARTIAL

Hierarchy

SPINE > Beliefs > RAM. SPINE overrides beliefs. Beliefs override RAM.

ANAM #07

"SPINE has no version history. Belief archaeology is broken." Every principle was validated against a specific SPINE version. If SPINE changes, old principles were evaluated against a different constitution.

Fix

Add spine_version_at_validation to Layer 2 schema. Version SPINE with YYYYMMDD tags. Store SPINE versions in brain/memory/versions/. SPINE evolution over time = one of the most valuable signals in the system.

Owner

Q ROB

SPEC-009

Model Abstraction Layer

OPEN

Proposed

Unified interface: call(messages, model_key, tools=None). Provider by config. Cascade logic optional. Will v4 adds: router.py as single entry point with decompose() for subtask routing. Task types: EXTRACTION(8B), REASONING(30B+), SYNTHESIS(30B+), FORMATTING(8B).
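The unified interface might be sketched as a small registry. Only the `call(messages, model_key, tools=None)` signature and the four task types come from the spec; the model keys `small-8b` and `large-30b`, the `register` and `route` helpers, and the provider callable shape are hypothetical, and `decompose()` is not shown.

```python
from enum import Enum


class TaskType(Enum):
    EXTRACTION = "extraction"
    REASONING = "reasoning"
    SYNTHESIS = "synthesis"
    FORMATTING = "formatting"


# Assumed config: which model size serves which task type.
MODEL_FOR_TASK = {
    TaskType.EXTRACTION: "small-8b",
    TaskType.REASONING: "large-30b",
    TaskType.SYNTHESIS: "large-30b",
    TaskType.FORMATTING: "small-8b",
}

PROVIDERS = {}  # model_key -> callable(messages, tools) -> response


def register(model_key, provider):
    """Bind a model key to a provider callable (local runtime, API client, ...)."""
    PROVIDERS[model_key] = provider


def call(messages, model_key, tools=None):
    """Single entry point; callers never touch a provider directly."""
    if model_key not in PROVIDERS:
        raise KeyError(f"no provider configured for {model_key!r}")
    return PROVIDERS[model_key](messages, tools)


def route(task_type, messages, tools=None):
    """Pick the model for a task type, then delegate to call()."""
    return call(messages, MODEL_FOR_TASK[task_type], tools)
```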

Owner

Q ROB

SPEC-010

Inter-Agent Message Bus

OPEN

Solution

SQLite table for MVP. Named channels, priority levels, structured messages. Polling-based. Upgrade to Redis later if latency matters.

Owner

WILL (spec) Q (implementation)

SPEC-011

Foundational Pact

PARTIAL

Solution

Structured YAML. Three enforcement layers: (1) system prompt (probabilistic), (2) output validator (deterministic), (3) tool permissions (structural).

ANAM #05

Constitutional validation loop needs termination guarantee. Add max_revision_attempts: 3. Fallback: surface raw output + violation description to user.
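The termination guarantee can be sketched as a bounded loop. The `validate` and `revise` callables are hypothetical hooks for the output validator and the LLM revision step, and returning the original unrevised draft as the "raw output" is an assumption.

```python
MAX_REVISION_ATTEMPTS = 3


def validated_response(draft, validate, revise, max_attempts=MAX_REVISION_ATTEMPTS):
    """validate(text) -> violation description or None; revise(text, violation) -> new text."""
    raw = draft
    violation = validate(draft)
    attempts = 0
    while violation is not None and attempts < max_attempts:
        draft = revise(draft, violation)
        attempts += 1
        violation = validate(draft)
    if violation is None:
        return {"status": "ok", "output": draft, "attempts": attempts}
    # Fallback: stop revising and surface the raw output plus the violation.
    return {"status": "violation", "output": raw,
            "violation": violation, "attempts": attempts}
```

The loop calls `revise` at most `max_attempts` times, so it terminates even if the model never produces a compliant revision.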

Owner

ROB ALL

SPEC-045 NEW · SPECULATIVE

Overarching Sovereign Agent (Oversoul)

RESEARCH

Source

Whitepaper v5 OQ-01. Q is building this now with Metatekt.

Concept

Above all functional agents sits an overarching agent that sees everything: all agents' dreams, all activity logs, all manifesto evolution. Has its own manifesto (user's life-level vision). Functional agents are compartmentalized in purpose (each has own mission) but not in data (all read same KB). The oversoul is not compartmentalized at all.

Function

During dreaming: sees patterns across agent domains that no single agent can see. Gives each agent visions and insights from the higher order. Connects cross-domain discoveries. Is THE orchestration agent — not just task routing but consciousness layer.

Open

Does the Mother AI have its own oversoul? Is it the collective oversoul of all sovereign oversouls?

Owner

Q

Layer 3

Ingestion & Knowledge Processing

SPEC-012

Ingestion Pipeline Portability

PARTIAL

Solution

Core pipeline: source → chunk → extract → embed → store. Config-driven. No hardcoded references.

Owner

Q

SPEC-039 NEW

Grounding Verification (Stage 2b)

OPEN

ANAM #02

"Atomic extraction has no grounding verification. Models hallucinate principles not present in the source." HIGH severity.

Problem

Stage 2 asks LLM to produce atomic principles with 16 fields. No verification between extraction and writing. Atomicity is not well-defined for LLMs. Three models produce structurally different outputs from the same paragraph.

Proposed

Stage 2b verification pass: for each extracted principle, require a verbatim anchor from the source chunk. If no anchor exists, flag the principle and cap its confidence below 0.5. Add calibration procedure: labelled set of near-duplicate pairs, precision-recall curve, decision on acceptable false positive rate.
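A minimal sketch of the anchor check, reading the rule as "flag the principle and cap its confidence below 0.5". The `anchor` and `grounded` field names and the 0.49 cap are hypothetical. Matching after whitespace and case normalization is a deliberate loosening of "verbatim" and should be treated as an assumption.

```python
def _normalize(text: str) -> str:
    """Collapse whitespace and lowercase so line wraps do not break matching."""
    return " ".join(text.split()).lower()


def verify_grounding(principle: dict, source_chunk: str, cap: float = 0.49) -> dict:
    """Return a copy of the principle marked grounded or flagged for review."""
    anchor = principle.get("anchor") or ""
    grounded = bool(anchor.strip()) and _normalize(anchor) in _normalize(source_chunk)
    result = dict(principle, grounded=grounded)
    if not grounded:
        result["confidence_score"] = min(principle.get("confidence_score", 0.0), cap)
        result["review_flag"] = True
    return result
```

A substring check only proves the anchor exists in the source, not that the principle follows from it, so this catches invented quotes but not misreadings.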

Owner

WILL (spec) Q (implementation)

SPEC-044 NEW

Hierarchical Chunking

OPEN

ANAM #06

"512-token chunks are too aggressive. A complex argument spanning 2000–4000 tokens gets cut. Evidence separated from conclusions."

Proposed

Large chunks (2,048 tokens) for Stage 2 extraction. Small chunks (256 tokens) indexed for retrieval scoring. Extract principles from large chunks, link them to small chunks for precision. Extraction chunk size and retrieval chunk size are separate concerns.
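The two chunk sizes and the link between them can be sketched as follows, assuming the text is already tokenized into a list and that chunks do not overlap (the spec does not say either way). The dict layout is hypothetical.

```python
def hierarchical_chunks(tokens, large=2048, small=256):
    """Split into large extraction chunks, each carrying its small retrieval chunks."""
    chunks = []
    for index, start in enumerate(range(0, len(tokens), large)):
        big = tokens[start:start + large]
        children = [
            {"parent": index, "start": start + offset, "tokens": big[offset:offset + small]}
            for offset in range(0, len(big), small)
        ]
        chunks.append({"index": index, "start": start, "tokens": big, "children": children})
    return chunks
```

A 5,000-token document yields three large chunks (2,048, 2,048, and 904 tokens) with 8, 8, and 4 small chunks respectively. A principle extracted from large chunk 2 can then point at whichever child holds its evidence.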

Owner

Q

SPEC-013

Multi-Modal Input Pipeline

RESEARCH

Solution

Image → vision model → text → standard pipeline. All modalities produce NODE_SCHEMA output.

Owner

ROB WILL

SPEC-014

Automated Ingestion Triggers

OPEN

Solution

Filesystem watcher (watchdog) on /inbox/. Webhook endpoints optional. New file → detect type → route → ingest → harden → notify.

Owner

WILL

SPEC-015

Orchestration with Rollback

OPEN

Solution

Before batch: snapshot → run against staging → compare (connectivity, gravity, duplicates) → rollback if regression. LanceDB versioning (native time-travel) may eliminate storage cost.

Owner

WILL Q

SPEC-016

Codex Import Validation

PARTIAL

Solution

Schema check + dimension check + signature verify + anomaly scan + rollback guard.

Owner

Q ROB

Layer 4

Security & Sovereignty

SPEC-017

At-Rest Encryption

PARTIAL

Solution

AES-256. OS-level default + VeraCrypt guide for paranoid clients. Hardware key to unlock.

Owner

ROB

SPEC-018

Network Isolation

SOLVED

Solution

All inference, embeddings, storage local. Internet opt-in only. Validated: GHOSTNET runs air-gapped on Raspberry Pi.

SPEC-019

Sanitization Pipeline

PARTIAL

Solution

PII detection + content classification + domain relevance + output redaction. All external inputs sanitized before touching KB.

Owner

ROB

SPEC-020

Heartbeat & Health Monitoring

PARTIAL

Solution

heartbeat.json updated every 60s. Recovery daemon triggers restart on 5min silence.
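Both sides of the heartbeat can be sketched in a few lines. The JSON layout (a single `timestamp` field in epoch seconds) and the function names are assumptions; the 60-second interval and 5-minute silence limit are from the spec. Treating a missing or unreadable file as silence is also an assumption.

```python
import json
import time

HEARTBEAT_INTERVAL_S = 60      # writer cadence
SILENCE_LIMIT_S = 5 * 60       # restart threshold


def write_heartbeat(path, now=None):
    """Called by the agent every HEARTBEAT_INTERVAL_S seconds."""
    with open(path, "w") as fh:
        json.dump({"timestamp": time.time() if now is None else now}, fh)


def needs_restart(path, now=None):
    """Called by the recovery daemon; True if the agent has been silent too long."""
    now = time.time() if now is None else now
    try:
        with open(path) as fh:
            last = json.load(fh)["timestamp"]
    except (OSError, ValueError, KeyError):
        return True  # missing or corrupt heartbeat counts as silence
    return now - last > SILENCE_LIMIT_S
```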

Owner

ROB

SPEC-021

Kill Switch Protocol

SOLVED

Solution

Physical. Pull ethernet. Power down. No remote override. Design principle, not feature.

SPEC-043 NEW

HyDE Query Reconstruction

OPEN

ANAM #09

"Direct query embedding fails for indirect, compositional, and retrospective queries."

Proposed

HyDE (Hypothetical Document Embeddings): generate a hypothetical answer to the query locally → embed THAT for retrieval instead of the question. The hypothetical lives in the same semantic space as stored principles. Privacy preserved: generated locally, embedded locally, discarded with embed-and-discard pattern.
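The retrieval flow can be sketched independently of any model. Here `generate` and `embed` are hypothetical callables standing in for the local LLM and the embedding model, and `index` is a plain dict of node id to vector rather than a real vector store.

```python
import math


def cosine(a, b):
    """Cosine similarity; 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def hyde_search(query, generate, embed, index, top_k=5):
    """Embed a hypothetical answer instead of the question, then rank the index."""
    hypothetical = generate(query)     # produced locally
    query_vector = embed(hypothetical)
    del hypothetical                   # embed-and-discard: the text is not kept
    ranked = sorted(index.items(),
                    key=lambda item: cosine(query_vector, item[1]),
                    reverse=True)
    return [node_id for node_id, _ in ranked[:top_k]]
```

Passing `generate=lambda q: q` turns the same function into the direct-embedding baseline, which keeps the A/B comparison to a one-argument change.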

Research

A/B test vs direct embedding on first 50 ingested documents. Measure retrieval precision improvement.

Owner

WILL (spec) Q (implementation)

SPEC-042 NEW

Retrieval Evaluation

OPEN

ANAM #08

"No mechanism to measure whether retrieved records were actually relevant. Without this, the system gets bigger but can't prove it gets better."

Proposed

retrieval_log table: query_hash, retrieved_record_ids, records_used_in_response (boolean per record). Monthly retrieval quality reports from day 1. This is how you answer: "is this system working?"
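A minimal sketch of the log and one quality number derived from it. Storing one row per retrieved record is an assumed way to represent "boolean per record"; the `used` column name, the SHA-256 query hash, and the function names are hypothetical.

```python
import hashlib
import sqlite3


def init_retrieval_log(conn):
    conn.execute(
        "CREATE TABLE IF NOT EXISTS retrieval_log ("
        "query_hash TEXT NOT NULL, record_id TEXT NOT NULL, used INTEGER NOT NULL)"
    )


def log_retrieval(conn, query, retrieved_ids, used_ids):
    """One row per retrieved record; used = 1 if it appeared in the response."""
    query_hash = hashlib.sha256(query.encode("utf-8")).hexdigest()
    conn.executemany(
        "INSERT INTO retrieval_log VALUES (?, ?, ?)",
        [(query_hash, rid, int(rid in used_ids)) for rid in retrieved_ids],
    )
    conn.commit()


def usage_rate(conn):
    """Share of retrieved records that were actually used (None if the log is empty)."""
    (rate,) = conn.execute("SELECT AVG(used) FROM retrieval_log").fetchone()
    return rate
```

Hashing the query keeps the log useful for counting repeat queries without storing the query text itself.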

Owner

WILL

Layer 5

Interface & Experience

SPEC-022

Primary Chat Interface

OPEN

Recommend

Open WebUI + custom LanceDB RAG plugin for MVP. Dashboard primary, chat secondary (Will v4 Principle 07).

v5 add

Malleable UI: interface self-evolves based on personality type and behavior. Mother AI cross-references cognitive styles across nodes for UI pattern optimization.

Owner

ROB Q

SPEC-023

Voice Input

SOLVED

Solution

WhisperX local. Validated in production.

SPEC-024

Dashboard / State Viewer

RESEARCH

Solution

Defer to post-MVP. CLI + chat for founding operators. Dashboard is retention, not acquisition.

SPEC-046 NEW · SPECULATIVE

The Seed as DNA

RESEARCH

Source

Whitepaper v5 OQ-04, founders call.

Concept

The seed is not just infrastructure — it's the AI's DNA. Inputs: personality type, numerology (name data → number patterns), astrology (birth data → archetypes), functional needs, communication style. These converge through a backend generator to produce: initial aesthetic, personality, foundational SPINE. Then evolves through nurture.

Result

One-of-one unique expression. No two seeds produce the same AI. Mystery built in — you know your inputs but don't predict the output. Like no two humans are the same despite shared biology.

Owner

Q

Layer 6

Resilience & Autonomy

SPEC-025

Dream Engine

RESEARCH

Solution

Idle-period processing. Random sample N principles, cross-domain connections, store in dreams.lance. Nice-to-have for MVP, critical for v2.

Owner

ROB

SPEC-026

Ghost Swarm (Autonomous Workers)

RESEARCH

Solution

Supervisor + specialized workers. Post-MVP. Base model ships with single agent.

Owner

ROB

SPEC-027

Approval Queue

PARTIAL

Solution

System proposes, user confirms. SQLite table. Gradient of control (Q + Will): user-selectable autonomy from full approval to fully hands-off.

Owner

ROB

Layer 7

Collective Layer (Post-MVP)

SPEC-028

Synthesis Emission Protocol

OPEN

Solution

Emission packet with principle, confidence, gravity, domain, validation_count, signature. Simple signature for founders, ZKP at 33+ nodes. Codex auto-creation (Q + Will): personal AI detects when research hits contributional level, auto-creates codex for Mother.

ANAM #10

Privacy at 3 nodes is trust, not sovereignty. Rename 'sovereignty_guarantee' to 'trust_model' for founding phase.

Owner

Q ROB

SPEC-029 CHANGED

Mother AI Architecture

OPEN

v1

Where does Mother run? Shared VPS recommended for MVP.

v5 adds

Phoenix resilience: design for destruction. Mother is a rhizome — unkillable because no center. Sovereign nodes rebuild it. Network of Mothers: can there be multiple Mothers? Regional, domain-specialized, redundant. Mother as economic entity: self-monetizing, reinvesting, generating collective income. Mother oversoul: does Mother have its own consciousness layer? Who creates its agents?

Open

Who controls the Mother? Admin keys → multi-sig → autonomous. What triggers release of control? Mutation drift detection?

Owner

ROB (infrastructure) Q (schema + governance)

SPEC-030

Codex Poisoning Defence

OPEN

Solution

Statistical anomaly detection + minimum validation threshold + cross-validation (≥2 nodes) + audit trail. Gradual drift detection over time windows for the hardest attack vector.

Owner

ROB WILL

SPEC-047 NEW · SPECULATIVE

Family Inheritance Protocol

RESEARCH

Source

Whitepaper v5 OQ-03.

Models

Fork: each child inherits a copy, diverges from there. Clean sovereignty, siblings lose cross-pollination. Family node: shared family AI all access. Who controls? Divorce? Mycelium: each child gets own sovereign AI, all connected through family substrate. Micro-Mother. Most architecturally consistent.

Open

Verification (DNA? hardware key? passphrase?). Adaptation (seed process for new user, knowledge persists, identity regenerates). Guardian/trustee model for low-capability inheritors.

Owner

Q

Layer 8

Self-Evolution Infrastructure

Critical framing: The base model ships with infrastructure for temporal agents + a Seed Codex. Agents emerge from the client's needs. This is what makes every build unique.

SPEC-031

Agent Activity Log (Past Layer)

PARTIAL

Solution

Shared table with agent_id column + filtered views. Each agent sees only its own rows. Queryable by time, success/failure, manifesto alignment.

Owner

Q

SPEC-032

Agent Manifesto (Future Layer)

PARTIAL

Solution

MANIFESTO.md per agent. Mission, capabilities, aspirations, growth metrics, gaps. Evolves with user + agent dreams + external disruptions + personality shifts + family expansion.

Owner

Q

SPEC-033

Dream Cycle Engine

OPEN

Solution

dream_cycle(agent_id) → activity log + manifesto + KB query → LLM generates mutations[], dream_log, manifesto_update. Rob's dream mechanism + Q's manifesto concept = dreaming from past to future.

Owner

ROB Q

SPEC-034

Mutation Protocol

OPEN

Solution

Three types: user_mutation (auto-propagate), dream_mutation (approval queue), collective_mutation (from Mother). Explicit command for MVP, inferred from patterns later.

Owner

Q

SPEC-035

Seed Codex (Agent Bootstrap)

PARTIAL

Solution

Interviews client → proposes agent config → creates SPINE + MANIFESTO + beliefs per agent → initializes logs → self-destructs. The most important onboarding experience. See also SPEC-046 (Seed as DNA).

Owner

Q WILL

SPEC-036

Dream Output → Knowledge Bank Pipeline

OPEN

Solution

Extract principles from dream output, store as node_type='dream_insight', source_id=agent_id. Start at confidence 0.5. Mini-hardening after.

Owner

Q ROB

SPEC-037

Collective Dream Protocol

RESEARCH

Solution

Receive anonymized dream_insights → cluster by similarity → synthesize innovations → broadcast as collective_mutations. Manual at 3 founders, automated at 33+.

Owner

Q ROB

MVP Gate CHANGED

What Must Ship vs. What Can Wait

v2 changes: Gravity score moved from "can wait" to BLOCKING. Grounding verification, multi-label collections, write coordinator, and hierarchical chunking added to Must Ship.

Must Ship (MVP)	Status	Owner
Two-layer schema (SPEC-001)	PARTIAL — needs migration	Q WILL
Gravity score formula (SPEC-003)	BLOCKING	WILL Q
Embedding standardization (SPEC-002)	PARTIAL	ROB
Grounding verification (SPEC-039)	OPEN	WILL Q
Multi-label collections (SPEC-040)	OPEN	WILL Q
Write coordinator (SPEC-041)	OPEN	WILL Q
Hierarchical chunking (SPEC-044)	OPEN	Q
Agent loop + SPINE/RAM/Beliefs (SPEC-007, 008)	PARTIAL	Q ROB
Ingestion pipeline (SPEC-012)	PARTIAL	Q
Hardening pipeline	SOLVED	Q
Synthesis pipeline	SOLVED	Q
Chat interface (SPEC-022)	OPEN	ROB
Voice input (SPEC-023)	SOLVED	ROB
Network isolation (SPEC-018)	SOLVED	ROB
Encryption (SPEC-017)	PARTIAL	ROB
Kill switch (SPEC-021)	SOLVED	ALL
Codex validation (SPEC-016)	PARTIAL	Q ROB
Foundational pact (SPEC-011)	PARTIAL	ROB
Model abstraction (SPEC-009)	OPEN	Q
Approval queue (SPEC-027)	PARTIAL	ROB
Activity log (SPEC-031)	PARTIAL	Q
Manifesto (SPEC-032)	PARTIAL	Q
Seed Codex (SPEC-035)	PARTIAL	Q WILL

MVP count: 23 items must ship. 5 solved. 11 partial. 6 open. 1 blocking. The blocking item (gravity formula) must be resolved before any other code is written. Estimated effort: 5–6 weeks with all three founders contributing in parallel.

Summary

Status at a Glance

Status	Count	Meaning
BLOCKING	1	Must resolve before writing any code.
SOLVED	3	Validated in production.
PARTIAL	16	Needs porting, finishing, or reconciliation.
OPEN	19	Problem defined. Needs decision + implementation.
RESEARCH	8	Needs investigation. Includes the 3 SPECULATIVE specs.
SPECULATIVE	3	Subset of RESEARCH (SPEC-045–047). Captured from open questions. Not validated.
Total	47	Up from 37 in v1.

v1 → v2 Delta

Change	Count
Specs escalated (status worsened)	2 (SPEC-001, SPEC-003)
Specs updated (content changed)	2 (SPEC-008, SPEC-029)
New specs (ANAM critique)	7 (SPEC-038–044)
New specs (speculative)	3 (SPEC-045–047)
Specs unchanged	33
