GHOST_PROTOCOL.md — Canonical Ghost Protocol Specification¶

Last updated: April 13, 2026 (Omega v2.7) Supersedes: GHOST_PROTOCOL_UNIFIED.md (March 24), ghost_protocol_v2_1_LOCKED.md, ghost_protocol_v3_0_spec.md, sprint_B12_ghost_protocol.md, epsilon_jtask_update.md, all earlier per-skill-pricing specs, all references to Qwen 3.5 9B as the Ghost server model, all references to ElevenLabs/WaveNet/Google Cloud TTS as the Ghost voice engine.

Newer sprint/handover content overrides older content. Where a sprint file and this doc disagree on a tactical server detail, the sprint file wins until this doc is republished. Where a locked decision in CLAUDE.md contradicts anything below, CLAUDE.md wins.

1. WHAT GHOST PROTOCOL IS¶

Ghost Protocol is alaivOS's private infrastructure layer. It is a server controlled by Citerius Holdings LLC that sits between Laiv and the outside world. Every time Laiv needs capability beyond the phone, it routes through Ghost — anonymously, with zero logging.

One sentence: Ghost is Laiv on steroids — a more powerful brain, smarter search, a richer voice, and overnight Checkup analysis. All powered by your own private server.

The privacy promise: Personal data never leaves the phone. When Laiv reaches out, Ghost strips identity from every request. Search engines never see IPs. Voice services never see accounts. Anthropic never sees raw user data (dual anonymization).

Cardinal rule (IMMUTABLE): Ghost tiers differ ONLY in credit allocation. All Ghost capabilities — Brain, Deep Search, Voice HD, Agent research, Agent cart fill, Agent transactional — are available at EVERY Ghost tier. Credits are the only gate. Laiv Checkup is NOT credit-gated — it is tier-baked (see §7).

2. LOCKED DECISIONS¶

Ghost server model = Gemma 4 E4B Q4_K_M on CX43. 12 tok/s, 10 GB loaded RAM, native function calling verified EN/ES/PT, native audio input, native vision. 2× faster than Qwen 3.5 9B on same hardware. Qwen 3.5 9B preserved in Ollama as fallback.
Qwen 3.5 9B on Ghost is DEAD as the primary model. All references to "Ghost uses Qwen 9B" in older docs are superseded.
Ghost voice = Kokoro 82M (StyleTTS 2, Apache 2.0, 54 voices, 8 languages, CPU-only). ElevenLabs, WaveNet, Google Cloud TTS are DEAD in production — kept only as historical training references in the Bishop pipeline.
Credits ONLY gate in-app Ghost skills. No per-skill pricing. No capability gating.
Laiv Checkup is tier-baked, not credit-gated. Cadence by tier (§7.2). Day 28 trial Checkup is FREE regardless of subscription status.
Dual anonymization on Checkup: device PII strip → Gemma 4 E4B narrative digest → Anthropic Batch API. No raw data ever reaches Anthropic.
Anthropic API account = Citerius Holdings LLC ($20 initial credits). Sonnet 4.6 Batch API @ $1.50/$7.50 per MTok.
Thinking mode OFF by default on Gemma 4 E4B to prevent 30-second Instant response spikes. Off at server (/api/chat) + toggle at device.
CX43 stays (€17/mo, Helsinki). Gemma 4 E4B at 12 tok/s makes Ghost viable on current hardware. Scaling triggers in §11.
Cloud Gemini on-device is DEAD. AiProvider enum = 2 values: local, ghost. (gemini, openai, claude removed from on-device path. Anthropic is Checkup-only server-side.)
E2EE between device and Ghost for chat/call/Checkup payloads (Signal Protocol for chat, AES-256-GCM for Checkup payloads, TLS 1.3 base). Ghost relay cannot read E2EE chat.
Checkup return_token model: device generates UUID v4, Ghost cannot map token to user. 72-hour auto-delete.
Voice pipeline inversion — Kokoro-first: EL reference → Kokoro style vector → Kokoro generates reference corpus → fine-tune Piper (on-device ONNX) + Voxtral 3B zero-shot (Ghost HD v1.1+). Users never hear the ElevenLabs voice; Kokoro IS Laiv's voice.
17 tool definitions for native Gemma 4 E4B function calling (Alpha-A, TAW10). Laiv skill execution uses these.

3. CREDIT MODEL (Launch)¶

3.1 Two credit types¶

Type	Symbol	When consumed	Processing	Cost to Citerius
Deferred (D)	🌙	Overnight batch (midnight–6 AM, WiFi + charging)	Queued, no priority	~$0.005/credit
Instant (I)	⚡	Real-time, immediate	Priority	~$0.008/credit

Graceful degradation: Instant exhausted → queue to Deferred. Deferred exhausted → on-device + DDG only. Ghost never goes dark.

3.2 Ghost subscription tiers¶

Standard:

Tier	Price	D	I	Annual (10 for 12)
Ghost	$3.99/mo	50	20	$39.90/yr
Ghost Plus	$7.99/mo	80	80	$79.90/yr
Ghost Max	$14.99/mo	100	300	$149.90/yr

Deferred-only (35% off):

Tier	Price	D
Ghost Def.	$2.59/mo	70
Ghost Plus Def.	$5.19/mo	160
Ghost Max Def.	$9.74/mo	400

Elite ($23.99) includes Ghost base (50D / 20I). Regional: LatAm ~30% below USD. Ghost Def entry: ~MX$32/mo.

3.3 Credit burn¶

Capability	D	I	Notes
Ghost Brain (deep reasoning)	1	1	Complex questions beyond on-device
Ghost Search: Deep (Brave)	1	1	DDG insufficient
Ghost Voice HD (Kokoro)	—	1/min	Always real-time
Ghost Agent: Research [v1.2+]	3	3	Browse, compare, report
Ghost Agent: Interactive [v1.2+]	—	5	Cart fill, form fill
Ghost Agent: Transactional [v1.2+]	—	8–15	Checkout with rules
Traffic: live anomaly	—	1	Only when cache flags anomaly
Traffic: cached patterns	0	0	Always free
Morning Pulse (Ghost-enhanced)	1	—	Core+ local = FREE
Laiv Checkup	0	0	Tier-baked, NOT credit-gated

Always FREE: DDG Standard Search, on-device AI, cached traffic patterns, cached price reads, all local data ops, Laiv Checkup runs.

3.4 Credit packs (overage)¶

In-app purchase. Instant-only. Never expire. FIFO.

Pack	Price	I
Small	$1.99	15
Medium	$4.99	50
Large	$9.99	120

3.5 Rollover¶

None at launch. Consider soft cap (20% of monthly, max 1 month) as retention lever post-launch if underuse data warrants it.

3.6 RevenueCat products¶

Product ID	Price	Grants
`ghost_standard`	$3.99/mo	50D + 20I
`ghost_plus`	$7.99/mo	80D + 80I
`ghost_max`	$14.99/mo	100D + 300I
`ghost_deferred`	$2.59/mo	70D + 0I
`ghost_plus_deferred`	$5.19/mo	160D + 0I
`ghost_max_deferred`	$9.74/mo	400D + 0I
`ghost_credits_small/medium/large`	$1.99/$4.99/$9.99	15/50/120 I consumable
`ghost_voice_custom`	$7.99	1 voice non-consumable

Entitlements (2 total): ghost_active (any ghost_* sub OR Elite), ghost_voice_custom. All capabilities are credit-gated, not entitlement-gated.

4. GENERAL INTELLIGENCE LAYER (FREE — all tiers)¶

Before Ghost's paid capabilities, every user gets basic intelligence via anonymous search.

4.1 DDG Standard Search (Cloudflare Worker)¶

Phone → Cloudflare Worker (search.alaivos.com, global edge, <50ms) → DDG Instant Answer API → back. Cost to Citerius: $0.

Tier	DDG/day
Starter	5
Spark	15
Core	30
Pro	50
Elite	Unlimited

Cache: Weather/scores 1h TTL; facts/definitions 24h. Key = SHA-256 of query (no user info).

4.2 Emergency fallback (sacred)¶

DDG always works for emergency queries (medical/safety keywords, crisis protocol, explicit "emergency"). Bypasses all limits. We NEVER strand a user in an emergency.

5. GHOST SERVER (ghost-01, Hetzner CX43, Helsinki)¶

5.1 Hardware¶

Host: ghost-01, Hetzner CX43, 46.62.149.145, Helsinki
Specs: 8 vCPU, 16 GB RAM, 240 GB NVMe, Ubuntu 24.04
Kernel: 6.8.0-107 (updated April 13, pending reboot)
Cost: €17/mo
Domain: ghost.alaivos.com (Let's Encrypt)

5.2 Services¶

Service	Port	Purpose	Status
Ollama	11434	Gemma 4 E4B inference (fallback: Qwen 3.5 9B)	Running
ghost-router	internal :11435	nginx → Ollama model-injection proxy (default `gemma4:e4b`)	Running
sports-cache	8300	31-league multi-sport cache (1h TTL, stale-on-error)	Running
checkup-relay	8100	Checkup Relay (FastAPI) — Gemma anonymizer → Anthropic Batch	Running (awaits API key)
Kokoro TTS	(HTTP shim)	Ghost Voice HD	Eval done, awaiting voice pick
nginx	443	Reverse proxy, SSL, auth (X-Ghost-Token)	Running, headers updated for `gemma4:e4b`
coturn	3478 / 5349	WebRTC TURN relay (NAT traversal, E2EE-transparent)	Running
Traffic collector	cron	290-city pipeline feed	Running

Ollama config: OLLAMA_HOST=0.0.0.0, OLLAMA_KEEP_ALIVE=-1 (model persistent in RAM). Thinking mode OFF by default for Instant routes.

5.3 Endpoints¶

Endpoint	Purpose	Auth
`GET /`	Banner	None
`GET /api/health`	Health + routing info	None
`GET /api/tags`	Ollama tags passthrough	`X-Ghost-Token`
`POST /api/chat`	Ghost Brain: Instant (thinking OFF)	Token + `ghost_active`
`POST /api/chat/deferred`	Ghost Brain: Deferred (thinking ON allowed)	Token + `ghost_active`
`POST /api/search/deep`	Deep Search (Brave)	Token + `ghost_active`
`POST /api/tts/hd`	Ghost Voice HD (Kokoro 82M)	Token + `ghost_active`
`POST /api/tts/train`	Custom Voice training [v1.1+]	Token + payment verify
`POST /api/agent/research`	Agent: Research [v1.2+]	Token + `ghost_active`
`POST /api/agent/interactive`	Agent: Interactive [v1.2+]	Token + `ghost_active`
`POST /api/agent/transactional`	Agent: Transactional [v1.2+]	Token + `ghost_active`
`POST /api/traffic/pattern`	Cached traffic patterns	Token
`POST /api/traffic/live`	Live anomaly check	Token + `ghost_active`
`POST /api/checkup/submit`	Queue Checkup (anonymized)	Token
`GET /api/checkup/result/<return_token>`	Poll result	Token
`POST /api/checkup/aggregate`	Capsule stub [v1.1+, returns 501]	Token
`POST /api/sports/*`	Sports cache (31 leagues)	Token

Auth model: app-level bearer token rotated per release (X-Ghost-Token header). Entitlement verified via RevenueCat receipt. No user accounts on Ghost — validates "this phone has a valid sub" without knowing WHO.

5.4 Scaling triggers¶

Trigger	Action	Monthly cost
>500 Ghost subscribers	Evaluate second CX43 node OR upgrade to CX51	+€17–35
>1,000 Deferred active	Dedicated GPU node for overnight Brain	~€230 (GEX44 €184 net-of-discounts)
Agent launch (v1.2+)	Dedicated NanoClaw container node	+€35–70
>10,000 total Ghost subs	Split: Brain / TTS / Agent / Traffic on separate nodes	+€300–500

5.5 Disk budget (year 1)¶

Traffic snapshots ~6 GB · Patterns ~600 MB · Agent cache ~1.2 GB · Brave cache ~2.4 GB · Checkup ephemeral (72h) ~<100 MB · Total ~10 GB (~4% of 240 GB NVMe). No concern.

6. GHOST PROTOCOL LIFECYCLE (request flow)¶

6.1 In-app Ghost Brain (Instant)¶

Device                             Ghost (ghost-01)
──────                             ─────────────────
User asks Laiv something on-device
  └→ Can local (Qwen 3.5) handle?
     NO, user has ghost_active
  └→ POST /api/chat
     X-Ghost-Token: <rotated>
     {messages, thinking: false}
                                   nginx → ghost-router → Ollama(gemma4:e4b)
                                   12 tok/s, 10 GB loaded RAM
                                   Native function calling EN/ES/PT
                                   Response streamed back
  └→ Credit burned (1 I)
  └→ Laiv synthesizes with local personal context
  └→ User sees answer, "no fanfare"

6.2 In-app Ghost Brain (Deferred / overnight)¶

Evening: Laiv queues deferred queries to device NightShift.
Midnight–6 AM (WiFi + charging): device POSTs /api/chat/deferred.
  Gemma 4 E4B processes (thinking ON allowed).
  Results cached locally. Surfaced in Morning Pulse + Thinking Room.
  1 D burned per query.

6.3 Laiv Checkup (overnight, tier-baked, NOT credit)¶

Device collects domain data (Wellbeing / Planning / Financial).
Device applies rigid PII strip (no names, no exact dates, categories only).
Device generates return_token (UUID v4).
Device POST /api/checkup/submit {return_token, domain, checkup_type, data_summary}.
  No user ID, no device ID.
  Ghost queues in /opt/ghost/data/checkup_queue.db.

Cron at 02:00 UTC (checkup_batch.py):
  For each pending row:
    1. anonymize_with_gemma(): Gemma 4 E4B rewrites JSON as ~500-word
       anonymized narrative digest. Strips implicit PII code rules miss.
       ~60–70% token reduction before Anthropic.
    2. Build Anthropic Batch API request (Sonnet 4.6, system prompt by
       domain × checkup_type, custom_id = return_token).
  Submit batch to Anthropic (Batch API, 50% off standard rate).
  Poll every 5 min until batch ends (typically 1–4 h, SLA <24 h).
  Store result in checkup_results keyed by return_token.

Device polls GET /api/checkup/result/<return_token>.
  Receives {summary, insights, recommendations, patterns_detected,
  watch_items, next_checkup_note}.
  Stores permanently in local SQLite.
  Results auto-deleted from server after 72 h.

Per-checkup cost to Citerius: ~$0.012–0.015.
At 10k paid users × avg ~6–12 checkups/yr: ~$600–900/yr total.

6.4 Voice HD (Kokoro) — Ghost¶

Device sends text + voice_id to POST /api/tts/hd.
Ghost runs Kokoro 82M (StyleTTS 2 architecture, CPU, ~2–5 s per utterance).
Streams WAV/Opus back. 1 I/min burned.
Fallback chain on failure: Custom Std (on-device) → Laiv Std (bundled) →
Platform TTS → text-only.

6.5 Cart fill / Agent [v1.2+]¶

Device → Ghost: {task, store, items, budget_max}.
NanoClaw spawns ephemeral Docker container (outbound-only internet,
no access to Ghost filesystem/Ollama/DBs).
Container: headless Chromium + store-skill module performs browse/search/add.
Container returns {cart_total, found, missing, substitutions}.
Container destroyed. Credit burned (8 I cached / 15 I fresh).
User confirms or cancels on device. Ghost never checks out autonomously
outside rule-based transactional mode.

7. LAIV CHECKUP (v1.0 feature)¶

7.1 Three domains¶

Wellbeing — sleep, exercise, nutrition, mood, cross-signal correlations.
Planning — goals, calendar density, projects, learning sessions, focus, habits.
Financial Health — category spending, budget adherence, recurring expenses, unusual spikes, savings trend.

Each domain has baseline / mid_trial / full / followup prompt variants (10 prompt files in /opt/ghost/prompts/).

7.2 Cadence (tier-baked, not credit)¶

Tier	Cadence
Starter	— (no recurring)
Spark	Every 6 months (2/yr)
Core	Every 3 months (4/yr)
Pro	Every 2 months (6/yr)
Elite	Monthly (12/yr)

7.3 Trial flow (21-day trial)¶

Day 0 (post-onboarding) — Baseline Checkup (Planning-only, built from onboarding data). Submitted immediately, result ready in the first Morning Briefing on Day 1. Hook set instantly.
Day 14 — Mid-trial (Planning updated + Wellbeing baseline). Delivered alongside Elite trial unlock.
Day 28 — FULL Checkup (all 3 domains). FREE regardless of subscription status. The conversion moment. CTA is "subscribe to keep these coming," not "pay to see this." Checkup history stays in local SQLite forever.

7.4 Privacy chain (dual anonymization)¶

Device code-strip: rigid rules remove names, specific dates (day-of-week only), locations, merchants, account numbers. Outputs structured JSON with categories + aggregates.
Gemma 4 E4B on Ghost: rewrites into ~500-word anonymized narrative digest. Catches implicit PII code rules miss ("my Tuesday marketing standup" → "a recurring mid-week work meeting"). ~60–70% token reduction.
Anthropic Batch API (Sonnet 4.6): receives the lean double-anonymized digest. No user identity possible. Overnight processing.
Return: Ghost stores result against return_token only. Device polls, fetches, stores locally. Server auto-deletes after 72 h.

7.5 Hints between checkups¶

Beta-2 "Checkup Teaser Hints" (35 tests): surfaces light pattern nudges in the days between full checkups to maintain the felt value of the feature.

8. VOICE ARCHITECTURE¶

8.1 Tiers¶

Voice	Setup	Monthly	Runs on	Available
Text only	—	free	—	Starter
Laiv Voice Standard	—	free w/ tier	Device (Piper ONNX bundled in APK)	Spark+
Laiv Voice HD	—	1 I/min in Ghost	Ghost (Kokoro 82M)	Ghost subs
Custom Voice Std	$7.99 one-time	free w/ tier	Device (downloaded)	Spark+
Custom Voice HD	same training	1 I/min in Ghost	Ghost	Ghost subs

No third-party TTS in production. ElevenLabs/WaveNet/Google Cloud TTS are DEAD at runtime.

8.2 Kokoro-first pipeline (inversion)¶

ElevenLabs custom voice (existing ref audio, never shipped to users)
  └→ Bishop extracts StyleTTS 2 style vector
Kokoro 82M "Laiv voice" = canonical reference (what users actually hear)
  └→ generates reference corpus (500–1000 sentences EN/ES/PT)
     ├→ Fine-tune Piper VITS → ONNX on-device (v1.0 if Bishop ready)
     └→ Voxtral 3B zero-shot embedding → Ghost HD v1.1+

Rationale: Users never heard the EL voice. Kokoro's approximation IS the first voice they hear. Piper trained to match Kokoro (not EL) → quality degrades gracefully. Voxtral clones Kokoro output for Ghost HD — same person, frontier quality.

8.3 Fallback chain¶

Custom HD (Ghost) → Custom Std (on-device if downloaded) → Laiv HD (Kokoro on Ghost) → Laiv Std (Piper ONNX bundled) → Platform TTS → text-only (Starter).

8.4 Voice during trial¶

No Ghost access during trial. Standard voice only. Starter expiry = silence. This is the strongest upgrade motivator in the app.

Cross-reference: VOICE_AND_AI.md for full pipeline, model selection, and AMI dynamic loading behaviour.

9. LAIV ROUTING INTELLIGENCE¶

9.1 Decision tree¶

User triggers query / workflow
 │
 ▼ Can on-device (Qwen 3.5) handle?    YES → local, FREE
 │
 ▼ Factual lookup?                     YES → DDG (FREE, rate-limited by tier)
 │
 ▼ Has Ghost?                          NO  → "With Ghost Protocol, I can do real research..."
 │
 ▼ Urgent / real-time?                 YES → Instant credit check
 │                                            Has capacity → Ghost Instant (gemma4:e4b, thinking OFF)
 │                                            None        → offer to defer
 │
 ▼                                     NO  → queue Deferred → Morning Pulse + Thinking Room

9.2 DDG-first optimization¶

Every Deep Search hits DDG first (free). Escalate to Brave only if DDG returns thin/empty/disambiguation-only response, OR query contains "research/compare/analyze." Estimated 70–80% resolved by DDG without touching Brave.

9.3 Price pre-fetching (Agent v1.2+, Deferred credits)¶

Ghost Agent pre-fetches grocery/pharmacy prices overnight. Results cached locally (7 days grocery, 14 days pharmacy). Daytime comparisons read from cache = 0 credits. "Another store is cheaper" notifications = FREE.

Action	Credits
Price pre-fetch for user's list	3 D (per store batch)
Price compare with cache	0
Price compare no cache	8 I
Cart fill with cache	8 I
Cart fill no cache	15 I
"Another store is cheaper" push	0

9.4 Laiv personality on routing¶

Proactive about deferring, not apologetic: - "Great question — I'll dig into that tonight and have a thorough answer in your Morning Pulse." - "I could look that up right now, or do a deeper dive overnight. Which would you prefer?"

Laiv NEVER mentions DuckDuckGo, Brave, Gemma, Kokoro, Anthropic, servers, APIs, infrastructure. Laiv just knows things.

10. MAPS, ROUTING & TRAFFIC (Ghost's role)¶

10.1 Philosophy¶

alaivOS is the intelligence layer ("when to leave, what to expect"), not a navigation app. Interactive map + voice nav + motorcycle time + OSRM routing are FREE for ALL tiers (Starter included). Traffic features are gated per MAP_MODULE_SPEC.md / PRICING_AND_TIERS.md:

Gate	Min tier
`interactiveMap`	Starter
`mapTrafficPatterns`	Spark
`mapFamilySharing`	Spark
`mapLiveTraffic`	Core
`mapTrafficNavigate`	Pro

10.2 Traffic Intelligence Engine (5-layer composite ETA)¶

baseline_spline × live_calibration × weather × calendar × event. Catmull-Rom cubic spline for Gold cities, linear for Standard. Factor chips display minutes, not percentages ("Rain expected — adds ~8 min"). Calendar adjustment covers 20 countries (holidays, puentes, Semana Santa, Buen Fin, Día de Muertos).

10.3 Data pipeline¶

3 servers (ghost-01 CX43 + cx23 + cx23-b), 290 cities Full tier, ZERO paid API deps. Overpass/OSM (backbone) + DDG (enrichment) + Photon Cloudflare Worker (autocomplete) + Nominatim (geocoding) + My Places FTS5. Google Places = temporary real-time luxury layer with dart-define kill-switch, $5,154 MXN credit expires June 5, 2026. Foursquare/Yelp cancelled.

Full pipeline details: DATA_PIPELINE.md.

11. GHOST AGENT [v1.2+ beta infrastructure]¶

11.1 Principle¶

Build everything. Test internally (friends/family). Flip switches for public release. No store skill goes public until 95%+ reliability over 5 test phases.

11.2 Levels¶

Level	Credits	Capability
Research	3 D or I	Browse, compare, report. Read-only web.
Interactive	5 I	Fill forms, build carts, apply coupons. Human confirms.
Transactional	8–15 I	Rule-based recurring purchases. Auto-approve under $X.

11.3 NanoClaw architecture¶

Ephemeral Docker container per task. Container has outbound internet only. CANNOT access Ghost filesystem, Ollama models, traffic DB, any alaivOS infra. Compromise blast radius = zero.

Hero use case: grocery-to-cart (MX retailers: Soriana, Chedraui, Walmart MX, Rappi, UberEats). Pharmacy (Similares, Guadalajara, Benavides, del Ahorro) with delivery-markup comparison.

Security: container-isolated, no identity/payment/personal data access, mandatory human approval gates on Interactive, circuit breakers on Transactional (hard cap, cooldown, anomaly detection).

Full spec: ghost_agent_beta_spec.md.

12. VISUAL IDENTITY¶

12.1 Concept¶

Ghost Protocol gets its own visual mode: dark, private, unmistakable. Like Chrome incognito trained users to associate dark UI with privacy, Ghost's dark treatment = instant shorthand in app, website, and marketing. "Dark mode, but for your data."

12.2 Design tokens¶

Token	Value	Usage
`ghost-bg`	`#0D0D0F`	Primary dark background
`ghost-surface`	`#161619`	Cards, elevated surfaces
`ghost-surface-hover`	`#1E1E22`	Hover/active
`ghost-border`	`rgba(255,255,255,0.06)`	Subtle card borders
`ghost-border-glow`	`rgba(255,23,68,0.15)`	Active/focus borders
`ghost-accent`	`#FF1744`	Primary accent
`ghost-accent-dim`	`rgba(255,23,68,0.6)`	Secondary accent
`ghost-accent-subtle`	`rgba(255,23,68,0.08)`	Banner bg, badge bg
`ghost-text`	`rgba(255,255,255,0.92)`	Primary text
`ghost-text-muted`	`rgba(255,255,255,0.45)`	Secondary text
`ghost-glow`	`0 0 40px rgba(255,23,68,0.08)`	Ambient surface glow
`ghost-scanline`	`rgba(255,255,255,0.015)`	Scanline overlay

Typography: Inter, +0.02em letter-spacing on labels for "terminal" feel. Icon: minimalist ghost silhouette or shield-with-slashed-eye (stealth, not Halloween).

12.3 In-app (Flutter)¶

GhostProtocolTheme (dark global theme) applied via AnimatedTheme (600 ms crossfade) when user enables Ghost.
Persistent GhostProtocolBanner below app bar: pulsing red dot + "GHOST PROTOCOL ACTIVE" + shield icon.
Activation moment (when user enables): dim 200 ms → red pulse from toggle → crossfade to dark 600 ms → banner slides in → medium haptic.
Settings > Ghost Protocol card styled in dark treatment even when app is in light mode (previews the experience).
Ghost cards: Color(0xFF161619) surface, 6% white border, 4% red ambient box-shadow.

Builder refs: Ghost_Protocol_App_Spec.md, Ghost_Protocol_Visual_Identity_Spec.md.

12.4 Settings > Ghost Protocol (credit display)¶

┌─────────────────────────────────────────┐
│  Ghost Plus — $7.99/mo                  │
│  Resets March 28                        │
│                                         │
│  🌙 Overnight     ████████░░  ~75%     │
│  ⚡ Real-time     ██████░░░░  ~50%     │
│                                         │
│  [Get More Credits]  [Manage Plan]      │
└─────────────────────────────────────────┘

Rounded percentages only (~100%/75%/50%/25%/Low). No exact numbers primary. Bar color green → amber → red. "Details" tap for granular breakdown.

13. WEBSITE — GHOST PROTOCOL SECTION¶

13.1 Structural change¶

Break Ghost Protocol out of <section class="psec"> pricing container into its own full-bleed <section class="ghost-zone"> between pricing and comparison/FAQ. Dark background spans edge-to-edge — scrolling in = entering a different environment.

13.2 Ambient effects¶

Scanline overlay — repeating-linear-gradient pseudo-element, rgba(255,255,255,0.015) lines. Surveillance/tech feel without distraction.
Red ambient glow — two blurred radial orbs (top-right 400×400, bottom-left 300×300), dimmer than main page orbs.
Scroll-triggered activation — @keyframes ghost-activate (flicker: 0→60%→30%→80%→100%) over 1.2 s, one-time. Via IntersectionObserver.
Pulsing badge — ghost-pulse 2.5 s loop on ::before red dot.

13.3 Copy¶

Headline: "Dark mode, but for your data." Subline: "Add sovereign AI to your Pro or Premium plan. Your prompts never leave your ecosystem."

Brand rule: Ghost Protocol always appears in dark context. Never on a light background in marketing. Trains audience: dark = sovereign.

Full spec: Ghost_Protocol_Website_Spec.md.

13.4 Marketing assets¶

Split-screen Normal vs Ghost Protocol (light glass vs dark vault) — natural TikTok/Reel format.
"The Switch" 15-second short: toggle → dim → red pulse → dark crossfade → banner slide → "Dark mode, but for your data."

14. TRIAL GHOST ACCESS¶

Trial users do NOT get Ghost. On-device only during 21-day trial (14 Pro + 7 Elite). Ghost enters surgically on Days 5–7 as a single carefully chosen Ghost-powered insight (with Ghost icon visible). Cost ~$0.08 per trial user. Shows what Ghost CAN do. Gift, not entitlement.

EXCEPT Checkup Day 28, which runs via Ghost Relay regardless of subscription and is FREE.

Trial details: TRIAL_AND_ONBOARDING.md.

15. USER COMMUNICATION¶

Laiv NEVER mentions DuckDuckGo, Brave, Gemma, Kokoro, Anthropic, Ghost, servers, credits, APIs. Laiv just knows things.

Situation	Laiv says
DDG answer	"It's 18°C and rainy in Tokyo right now."
Ghost Brain Deferred	"I researched that overnight — here's what I found."
Ghost Brain Instant	"Here's what I found..." (no fanfare)
Agent result	"Done! $82.47 — 21 of 23 items found."
Suggesting Ghost (non-sub)	"With Ghost Protocol, I can do real research — privately."
~25% credits remaining (once)	"I have a few Ghost sessions left this month. I'll prioritize your biggest questions."
Zero Instant	"I'll queue that for tonight's research. Ghost resets on [date]."
Zero everything	"I'm in on-device mode until [date]. I can still help with everything on your phone."

16. COMPLETE TIER CAPABILITY MATRIX¶

Capability	Starter	Spark	Core	Pro	Elite
On-device AI (AMI cascade)	—	0.8B/2B/4B per RAM	same	same	same
DDG Standard Search	5/day	15/day	30/day	50/day	Unlimited
Voice	Text only	Laiv Voice Std (Piper)	same	same	same
Ghost Protocol sub	—	+$3.99	+$3.99	+$3.99	Base included (50D/20I)
Ghost Voice HD	—	w/ Ghost	w/ Ghost	w/ Ghost	w/ Ghost
Custom Voice	—	$7.99 × 3 max	× 3 max	× 5 max	× 5 max
Laiv Checkup cadence	—	2/yr	4/yr	6/yr	12/yr
Day 28 trial Checkup	✅ FREE	✅	✅	✅	✅
Traffic: cold OSRM	✅	✅	✅	✅	✅
Traffic: patterns	—	✅	✅	✅	✅
Traffic: live + smart departure	—	—	✅	✅	✅
Traffic: coloring + navigate + multi-stop	—	—	—	✅	✅

Per-tier gate detail: PRICING_AND_TIERS.md.

17. COSTS & INFRA SUMMARY¶

Line	Monthly
ghost-01 CX43 (Ghost + Kokoro + Checkup Relay + Coturn + Sports Cache + pipeline feed)	€17
cx23 (Europe traffic)	€4
cx23-b (Expansion + DDG enrichment)	€4
Infra total	€25
TheSportsDB Patreon	$3
Anthropic (Checkup, variable)	~$50–75 @ 10k users
Brave Search API (variable)	~$5–40
Zero paid APIs in POI/search core pipeline	—

Full breakdown: INFRASTRUCTURE.md.

18. CANONICAL CROSS-REFERENCES¶

Topic	Read
AMI dynamic load/unload, Qwen 3.5 on-device, Gemma 4 E4B, Kokoro pipeline, Piper, Voxtral	`VOICE_AND_AI.md`
CX43 specs, cx23 / cx23-b, Bishop mini-PC, CDN, Cloudflare, cost breakdown	`INFRASTRUCTURE.md`
All 119+ feature gates × 7 tiers, pricing constants, annual (10-for-12), regional	`PRICING_AND_TIERS.md`
5-layer composite ETA, 290 cities, Gold/Standard/Lite, calendar countries	`DATA_PIPELINE.md`
5 map views, voice nav, dock, 5-layer search	`MAP_MODULE_SPEC.md`
7 defense layers, device fingerprint, Day-14 phone verify	`ANTI_ABUSE_SPEC.md`
21-day trial, 45-question interview, Day-14 verify	`TRIAL_AND_ONBOARDING.md`

19. CHANGE LOG (condensed)¶

Apr 13, 2026 (v2.7): Gemma 4 E4B confirmed as Ghost model. Checkup Relay endpoints added (/api/checkup/submit, /api/checkup/result/<return_token>, /api/checkup/aggregate stub). Dual anonymization (device → Gemma → Anthropic Batch) locked. Laiv Checkup 3-domain cadence baked into tiers. Kokoro 82M locked as Ghost TTS. Voice pipeline inversion (Kokoro-first) locked. Thinking mode OFF default for Instant. Sports cache (31 leagues, :8300) live. nginx headers updated for gemma4:e4b.
Apr 4, 2026 (v2.6): Coturn on ghost-01:3478/5349 for WebRTC NAT traversal (E2EE-transparent).
Mar 27, 2026: Qwen 3.5 9B moved from primary to fallback. Simplified to single Ollama model path.
Mar 24, 2026 (v2.1): Credit model locked as THE launch model. Per-skill pricing killed. Capability gating killed. 2 entitlements only.

This is the canonical Ghost Protocol document. If a sprint file disagrees on a tactical server detail, the sprint file wins until this doc is republished. If CLAUDE.md locked decisions disagree, CLAUDE.md wins. All other Ghost Protocol files are historical.