v3.1 Production Runtime + Self-Healing Worker Cloud OS

Production Runtime

AutoBuilder Production Runtime is the reliability layer for the autonomous venture factory. It prepares worker queues, schedules, heartbeats, leases, retries, logs, recovery flows, quarantine, cost controls, alerts, backups, and runbooks.

DEMO / LOCAL MODE — workers not running live. DEMO RUNTIME STATE — LOCAL MODE

CONTROLLED RUNTIMEWORKER NOT RUNNINGNO SECRET VALUES SHOWNNO AUTO DEPLOY
CONTROLLED RUNTIMEAutomation is bounded, logged, and approval-gated.WORKER NOT RUNNINGWorker lane is represented in UI — no background process is executing this job.NO AUTO DEPLOYDeployment requires manual founder/operator approval.BACKUP NOT VERIFIEDBackup status is not confirmed — do not claim backups are active.

Next: Configure Redis + worker lease consumer with approval gates

AutoBuilder Production Runtime prepares worker queues, schedules, heartbeats, leases, retries, logs, recovery, quarantine, cost controls, alerts, backups, and runbooks. Runtime status is labeled honestly. No deploy, push, spend, contact, or publish without approval. Secret values are never displayed.

Command

Runtime command center

Runtime mode

READINESS ONLY — LOCAL MODE

Worker status

WORKER NOT RUNNING

Queue status

Queue schemas prepared — no live consumer

Scheduler

Schedules defined — not auto-running

Failed jobs

0

Stuck jobs

0

Quarantined

1

Approvals needed

2

Cost status

NO AUTO SPEND — limits configured

Server readiness

Hetzner plan prepared — NOT VERIFIED

Backup readiness

BACKUP NOT VERIFIED

Alert readiness

Telegram templates ready — NO AUTO CONTACT

Next runtime action: Configure Redis + worker lease consumer with approval gates

READINESS ONLYPrepared for future implementation or review — not verified production behavior.

Workers

Worker cloud dashboard

LaneNameHeartbeatLast seenCurrent jobCompletedFailedCostStatus
buildBuild Workerno heartbeat00$0.00WORKER NOT RUNNINGWorker lane is represented in UI — no background process is executing this job.
evalEval Workerno heartbeat00$0.00WORKER NOT RUNNINGWorker lane is represented in UI — no background process is executing this job.
recoveryRecovery Workerno heartbeat00$0.00WORKER NOT RUNNINGWorker lane is represented in UI — no background process is executing this job.

Kill/pause actions: APPROVAL REQUIRED — not wired in v3.1

Schedule

Job scheduler

JobScheduleTypeEnabledApprovalStatus
Lightweight scan*/15 * * * *health_scannonoPLANNEDOn the roadmap; not available in production yet.
Venture review0 * * * *venture_reviewnoyesAPPROVAL REQUIREDAutoBuilder prepared this; you must approve before it executes.
Daily portfolio review0 9 * * *portfolio_dailynoyesAPPROVAL REQUIREDAutoBuilder prepared this; you must approve before it executes.
Weekly portfolio report0 10 * * 1portfolio_weeklynoyesAPPROVAL REQUIREDAutoBuilder prepared this; you must approve before it executes.
Monthly strategy report0 8 1 * *strategy_monthlynoyesAPPROVAL REQUIREDAutoBuilder prepared this; you must approve before it executes.
Failed job retry scan*/30 * * * *retry_scannoyesAPPROVAL REQUIREDAutoBuilder prepared this; you must approve before it executes.
Quarantine review0 */6 * * *quarantine_reviewnoyesAPPROVAL REQUIREDAutoBuilder prepared this; you must approve before it executes.
Approval reminder0 8,16 * * *approval_remindernonoPLANNEDOn the roadmap; not available in production yet.
Cost summary0 23 * * *cost_summarynonoPLANNEDOn the roadmap; not available in production yet.
Noaerth candidate scan0 7 * * *noaerth_scannoyesAPPROVAL REQUIREDAutoBuilder prepared this; you must approve before it executes.

Schedules disabled by default — not auto-running.

Queues

Queue system readiness

job

primary

pending 0 · running 0 · failed 0 · DLQ 0

pausedschema ready

READINESS ONLYPrepared for future implementation or review — not verified production behavior.

priority

priority

pending 0 · running 0 · failed 0 · DLQ 0

pausedschema ready

READINESS ONLYPrepared for future implementation or review — not verified production behavior.

approval

approval

pending 2 · running 0 · failed 0 · DLQ 0

activedemo items

DEMOInteractive simulation or sample output — not a live deployment.

recovery

recovery

pending 0 · running 0 · failed 0 · DLQ 0

pausedschema ready

READINESS ONLYPrepared for future implementation or review — not verified production behavior.

dead-letter

dead_letter

pending 0 · running 0 · failed 0 · DLQ 0

pausedschema ready

READINESS ONLYPrepared for future implementation or review — not verified production behavior.

quarantine

quarantine

pending 1 · running 0 · failed 0 · DLQ 0

activedemo item

DEMOInteractive simulation or sample output — not a live deployment.

scheduled

scheduled

pending 0 · running 0 · failed 0 · DLQ 0

pausedschema ready

READINESS ONLYPrepared for future implementation or review — not verified production behavior.

report

report

pending 0 · running 0 · failed 0 · DLQ 0

pausedschema ready

READINESS ONLYPrepared for future implementation or review — not verified production behavior.

alert

alert

pending 0 · running 0 · failed 0 · DLQ 0

pausedschema ready

READINESS ONLYPrepared for future implementation or review — not verified production behavior.

Leases

Worker lease system

Job job_demo_001 · Worker w_build · Owner unassigned

Start · Expires · Timeout 300s · Heartbeat none

DEMOInteractive simulation or sample output — not a live deployment.

Retry

Timeout and retry rules

small jobs

Max retries 3 · Backoff exponential · Quarantine after 3

Retry safe: yes

READINESS ONLYPrepared for future implementation or review — not verified production behavior.

medium jobs

Max retries 2 · Backoff exponential · Quarantine after 2

Retry safe: yes

READINESS ONLYPrepared for future implementation or review — not verified production behavior.

large jobs

Max retries 1 · Backoff linear · Quarantine after 1

Retry safe: no — approval required

APPROVAL REQUIREDAutoBuilder prepared this; you must approve before it executes.

approval_required jobs

Max retries 0 · Backoff none · Quarantine after 1

Retry safe: no — approval required

NO AUTO SPENDCredit spend and payments require approval.

DLQ

Dead-letter queue

job_failed_demo · build · venture venture_demo

Example failure — no real job executed

Recovery: Inspect build logs, verify env, retry only after approval

DEMOInteractive simulation or sample output — not a live deployment.

Recovery

Recovery automation readiness

build_timeouttimeout

Check worker lease expiry, increase timeout for large jobs, verify build command

Rollback: Revert to last known good build artifact

Human review: required

READINESS ONLYPrepared for future implementation or review — not verified production behavior.

Quarantine

Quarantine system

unsafe_public_claim

Demo item — marketing copy exceeded verified capability

landing page · severity medium

Action: Replace with READINESS ONLY label; operator review

QUARANTINEDJob isolated after failures — human review before retry.

Logs

Logs and observability

runtime · info · production-runtime

Runtime dashboard initialized — LOCAL MODE, no workers connected

Secrets redacted: yes · 2026-05-29T10:31:28.507Z

DEMOInteractive simulation or sample output — not a live deployment.

No fake historical logs — only initialization events in demo.

Env

Environment variable checker

NO SECRET VALUES SHOWN — PRESENT or MISSING only.

VariableRequired forStatusClient safe
NEXT_PUBLIC_SUPABASE_URLSupabase clientPRESENTyes
NEXT_PUBLIC_SUPABASE_ANON_KEYSupabase authPRESENTyes
SUPABASE_SERVICE_ROLE_KEYSupabase serverPRESENTserver only
OPENROUTER_API_KEYModel routingPRESENTserver only
VERCEL_TOKENDeploy automationMISSINGserver only
GITHUB_TOKENGitHub integrationMISSINGserver only
TELEGRAM_BOT_TOKENTelegram alertsMISSINGserver only
STRIPE_SECRET_KEYBillingMISSINGserver only
NEXT_PUBLIC_APP_URLApp URLPRESENTyes
DATABASE_URLDatabaseMISSINGserver only
REDIS_URLQueue backendMISSINGserver only

Secrets

Secret safety checker

env_exposure_risk

Verify NEXT_PUBLIC_* vars contain only public-safe keys

Client-side bundles · risk medium

Never expose service role or API keys in client bundles

READINESS ONLYPrepared for future implementation or review — not verified production behavior.

hardcoded_key_risk

Run secret scan before production; no automated scan in v3.1

Source scan — manual review · risk high

Use env vars; enable pre-commit secret detection

PLANNEDOn the roadmap; not available in production yet.

Deploy

GitHub / Vercel monitor readiness

Repo: NOT VERIFIED — connect token to check

Commit:

Build: NOT VERIFIED

Deploy: NOT VERIFIED

Deploy approval required · Auto deploy disabled

NO AUTO DEPLOYDeployment requires manual founder/operator approval.

Database

Supabase runtime readiness

Tables: NOT VERIFIED

RLS: NOT VERIFIED

Migrations: NOT VERIFIED

Auth: NOT VERIFIED

Service role: Server-only — never client-side

Backup: BACKUP NOT VERIFIED

Missing: Live migration audit, RLS policy review, Backup schedule

NOT VERIFIEDNo evidence submitted yet — do not treat as proven.

Queue backend

Redis / queue backend readiness

Redis URL: MISSING CONFIG

Backend: MISSING CONFIG

Namespace: autobuilder:v31

Consumer: WORKER NOT RUNNING

MISSING CONFIGRequired configuration is not present.

Infrastructure

Hetzner worker runtime plan

Role: worker-fleet-primary

Process manager: systemd (planned)

Restart: on-failure, max 3 restarts/hour

Resources: 2 vCPU, 4GB RAM, 40GB disk per worker node

Deploy: Manual deploy with operator approval — NO AUTO DEPLOY

Rollback: Restore previous systemd unit + artifact tag

PLANNEDOn the roadmap; not available in production yet.

Alerts

Telegram operator alert readiness

MISSING CONFIG — NO AUTO CONTACT

Templates prepared — NO AUTO CONTACT until operator enables.

MISSING CONFIGRequired configuration is not present.

Cost

Cost and runaway controls

Daily spend

$0.00

Weekly spend

$0.00

Max jobs/hr

20

Max retries

3

Max workers

3

Approval threshold

$25/day

NO AUTO SPENDCredit spend and payments require approval.

Safety

Emergency stop system

Scope: global · Status: armed — not triggered

Workers paused: no · Paid jobs paused: no · External actions paused: no

CONTROLLED RUNTIMEAutomation is bounded, logged, and approval-gated.

Backup

Backup and restore readiness

Database: BACKUP NOT VERIFIED

Venture export: Local export available in workspace

Logs backup: NOT VERIFIED

Last backup:

Verified: no

1) Verify backup 2) Test restore in staging 3) Document RTO/RPO

BACKUP NOT VERIFIEDBackup status is not confirmed — do not claim backups are active.

Runbooks

Production runbooks

Build failed

Trigger: CI or worker build returns error

  • Contain
  • Classify
  • Generate recovery packet
  • Operator review
  • Retry only if safe
READINESS ONLYPrepared for future implementation or review — not verified production behavior.

Worker stuck

Trigger: Lease expired without heartbeat

  • Contain
  • Classify
  • Generate recovery packet
  • Operator review
  • Retry only if safe
READINESS ONLYPrepared for future implementation or review — not verified production behavior.

Queue backed up

Trigger: Pending count exceeds threshold

  • Contain
  • Classify
  • Generate recovery packet
  • Operator review
  • Retry only if safe
READINESS ONLYPrepared for future implementation or review — not verified production behavior.

Environment missing

Trigger: Required env var absent

  • Contain
  • Classify
  • Generate recovery packet
  • Operator review
  • Retry only if safe
READINESS ONLYPrepared for future implementation or review — not verified production behavior.

Unsafe claim published

Trigger: Public page exceeds verified capability

  • Contain
  • Classify
  • Generate recovery packet
  • Operator review
  • Retry only if safe
READINESS ONLYPrepared for future implementation or review — not verified production behavior.

Incidents

Incident response runtime

template template

We are investigating a runtime issue. No user data exposed.

  • Pause affected lane
  • Quarantine risky output
  • Alert operator
PLANNEDOn the roadmap; not available in production yet.

QA

Runtime QA checklist

CheckStatusFindingRisk
No fake worker statuspassWorkers labeled WORKER NOT RUNNINGLIVEShipped in production and reachable on this site.
No fake uptimepassNo uptime claims displayedLIVEShipped in production and reachable on this site.
No secrets exposedpassEnv checker shows PRESENT/MISSING onlyNO SECRET VALUES SHOWNOnly presence or missing status — secret values never displayed.
No auto-deploypassNO AUTO DEPLOY labels appliedNO AUTO DEPLOYDeployment requires manual founder/operator approval.
Redis consumerwarnQueue consumer not runningWORKER NOT RUNNINGWorker lane is represented in UI — no background process is executing this job.
Backup verifiedfailBACKUP NOT VERIFIEDBACKUP NOT VERIFIEDBackup status is not confirmed — do not claim backups are active.

Execution

Cursor / Aider runtime packets

Copy packets for controlled runtime implementation.

Add Production Runtime Route

route

Add Runtime Command Center

command

Add Worker Cloud Dashboard

worker

Add Job Scheduler

scheduler

Add Queue System Readiness

queues

Add Worker Lease System

leases

Add Timeout and Retry Rules

retry

Add Dead-Letter Queue

dlq

Add Recovery Automation Readiness

recovery

Add Quarantine System

quarantine

Add Logs and Observability

logs

Add Environment Variable Checker

env

Add Secret Safety Checker

secret

Add GitHub/Vercel Monitor Readiness

github

Add Supabase Runtime Readiness

supabase

Add Redis Queue Readiness

redis

Add Hetzner Worker Runtime Plan

hetzner

Add Telegram Alert Readiness

telegram

Add Cost and Runaway Controls

cost

Add Emergency Stop System

estop

Add Backup and Restore Readiness

backup

Add Production Runbooks

runbooks

Add Incident Response Runtime

incident

Add Runtime QA Checklist

qa

Add Worker Cloud Final QA

final

Wire local worker lease consumer (Aider)

APPROVAL REQUIREDAutoBuilder prepared this; you must approve before it executes.