Operations runbook
Use this page as the starting index for day-two operations. Detailed topology rules remain in the repo HA and networking guides.
Install and bootstrap
- Production Setup — first node, worker join, WireGuard mesh env.
- Topology and Hostnames — DNS and hostname expectations.
Upgrade
- Read release notes for the target kindling version.
- Back up PostgreSQL (control plane) on a schedule compatible with your rollback plan.
- Replace the kindling binary on each host (control plane and workers) with the same version skew policy your team allows (prefer minimal skew).
- Restart systemd units (kindling@api, kindling@edge, kindling@worker, …) in an order that keeps the API reachable.
- Validate: Production Setup checklist, deploy a canary workload.
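The restart step above can be sketched as a small rolling-restart helper. This is a minimal illustration, not the project's tooling: the unit order in `UNITS`, the `rolling_restart` function, and the `api_healthy` probe are all assumptions introduced here, and the right order depends on your topology.

```python
from typing import Callable, Iterable, List

# Unit names are taken from the upgrade steps; the ORDER is an assumption
# made for this sketch, not a documented requirement.
UNITS = ["kindling@worker", "kindling@edge", "kindling@api"]


def rolling_restart(
    units: Iterable[str],
    run: Callable[[List[str]], None],
    api_healthy: Callable[[], bool],
) -> List[str]:
    """Restart units one at a time and stop as soon as the API check fails.

    `run` executes a command (argv list); `api_healthy` is a hypothetical
    probe supplied by the caller. Returns the units restarted successfully.
    """
    done: List[str] = []
    for unit in units:
        run(["systemctl", "restart", unit])
        if not api_healthy():
            raise RuntimeError(
                f"API unreachable after restarting {unit}; completed: {done}"
            )
        done.append(unit)
    return done
```

On a real host, `run` could be `lambda cmd: subprocess.run(cmd, check=True)` and `api_healthy` a request against whatever health endpoint your deployment exposes. Injecting both keeps the helper easy to dry-run before touching a production node.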
Backup and disaster recovery
- Control-plane backup and DR — what must be protected.
- Backup and restore — Postgres vs volumes.
- Repo: docs/control-plane-backup-and-dr.md
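For the Postgres side, one possible shape of a scheduled logical dump is sketched below. The linked guides define what actually has to be protected; this only shows how a timestamped `pg_dump` invocation might be assembled. The database name, output directory, and the `pg_dump_command` helper are hypothetical.

```python
from datetime import datetime, timezone
from typing import List


def pg_dump_command(dbname: str, out_dir: str, now: datetime) -> List[str]:
    """Build a pg_dump argv that writes a custom-format, timestamped dump.

    `dbname` and `out_dir` are placeholders; substitute the values used by
    your control-plane database.
    """
    stamp = now.astimezone(timezone.utc).strftime("%Y%m%dT%H%M%SZ")
    return [
        "pg_dump",
        "--format=custom",  # custom format can be restored with pg_restore
        f"--file={out_dir}/{dbname}-{stamp}.dump",
        f"--dbname={dbname}",
    ]
```

The function only builds the command so it can be logged or reviewed first. Passing the result to `subprocess.run(cmd, check=True)` would perform the dump.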
Maintenance: drain and reactivate
- Server Drain — evacuation semantics vs live migration.
- Platform admins use POST /api/servers/{id}/drain and POST /api/servers/{id}/activate (see API reference).
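As a rough sketch of calling these two endpoints from a script, the helper below builds the request with Python's standard library. Only the paths come from this page; the base URL, the bearer-token header, and the `server_action_request` name are assumptions, so check the API reference for the real authentication scheme.

```python
import urllib.parse
import urllib.request


def server_action_request(
    base_url: str, server_id: str, action: str, token: str
) -> urllib.request.Request:
    """Build a POST request for /api/servers/{id}/drain or .../activate.

    The Authorization header format is an assumption for illustration.
    """
    if action not in ("drain", "activate"):
        raise ValueError(f"unsupported action: {action!r}")
    safe_id = urllib.parse.quote(server_id, safe="")
    url = f"{base_url.rstrip('/')}/api/servers/{safe_id}/{action}"
    return urllib.request.Request(
        url,
        method="POST",
        headers={"Authorization": f"Bearer {token}"},
    )
```

Sending is then `urllib.request.urlopen(req)`. A typical maintenance window would issue `drain`, wait for evacuation to finish as described in Server Drain, do the work, and then issue `activate`.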
Audit trail
Cluster-global admin actions (server drain/activate, cluster meta updates, auth provider admin saves) append rows to cluster_audit_events.
Design and action names: docs/cluster-audit-events.md.
Ad-hoc inspection (SQL, superuser only in production):