Senior Engineering Build

OpsCommand: CTO Operations Control Plane for Hosting Infrastructure

This build sits above the live hosting stack and focuses on governance, observability, incident response, deployment control, backup verification, security posture, and business workflow traceability.

Hosting model: this portfolio page lives on GitHub Pages, while the live application runtime is hosted on the UpCloud VPS.

Access options: Direct IP (most compatible) | Secure hostname
If one link is blocked by your network, use the other.

Service Health Dashboard

Example operational snapshot: values below are demonstration records showing how the control plane presents live telemetry and service state.

13/13

Containers healthy

99.94%

7-day service uptime

31 ms

Median API response latency

12d

Last unplanned restart window

Service Status Uptime Last Restart Dependency Check Notes
Paymenter Online 99.91% 2026-06-01 04:22 MariaDB + Redis reachable Billing API healthy
Nginx Proxy Manager Online 99.98% 2026-05-29 01:11 SSL renew queue clear Host routes valid
Discord Bot Degraded 99.72% 2026-06-04 22:05 Gateway reconnect spikes Reconnect policy tuning pending
Nextdactyl Integration Online 99.88% 2026-05-31 03:44 Panel API auth valid Provisioning queue stable

Incident Management

Sample incident record: illustrates severity handling, timeline tracking, and postmortem capture format.

Active Incident Example

Incident: Paymenter unavailable

Severity: High

Affected: billing portal, customer invoices, provisioning flow

Detected by: synthetic health check + container liveness alert

Deployment Ledger

Demonstration ledger entries: format for change governance, rollback visibility, and migration risk tracking.

Date Change Commit Affected Services Migration Rollback Result
2026-06-05 Health-check worker retry policy update f7c3a4d ops-control-worker, paymenter-monitor No Yes Success
2026-06-03 Discord event ingest queue tuning 8ea2b11 skyhigh-bot, ops-control-api No Yes Success
2026-05-31 Billing event webhook schema update 1ac68de paymenter, ops-control-api Yes Yes Monitored

Backup and Restore Verification

Sample backup panel: demonstrates restore-test reporting, retention policy display, and encryption status tracking.

MariaDB Backup

Successful

Last backup: 2026-06-05 03:00 UTC

Restore test: Passed

Retention: 7 daily / 4 weekly / 3 monthly

Encryption: Enabled

Redis Snapshot

Successful

Last backup: 2026-06-05 03:05 UTC

Restore test: Passed

Checksum verification: Valid

Alerting

Watch Backup size delta threshold enabled

Failure alert channel: Discord + email

Escalation path: on-call + incident board

Security and Access Control View

Admin Port Exposure

Restricted Admin paths limited via firewall + routing rules.

Cloudflare Route Protection

Enabled Protected routes and scoped tokens configured.

Public vs Internal Services

Mapped Internal services not exposed directly.

Secrets Rotation

Due in 12d API key rotation reminder active.

Business Workflow Map

Customer signup -> Paymenter account event -> Provisioning queue -> Nextdactyl/Pterodactyl action -> Discord role/ticket update -> Ops ledger event + compliance trail

Architecture Decision Records (ADR)

ADR-001: Docker Compose over Kubernetes

Reason: faster delivery, lower operational burden, better fit for current scale. Tradeoff: lower native autoscaling. Future path: migrate critical services to k3s/managed Kubernetes when utilization warrants.

ADR-002: Nginx Proxy Manager as reverse-proxy + SSL layer

Reason: host-based routing speed and operational usability. Tradeoff: less dynamic than Traefik-native discovery. Future path: move to Traefik if service churn increases.

Build Order

Phase 1

Read-only service health, uptime, SSL/domain and connectivity checks.

Phase 2

Incident management and deployment ledger with rollback visibility.

Phase 3

Backup verification, restore testing records, and security exposure dashboards.

Phase 4

Business workflow observability for billing, provisioning, and Discord operations.

Phase 5

Case-study evidence pack with architecture diagrams, screenshots, and outcomes.