SettleMint
ArchitectureSelf-HostingHigh Availability

Cloud-Native (Recommended)

Single-region multi-AZ deployment using managed Kubernetes services, managed PostgreSQL, and Velero backups. The recommended HA approach for most self-hosted DALP deployments.

Purpose: Describe the recommended cloud-native HA deployment pattern.


Single-region, multi-AZ deployment using managed services and Velero backups. This is the recommended approach for most deployments.

Architecture

Rendering diagram...

Recovery metrics

MetricTargetNotes
RTO2–15 minutesAutomatic failover for most failures
RPOSeconds–1 minuteSynchronous replication
RTT15–60 minutesIncluding verification

Setup and maintenance

TaskTime estimateClient role
Kubernetes cluster provisioning2–4 hoursClient platform engineer
Managed PostgreSQL setup1–2 hoursClient platform engineer
Velero installation and config2–4 hoursClient platform engineer
Backup verification2–4 hoursClient platform engineer
Monitoring and alerting2–4 hoursClient platform engineer
Documentation and runbooks4–8 hoursClient platform engineer
Total initial setup2–3 days1 client engineer
ActivityFrequencyTime per cycle
Backup verificationWeekly30 minutes
Helm chart updatesMonthly1–2 hours
DR drill / restore testQuarterly4–8 hours
Security patchingMonthly2–4 hours
Capacity reviewQuarterly2–4 hours
Monthly effort8–16 hours

Team requirements

  • Minimum: Part of platform team responsibilities (~0.25 FTE)
  • Recommended: Dedicated on-call rotation for production incidents

Required skills: Kubernetes/OpenShift administration (intermediate), Helm chart management (basic), cloud provider managed services, basic PostgreSQL operations, Prometheus/Grafana monitoring.

Cloud provider configurations

AWS (EKS): EKS control plane multi-AZ by default, Auto Scaling Groups across AZs, RDS Multi-AZ, ElastiCache Multi-AZ, S3 (99.999999999% durability).

Azure (AKS): Zone-redundant control plane, Availability Zones for workers, Azure Database zone-redundant HA, Azure Cache zone redundancy, Blob Storage ZRS or GRS.

GCP (GKE): Regional multi-zone control plane, multi-zone node pool, Cloud SQL Regional HA, Memorystore Standard tier, Cloud Storage multi-regional.

OpenShift (OCP/OKD): Multi-master with etcd quorum, workers across failure domains, OpenShift Data Foundation (Ceph) for storage, router sharding for HA.

On this page