Infrastructure

Infrastructure Layer

Manage your entire infrastructure lifecycle with Northflank. Define cloud resources, clusters, and integrations in code or through the UI. Built-in multi-tenancy, disaster recovery, and enterprise-grade security.

Complete infrastructure control

Projects, clusters, and cloud providers

Northflank manages your entire infrastructure lifecycle—from provisioning cloud resources and Kubernetes clusters to configuring networks and integrations. Define everything in code or self-service through the UI.

Organize workloads into projects that map to Kubernetes namespaces. Deploy across multiple regions and cloud providers. Connect your own cloud accounts (BYOC) or use Northflank’s managed infrastructure.

Project organization

Group services, databases, and jobs into projects. Projects map to Kubernetes namespaces with isolated networking and resource quotas. Organize by application, team, or environment.

Multi-region deployment

Deploy workloads to specific regions for low latency and data residency. Northflank provides regions in North America, Europe, and Asia-Pacific. Choose regions per project or cluster.

BYOC providers

Connect AWS, GCP, Azure, or Civo cloud accounts. Northflank provisions and manages Kubernetes clusters within your infrastructure. Retain full control over cloud resources, networking, and billing.

Kubernetes cluster lifecycle

Northflank handles cluster provisioning, upgrades, scaling, and maintenance. Automatic version updates with configurable maintenance windows. Import existing clusters (BYOK) from any provider.

Cluster configuration

Configure node pools, machine types, autoscaling policies, and network settings. Define infrastructure requirements per cluster. Support for ARM and x86 architectures, GPUs, and specialized hardware.

Infrastructure as code

Define clusters, projects, integrations, and configuration in JSON templates. Store infrastructure definitions in Git. Changes sync bi-directionally between code and the platform.

Integrations and registries

Connect version control and container registries

Link Git repositories for automated builds and GitOps workflows. Configure private container registries for pulling and pushing images. All integrations available via UI, API, or infrastructure templates.

Screenshot of Northflank VCS integration

Version control integration

Connect GitHub, GitLab, and Bitbucket organizations. OAuth authentication for cloud-hosted services. Support for GitHub Enterprise, GitLab CE/EE, and self-hosted Bitbucket Server.

Private image registries

Configure Docker Hub, GitHub Container Registry, GitLab Registry, Amazon ECR, Google Artifact Registry, Azure Container Registry, or custom registries. Credentials encrypted and securely stored.

Infrastructure templates

Define VCS and registry integrations in templates. Reuse connection configurations across projects. Manage credentials separately from infrastructure definitions for security.

Security and isolation

Secure multi-tenancy with microVMs and gVisor

Northflank implements multiple isolation layers for secure multi-tenant infrastructure. Workloads run in sandboxed environments with strict resource boundaries, network policies, and namespace isolation.

Sandboxed execution

Workloads run in microVMs via Kata Containers or gVisor for VM-grade isolation. Each container gets its own kernel, preventing host and cross-tenant access. Complete runtime, network, and storage isolation.

Network isolation

Service mesh with mutual TLS encryption between workloads. Network policies enforce project-level boundaries. Private networking prevents unauthorized cross-project communication.

Namespace boundaries

Projects map to Kubernetes namespaces with resource quotas and limits. RBAC policies enforce access controls at namespace level. Complete separation between projects on shared clusters.
Screenshot of Northflank cluster security

Resilience and recovery

Disaster recovery and backup management

Automated backup systems for stateful workloads with configurable retention policies. Point-in-time recovery for databases. Cross-region replication and failover capabilities for high availability.

Automated backups

Scheduled backups for all managed databases with configurable frequency and retention. Automatic backup before destructive operations. Manual snapshot creation for important milestones.

Point-in-time recovery

Restore databases to any point within the retention window. Continuous backup of transaction logs. Recovery to specific timestamps for precise rollback.

Cross-region replication

Replicate stateful workloads across multiple regions for disaster recovery. Automated failover to secondary regions. Configure recovery time objectives (RTO) and recovery point objectives (RPO).

Monitoring and observability

Real-time logs, metrics, and alerting

Centralized logging and metrics for all infrastructure components. Real-time log streaming with 60-day retention. Configurable alerts for resource usage, health checks, and platform events.

Screenshot of Northflank service logs view

Real-time log tailing

Stream logs from services, jobs, builds, and infrastructure components in real-time. Filter by resource, time range, or log level. WebSocket-based streaming for instant updates.

Infrastructure metrics

CPU, memory, disk, and network metrics for all workloads and cluster nodes. Built-in dashboards and charts. Export metrics to Prometheus, Datadog, or custom monitoring systems.

60-day retention

Logs and metrics stored for 60 days with full searchability. Query historical data via API or UI. Export data for long-term archival or compliance requirements.

Log forwarding

Forward logs to external platforms like Datadog, Splunk, or New Relic. Filter logs before forwarding to reduce costs. Configure per-project or account-wide log sinks.

Logs and metrics API

Query logs and metrics programmatically via HTTP and WebSocket APIs. Integrate with custom dashboards, alerting systems, or data pipelines. Full API parity with UI features.

Infrastructure alerts

Platform and resource monitoring

Configure alerts for infrastructure events, resource usage thresholds, and platform activities. Notifications delivered via Slack, Discord, webhooks, or Microsoft Teams.

Screenshot of Northflank alert configuration

Resource usage alerts

Alert when CPU, memory, disk, or network usage exceeds thresholds. Configure per-service or cluster-wide monitoring. Multiple threshold levels for warning and critical states.

Platform event notifications

Receive notifications for builds, deployments, backup completions, job runs, and autoscaling events. Alert on failures, successes, or both. Filter events by project or resource type.

Integration channels

Send alerts to Slack channels, Discord servers, custom webhooks, or Microsoft Teams. Configure multiple notification channels per alert. Route different alert types to different channels.

High availability

Health checks and automatic recovery

Configure readiness, liveness, and startup probes to ensure service availability. Automatic container restarts on health check failures. Zero-downtime deployments with health validation.

Screenshot of Northflank health check configuration

Readiness probes

Test whether containers can receive traffic after initialization. HTTP, TCP, or command-based checks. Traffic routed only to healthy containers.

Liveness probes

Monitor running containers for failures. Automatic container restart when liveness checks fail. Configurable failure thresholds and timeout periods.

Startup probes

Delay liveness and readiness checks until container initialization completes. Prevent premature health check failures during slow startup. Configure longer timeouts for initialization.

Health monitoring

Dashboard showing health status for all services. Historical health check data and failure patterns. Alerts triggered on repeated health check failures.

Infrastructure as code

OpenTofu and template-driven infrastructure

Define infrastructure using Northflank templates or integrate OpenTofu for advanced infrastructure provisioning. Templates support composition, variables, and version control for repeatable infrastructure.

Screenshot of Northflank infrastructure code

Northflank templates

JSON-based templates for defining projects, services, databases, jobs, integrations, and configuration. Compose multiple templates into complete environments. Variables and conditionals for flexibility.

OpenTofu integration

Use OpenTofu nodes in Northflank templates to provision cloud resources. Combine Northflank-managed Kubernetes workloads with custom infrastructure. Execute Tofu plans during deployments.

Version-controlled infrastructure

Store templates in Git repositories with bi-directional sync. Track infrastructure changes with commit history. Review infrastructure modifications via pull requests.

Template composition

Reference and reuse templates across projects. Override variables for environment-specific configuration. Build libraries of reusable infrastructure patterns.

Enterprise governance

RBAC, audit logs, and access control

Enterprise-grade security and governance for infrastructure management. Fine-grained role-based access control, complete audit trails, and compliance reporting. Control who can provision infrastructure, modify clusters, or access production environments.

Role-based access control

Granular permissions for infrastructure operations. Define custom roles for cluster management, project creation, integration configuration, and resource provisioning. Assign roles at team or project level.

Complete audit logs

Track all infrastructure changes with detailed audit trails. Log cluster modifications, integration updates, configuration changes, and access events. Export logs for compliance and security analysis.

Self-service with guardrails

Enable developer self-service while maintaining governance. Define what resources teams can provision and in which regions. Set resource quotas and spending limits per project or team.

Features

Developer experience

Collapse details
  • Choice of UI, CLI, APIs & GitOps
  • Runs on AWS, GCP, Azure, Oracle
  • Build re-useable templates

Polyglot platform

Expand details
  • Run any language or framework
  • All envs, from preview to production
  • Works with GitHub, GitLab and Bitbucket

Run your AI

Expand details
  • Scale GPUs across clouds
  • Run untrusted code at scale
  • Serve your inference & models
  • Support fractional GPU workloads

We fixed Kubernetes

Expand details
  • Kubernetes-ready app platform
  • Run anywhere, on any Kubernetes cluster, on any cloud
  • EKS, GKE, AKS, Rancher, OpenShift, Tanzu ready
Developer experience
Polyglot platform
Run your AI
We fixed Kubernetes

Making Kubernetes
boring

If name-dropping helps, here you go.

sentry logowriter logonorthfield logoquiet logochaidiscovery logo