

How much does an NVIDIA RTX PRO 6000 GPU cost?
- The NVIDIA RTX PRO 6000 Blackwell Workstation Edition is a 96 GB GDDR7 GPU built on the NVIDIA Blackwell architecture with 5th-generation Tensor Cores, 4,000 AI TOPS, and 1,792 GB/sec memory bandwidth. It targets AI inference, data science, video, and professional graphics workloads that benefit from high memory capacity and bandwidth.
- RTX PRO 6000 pricing varies across providers depending on what is included. RunPod lists rates from $1.69/hr (Community Cloud) to $2.09/hr (Secure Cloud) but provides raw GPU access without a managed platform layer. Modal lists the GPU only at $3.03/hr with CPU and RAM billed separately on top.
- On Northflank, the NVIDIA RTX PRO 6000 96 GB costs $3.00/hour, with GPU, CPU, RAM, and storage included in that price. RTX PRO 6000 workloads run on Northflank's managed cloud or on your own cloud account via BYOC (GCP, AWS, Azure, OCI, or CoreWeave). GPU and CPU workloads run on the same platform alongside databases, services, and jobs. GPU workloads on Northflank Cloud use gVisor isolation by default. Discounts are available for volume and longer-term commits.
View RTX PRO 6000 instance configurations and availability or request RTX PRO 6000 GPU capacity for volume or reservation requirements.
The NVIDIA RTX PRO 6000 Blackwell Workstation Edition is a GPU designed for AI development, data science, video, professional graphics, and HPC workloads.
This article covers the RTX PRO 6000's specs, use cases, pricing across GPU cloud providers, and how to deploy RTX PRO 6000 GPU workloads on Northflank.
The NVIDIA RTX PRO 6000 Blackwell Workstation Edition is a professional GPU built on the NVIDIA Blackwell architecture. It is equipped with 96 GB of GDDR7 memory with error-correcting code (ECC) and 1,792 GB/sec of memory bandwidth.
Official specifications from NVIDIA:
| Specification | Detail |
|---|---|
| Architecture | NVIDIA Blackwell |
| AI TOPS | 4,000 |
| GPU memory | 96 GB GDDR7 with ECC |
| Memory bandwidth | 1,792 GB/sec |
| Tensor Cores | 5th Gen |
| Ray Tracing Cores | 4th Gen |
| NVIDIA Encoder (NVENC) | 4x 9th Gen |
| NVIDIA Decoder (NVDEC) | 4x 6th Gen |
| Max power consumption | 600W |
The NVIDIA RTX PRO 6000 is designed for AI development, data science, video content and streaming, HPC, AI-driven rendering and graphics, and game development workloads. NVIDIA positions it as a professional GPU for workloads that benefit from high memory capacity, high bandwidth, and 5th-generation Tensor Core performance.
Common workload types include:
- AI development: AI inference and training workloads that benefit from 96 GB of VRAM and 5th-generation Tensor Core performance
- Data science: GPU-accelerated data pipelines and analytics workloads
- Video content and streaming: professional video encoding and decoding via 4x 9th-gen NVENC and 4x 6th-gen NVDEC engines
- AI-driven rendering and graphics: ray tracing, simulation, and 3D design workflows
- HPC: scientific computing and simulation workloads
- Game development: game development workflows supported by AI
Workloads that require multi-GPU high-bandwidth interconnects for distributed training at scale may be better served by A100, H100, H200, or B200 instances.
For pricing and deployment guides, see how much does an NVIDIA A100 GPU cost, how much does an NVIDIA H100 GPU cost, and how much does an NVIDIA B200 GPU cost.
On Northflank, the NVIDIA RTX PRO 6000 96 GB costs $3.00/hour. That price includes GPU, CPU, RAM, and storage. On the pay-as-you-go plan, billing is pro-rated to the second, so you pay only for the time your workload runs. RTX PRO 6000 workloads run on Northflank's managed cloud or on your own cloud account via BYOC.
Discounts are available for volume and longer-term commits. (Request GPU capacity)

See the NVIDIA RTX PRO 6000 on Northflank page for current availability and instance configurations, visit the pricing page for the full GPU pricing table, or use the pricing calculator to estimate your monthly spend.
For a broader comparison of AI sandbox pricing including GPU access costs across platforms, see AI sandbox pricing comparison.
For teams deploying GPU workloads alongside APIs, workers, databases, and other services, Northflank supports GPU and CPU workloads, managed databases, CI/CD pipelines, secrets, autoscaling, observability, IaC templates, and BYOC in one platform. Get started (self-serve) or book a demo if you have specific infrastructure or compliance requirements.
- GPU workloads on Northflank: overview of inference, training, and notebook workloads on the platform
- NVIDIA RTX PRO 6000 on Northflank: available RTX PRO 6000 instance configurations
- Northflank pricing: full GPU and compute pricing
- Request GPU capacity: for volume or reservation requirements
RTX PRO 6000 pricing varies depending on whether the rate covers the GPU only or a bundled compute unit including CPU and RAM. The table below reflects published rates at the time of writing. Prices, especially on marketplace platforms, are subject to change.
| Provider | RTX PRO 6000 (96 GB) price | What's included | Notes |
|---|---|---|---|
| Northflank | $3.00/hr | GPU, CPU, RAM, storage | Managed cloud or BYOC (GCP, AWS, Azure, OCI, CoreWeave); GPU and CPU workloads on one platform; gVisor isolation on managed cloud; workload optimisation controls |
| Modal | $3.03/hr | GPU only | CPU billed at $0.0473/physical core/hr (2 vCPU equivalent) and RAM at $0.0080/GiB/hr on top; region selection adds a 1.5x to 1.75x price multiplier |
| RunPod | $1.69/hr (Community Cloud) / $2.09/hr (Secure Cloud) | GPU, CPU, RAM (pod) | Community Cloud and Secure Cloud tiers at different rates; raw GPU rental |
Modal's headline RTX PRO 6000 rate is slightly above Northflank's at $3.03/hr, but CPU and RAM are metered separately on top of that at $0.0473/physical core/hr (each physical core is equivalent to 2 vCPUs) and $0.0080/GiB/hr, respectively.
RunPod lists RTX PRO 6000 pods from $1.69/hr to $2.09/hr depending on the tier, but provides raw GPU access without a managed platform layer.
Northflank's $3.00/hr rate covers the full compute stack (GPU, CPU, RAM, and storage). Northflank manages the orchestration layer and supports CPU services, databases, and jobs alongside GPU workloads on the same platform.
Northflank supports two deployment paths for RTX PRO 6000 GPU workloads.
Northflank Cloud is a managed environment where you create a GPU-enabled project and Northflank manages the underlying infrastructure. GPU workloads on Northflank Cloud use gVisor isolation by default, which can be relevant for teams running multi-tenant inference or executing untrusted code.
Bring your own cloud (BYOC) lets you deploy RTX PRO 6000 GPU nodes on your own cloud account using GCP, AWS, Azure, OCI, or CoreWeave, while Northflank manages the orchestration layer. This path suits teams that need to retain control of their infrastructure or billing relationships with a specific cloud provider.
Both paths use the same Northflank UI, API, and CLI, so the deployment workflow is consistent regardless of where the workload runs.
See the following guides to get started with either deployment path:
- Deploy GPUs on Northflank Cloud: step-by-step guide for deploying GPU workloads on Northflank's managed infrastructure
- Deploy GPUs in your own cloud: configuring GPU node pools and workloads on BYOC clusters
The steps below apply to deploying on Northflank's managed cloud. BYOC deployment requires a configured cluster first.
- Create a new project in a GPU-enabled region on Northflank Cloud.
- Create a deployment service or job within that project.
- Select NVIDIA RTX PRO 6000 as the GPU type and set the GPU count in the resources configuration.
- Use a container image compatible with CUDA 12.0 or later. For example,
nvidia/cuda:12.8.0-cudnn-runtime-ubuntu22.04or an official framework image such aspytorch/pytorch:2.6.0-cuda12.4-cudnn9-runtime. - Mount a persistent volume to the default model cache path for your framework (for example,
/root/.cache/huggingfacefor Hugging Face models) to avoid re-downloading weights on every restart.
Your application also needs to be configured to use the GPU at the framework level. For PyTorch, check device availability with torch.device("cuda" if torch.cuda.is_available() else "cpu"). For TensorFlow, confirm GPU visibility with tf.config.list_physical_devices('GPU').
Deploy RTX PRO 6000 GPU workloads on Northflank
- GPUs on Northflank: overview of GPU deployment options on managed cloud and BYOC
- Deploy GPUs on Northflank Cloud: step-by-step guide for GPU projects on Northflank's managed infrastructure
- Deploy GPUs in your own cloud: configuring GPU node pools on BYOC clusters
- Configure and optimise workloads for GPUs: right-sizing CPU and memory, persisting models, and using GPU-optimised base images
- Sandboxes on Northflank: microVM-backed isolation for GPU and CPU workloads
Get started (self-serve), or book a session with an engineer if you have specific infrastructure or compliance requirements.
Northflank supports sandbox deployments backed by microVM-based isolation. GPU workloads on Northflank Cloud use gVisor isolation by default, which provides kernel-level separation between containers.
This can be relevant for teams that run workloads involving user-submitted code, LLM-generated code execution, or multi-tenant inference pipelines where container isolation requirements are stricter than standard Kubernetes defaults. Whether this isolation model fits a given security posture depends on the specific workload and compliance requirements.
See Sandboxes on Northflank for full details on the isolation model and how to configure GPU sandboxes. For a broader look at GPU sandbox isolation models and platform support, see GPU sandboxes: isolation models and platform support.
The NVIDIA RTX PRO 6000 Blackwell Workstation Edition is a professional GPU built on the NVIDIA Blackwell architecture with 96 GB of GDDR7 memory with ECC, 1,792 GB/sec memory bandwidth, 5th-generation Tensor Cores, and 4,000 AI TOPS. It is designed for AI development, data science, video, professional graphics, and HPC workloads.
On Northflank, the NVIDIA RTX PRO 6000 96 GB costs $3.00/hour with GPU, CPU, RAM, and storage included. Northflank manages the orchestration layer and runs GPU workloads alongside CPU services, databases, and jobs. Modal lists the RTX PRO 6000 at $3.03/hr for the GPU only, with CPU and RAM billed separately on top. RunPod lists RTX PRO 6000 pods from $1.69/hr (Community Cloud) and $2.09/hr (Secure Cloud), providing raw GPU access without a managed platform layer.
The RTX PRO 6000 is designed for AI development, data science, video content and streaming, HPC, AI-driven rendering and graphics, and game development workloads.
Yes. The $3.00/hour rate on Northflank includes GPU, CPU, RAM, and storage.
See the NVIDIA RTX PRO 6000 on Northflank page for current availability, instance configurations, and supported cloud providers.

