Blog
AI
Infrastructure Management
Product
5
minutes

GPU workloads on EKS just got way simpler with Qovery

Running GPU workloads on EKS has never been easy, until now. With Qovery’s latest update, you can enable GPU nodes, configure GPU access, and optimize costs automatically, all without writing a single line of YAML or touching Helm charts. Qovery now handles everything behind the scenes so you can focus entirely on your applications.
October 21, 2025
Alessandro Carrano
Head of Product
Summary
Twitter icon
linkedin icon

Key Points:

  • Qovery radically simplifies GPU provisioning on EKS: It automates the entire complex process, which previously involved manually defining node pools, configuring YAMLs, installing NVIDIA plugins, and modifying application manifests; reducing it to a few simple steps and eliminating significant DevOps overhead.
  • Controlled and cost-optimized GPU access: Users can easily enable GPU node pools, select instance types (mixing On-Demand and Spot instances for cost/performance), define cluster-wide limits, and specify the GPU needs per application.
  • Automatic setup and optimization: Qovery automatically handles the technical backend, including installing and configuring the NVIDIA Kubernetes Device Plugin and ensuring cost efficiency by selecting the best instance type combinations through its Karpenter implementation.

Run GPU-Powered applications on an EKS cluster in minutes

Whether you’re training models, running inference pipelines, or powering compute-heavy workloads, Qovery makes GPU provisioning on a EKS cluster simple as that:

  • Enable the GPUs on your cluster:
    • enable GPU node pools in your cluster settings
    • choose the instance types you need.
    • Mix On-Demand and Spot instances for the best balance of cost and performance.
    • Define limits on the number of GPUs that can be used overall by your cluster.
  • Define GPU needs per application: specify how many GPUs your app requires. Qovery handles provisioning, scheduling, and placement for you.
<\div>

Qovery's Provision (our Kubernetes deployment platform) automatically takes care of:

  • NVIDIA plugin setup: Qovery installs and configures the NVIDIA Kubernetes Device Plugin automatically, so your applications can access GPUs right away.
  • Cost optimization: Qovery selects the most cost-effective instance type combination based on your workload’s GPU requirements, thanks to the Karpenter implementation.

What it used to look like (before Qovery)

Before this release, setting up GPU workloads on Kubernetes meant doing everything yourself:

  1. Manually define and deploy GPU node pools
    • Create YAML or CLI definitions for GPU-capable nodes, labels, taints, and autoscaling.
    • Configure Spot instance handling and scaling behavior.
  2. Install NVIDIA components
    • Add the NVIDIA Helm charts for the device plugin and drivers.
    • Manage chart values, version compatibility, and updates across clusters.
  3. Modify application manifests
    • Add resources.requests/limits for nvidia.com/gpu.
    • Set node selectors, tolerations, and affinities for GPU nodes.
    • Tune and redeploy Helm charts for GPU access.
  4. Maintain and optimize over time
    • Monitor GPU utilization and keep costs under control.
    • Update plugins and drivers as Kubernetes versions evolve.

Now, Qovery does all that for you.

Why this matters

GPU workloads are core to modern applications but Kubernetes wasn’t built to make GPU management simple.

Qovery bridges that gap by abstracting away the complexity. In just a few clicks, your applications can access powerful GPUs, without the DevOps overhead.

Get started

You can start using GPU node provisioning today.

Check out our documentation to learn how to enable GPU support for your clusters and applications.

Share on :
Twitter icon
linkedin icon
Tired of fighting your Kubernetes platform?
Qovery provides a unified Kubernetes control plane for cluster provisioning, security, and deployments - giving you an enterprise-grade platform without the DIY overhead.
See it in action

Suggested articles

Kubernetes
 minutes
Stopping Kubernetes cloud waste: agentic automation for enterprise fleets

Agentic Kubernetes resource reclamation is the practice of using an autonomous control plane to continuously identify, suspend, and delete idle infrastructure across a multi-cloud Kubernetes fleet. It replaces manual cleanup and reactive autoscaling with intent-based policies that act on business state, eliminating the configuration drift and cloud waste typical of unmanaged fleets.

Mélanie Dallé
Senior Marketing Manager
Platform Engineering
Kubernetes
DevOps
10
 minutes
What is Kubernetes? The reality of Day-2 enterprise fleet orchestration

Kubernetes focuses on container orchestration, but the reality on the ground is far less forgiving. Provisioning a single cluster is a trivial Day-1 exercise. The true operational nightmare begins on Day 2. Teams that treat multi-cloud fleets like isolated pets inevitably face crushing YAML configuration drift, runaway AWS bills, and severe scaling bottlenecks.

Morgan Perry
Co-founder
AI
Compliance
Healthtech
 minutes
Agentic AI infrastructure: moving beyond Copilots to autonomous operations

The shift from AI copilots to autonomous agents is redefining infrastructure requirements. Discover how to build secure, stateful, and compliant Agentic AI systems using Kubernetes, sandboxing, and observability while meeting EU AI Act standards

Mélanie Dallé
Senior Marketing Manager
Kubernetes
8
 minutes
The 2026 guide to Kubernetes management: master day-2 ops with agentic control

Effective Kubernetes management in 2026 demands a shift from manual cluster building to intent-based fleet orchestration. By implementing agentic automation on standard EKS, GKE, or AKS clusters, enterprises eliminate operational weight, prevent configuration drift, and proactively control cloud spend without vendor lock-in, enabling effective scaling across massive fleets.

Mélanie Dallé
Senior Marketing Manager
Kubernetes
 minutes
Building a single pane of glass for enterprise Kubernetes fleets

A Kubernetes single pane of glass is a centralized management layer that unifies visibility, access control, cost allocation, and policy enforcement across § cluster in an enterprise fleet for all cloud providers. It replaces the fragmented practice of switching between AWS, GCP, and Azure consoles to govern infrastructure, giving platform teams a single source of truth for multi-cloud Kubernetes operations.

Mélanie Dallé
Senior Marketing Manager
Kubernetes
 minutes
How to deploy a Docker container on Kubernetes (and why manual YAML fails at scale)

Deploying a Docker container on Kubernetes requires building an image, authenticating with a registry, writing YAML deployment manifests, configuring services, and executing kubectl commands. While necessary to understand, executing this manual workflow across thousands of clusters causes severe configuration drift. Enterprise platform teams use agentic platforms to automate the entire deployment lifecycle.

Mélanie Dallé
Senior Marketing Manager
Qovery
Cloud
AWS
Kubernetes
8
 minutes
10 best practices for optimizing Kubernetes on AWS

Optimizing Kubernetes on AWS is less about raw compute and more about surviving Day-2 operations. A standard failure mode occurs when teams scale the control plane while ignoring Amazon VPC IP exhaustion. When the cluster autoscaler triggers, nodes provision but pods fail to schedule due to IP depletion. Effective scaling requires network foresight before compute allocation.

Morgan Perry
Co-founder
Kubernetes
Terraform
 minutes
Managing Kubernetes deployment YAML across multi-cloud enterprise fleets

At enterprise scale, managing provider-specific Kubernetes YAML across multiple clouds creates crippling configuration drift and operational toil. By adopting an agentic Kubernetes management platform, infrastructure teams abstract cloud-specific configurations (like ingress controllers and storage classes) into a single, declarative intent that automatically reconciles across 1,000+ clusters.

Mélanie Dallé
Senior Marketing Manager

It’s time to change
the way you manage K8s

Turn Kubernetes into your strategic advantage with Qovery, automating the heavy lifting while you stay in control.