
GPU workloads on EKS just got way simpler with Qovery

Running GPU workloads on EKS has never been easy, until now. With Qovery’s latest update, you can enable GPU nodes, configure GPU access, and optimize costs automatically, all without writing a single line of YAML or touching Helm charts. Qovery now handles everything behind the scenes so you can focus entirely on your applications.
Alessandro Carrano
Head of Product

Key Points:

  • Qovery radically simplifies GPU provisioning on EKS: It automates a process that previously involved manually defining node pools, writing YAML, installing NVIDIA plugins, and modifying application manifests, reducing it to a few simple steps and eliminating significant DevOps overhead.
  • Controlled and cost-optimized GPU access: Users can easily enable GPU node pools, select instance types (mixing On-Demand and Spot instances for cost/performance), define cluster-wide limits, and specify the GPU needs per application.
  • Automatic setup and optimization: Qovery automatically handles the technical backend, including installing and configuring the NVIDIA Kubernetes Device Plugin and ensuring cost efficiency by selecting the best instance type combinations through its Karpenter implementation.

Run GPU-powered applications on an EKS cluster in minutes

Whether you’re training models, running inference pipelines, or powering compute-heavy workloads, Qovery makes GPU provisioning on an EKS cluster as simple as this:

  • Enable GPUs on your cluster:
    • Enable GPU node pools in your cluster settings.
    • Choose the instance types you need.
    • Mix On-Demand and Spot instances for the best balance of cost and performance.
    • Define limits on the number of GPUs your cluster can use overall.
  • Define GPU needs per application: specify how many GPUs your app requires. Qovery handles provisioning, scheduling, and placement for you.
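
To see how a cluster-wide GPU limit relates to per-application GPU needs, here is a minimal Python sketch. It is not Qovery's implementation: the function name `fits_gpu_budget` and the application names are hypothetical, and it simply assumes that the sum of per-application requests is compared against the cluster limit.

```python
from typing import Dict, Tuple


def fits_gpu_budget(cluster_gpu_limit: int,
                    app_gpu_requests: Dict[str, int]) -> Tuple[bool, int]:
    """Return (fits, remaining) for a set of per-application GPU requests.

    Assumption: the cluster-wide limit caps the total number of GPUs
    requested by all applications together.
    """
    total_requested = sum(app_gpu_requests.values())
    remaining = cluster_gpu_limit - total_requested
    return remaining >= 0, remaining


if __name__ == "__main__":
    # Hypothetical applications and GPU counts, for illustration only.
    requests = {"trainer": 4, "inference-api": 2}
    print(fits_gpu_budget(8, requests))
```

A negative `remaining` value tells you how many GPUs the requests exceed the limit by, which is a useful number to have before raising the limit or trimming a request.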

Qovery's Provision (our Kubernetes deployment platform) automatically takes care of:

  • NVIDIA plugin setup: Qovery installs and configures the NVIDIA Kubernetes Device Plugin automatically, so your applications can access GPUs right away.
  • Cost optimization: Qovery selects the most cost-effective instance type combination based on your workload’s GPU requirements, using its Karpenter implementation.
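
As a rough intuition for what "most cost-effective" means here, the toy Python sketch below picks the cheapest option that satisfies a GPU requirement. It is not Karpenter's or Qovery's actual algorithm, which also weighs factors such as capacity and bin-packing. The instance names, GPU counts, and prices are hypothetical placeholders, not real AWS data.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class InstanceOption:
    name: str            # hypothetical instance type name
    gpus: int            # GPUs available on one instance
    hourly_price: float  # placeholder price, not real pricing
    spot: bool           # True for Spot, False for On-Demand


# Placeholder catalogue, for illustration only.
OPTIONS: List[InstanceOption] = [
    InstanceOption("gpu-small-on-demand", gpus=1, hourly_price=1.00, spot=False),
    InstanceOption("gpu-small-spot", gpus=1, hourly_price=0.40, spot=True),
    InstanceOption("gpu-large-on-demand", gpus=4, hourly_price=4.00, spot=False),
    InstanceOption("gpu-large-spot", gpus=4, hourly_price=1.50, spot=True),
]


def cheapest_fit(options: List[InstanceOption], gpus_needed: int,
                 allow_spot: bool = True) -> Optional[InstanceOption]:
    """Return the cheapest single instance with enough GPUs, or None."""
    candidates = [
        o for o in options
        if o.gpus >= gpus_needed and (allow_spot or not o.spot)
    ]
    return min(candidates, key=lambda o: o.hourly_price, default=None)


if __name__ == "__main__":
    print(cheapest_fit(OPTIONS, gpus_needed=2))
    print(cheapest_fit(OPTIONS, gpus_needed=2, allow_spot=False))
```

Disabling Spot in this sketch narrows the candidates to On-Demand options, which mirrors the cost versus availability trade-off of mixing the two.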

What it used to look like (before Qovery)

Before this release, setting up GPU workloads on Kubernetes meant doing everything yourself:

  1. Manually define and deploy GPU node pools
    • Create YAML or CLI definitions for GPU-capable nodes, labels, taints, and autoscaling.
    • Configure Spot instance handling and scaling behavior.
  2. Install NVIDIA components
    • Add the NVIDIA Helm charts for the device plugin and drivers.
    • Manage chart values, version compatibility, and updates across clusters.
  3. Modify application manifests
    • Add resources.requests/limits for nvidia.com/gpu.
    • Set node selectors, tolerations, and affinities for GPU nodes.
    • Tune and redeploy Helm charts for GPU access.
  4. Maintain and optimize over time
    • Monitor GPU utilization and keep costs under control.
    • Update plugins and drivers as Kubernetes versions evolve.
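
To make step 3 concrete, here is a minimal Python sketch of the kind of manifest changes involved: a GPU resource request and limit, a node selector, and a toleration. The label key `example.com/gpu-pool`, the taint key, and the image name are assumptions chosen for illustration; your cluster's labels and taints may differ.

```python
import json
from typing import Any, Dict


def gpu_pod_manifest(name: str, image: str, gpus: int,
                     node_label_key: str = "example.com/gpu-pool",  # hypothetical label
                     taint_key: str = "nvidia.com/gpu"              # assumed taint key
                     ) -> Dict[str, Any]:
    """Build a Pod manifest (as a dict) that asks for `gpus` NVIDIA GPUs."""
    gpu_count = str(gpus)
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": name},
        "spec": {
            # Steer the pod onto nodes labelled as GPU nodes.
            "nodeSelector": {node_label_key: "true"},
            # Allow scheduling onto nodes tainted for GPU workloads.
            "tolerations": [
                {"key": taint_key, "operator": "Exists", "effect": "NoSchedule"}
            ],
            "containers": [
                {
                    "name": name,
                    "image": image,
                    "resources": {
                        # For GPUs, requests and limits must be equal when both are set.
                        "requests": {"nvidia.com/gpu": gpu_count},
                        "limits": {"nvidia.com/gpu": gpu_count},
                    },
                }
            ],
        },
    }


if __name__ == "__main__":
    # Hypothetical app name and image; the JSON output can be saved and applied with kubectl.
    print(json.dumps(gpu_pod_manifest("trainer", "example.com/trainer:latest", 1), indent=2))
```

Every GPU-enabled application needs these fields kept in sync with the node pool's labels and taints, which is part of the overhead described above.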

Now, Qovery does all that for you.

Why this matters

GPU workloads are core to modern applications, but Kubernetes wasn’t built to make GPU management simple.

Qovery bridges that gap by abstracting away the complexity. In just a few clicks, your applications can access powerful GPUs, without the DevOps overhead.

Get started

You can start using GPU node provisioning today.

Check out our documentation to learn how to enable GPU support for your clusters and applications.
