No items found.

minutes

How to migrate your PostHog self-hosted to PostHog Cloud

PostHog is an open-source product analytics tool that we use at Qovery to improve the developer experience. PostHog is similar to famous proprietary product analytics tools like Mixpanel, Heap, Amplitude.

Romaric Philogène

CEO & Co-founder

Summary

At Qovery, we were using PostHog self-hosted for 8 months in production. It was running nicely on Qovery (yes, we deployed PostHog with Qovery 😎 #eatYourOwnDogFood), but we decided to move to the PostHog Cloud. Here are the two main reasons why we decided to make this move:

If you are not yet using it, give it a try with Qovery or PostHog Cloud version.

To support the PostHog project: Because we love their product, keeping them by using their Cloud version makes complete sense to us.
Stay focused on our business: Using the self-hosted version of PostHog requires you to spend time to maintain it. Meaning, you have to handle the upgrade yourself and make sure the service is up and running all day long.

How to migrate

Looking at their documentation, there is no guide to migrate from PostHog self-hosted to their Cloud version. I asked them the procedure on Slack, and Paolo from the PostHog team responded that it should not be too complicated to transfer the data by fetching the data from the PostHog source and pushing them to the PostHog destination via the web API.

Conversation with Paolo from Posthog on Slack

So the idea was to make a Python script to fetch the data from our self-hosted PostHog instance and forward the data to the PostHog Cloud version.

(Self-hosted PostHog) <--[fetch event data via web API]-- Python Script --[send event data via web API]--> (PostHog Cloud)

Before migrating

As I know that we have more than one million events to send, I notified the PostHog support team that we will migrate our self-hosted version. Just in case they have to adjust their infrastructure. Who knows :) Paolo (once again, he is everywhere 🙂) responded to me and was super proactive.

I encourage you always to keep informed the support team of service when you are about to migrate. They can help you. It was the case here with the PostHog support team.

We can migrate now

To migrate, I made a simple Python script using no external dependencies. Only HTTP requests, and that's it. Note: This script is compatible with Python 3.6+.

#!/usr/bin/env python
import copy
import time
import uuid
from urllib.parse import urlparse

import requests

# Your source PostHog instance
source_posthog_scheme_and_host = 'https://posthog.your-domain.tld'

# Generate a personal API key to read the data from your source PostHog instance
source_api_key = 'xxx'

# Your project API key provided by PostHog in your project settings
dest_project_api_key = 'xxx'


def is_valid(key: str, data: dict) -> bool:
    """
    Helper function to check if the value is empty or none
    :param key: data key to check
    :param value: dict
    :return: true if valid (not empty, None), false otherwise
    """
    if key not in data or not data[key] or str(data[key]).strip().lower() == 'none':
        return False

    return True


def clean_source_data(results: [dict]) -> [dict]:
    """
    Clean up data from PostHog source
    Note: this function do not mutate `results`
    :param results:
    :return:
    """
    _results = []
    if not results:
        return _results

    distinct_id = 'distinct_id'

    for result in results:
        data = copy.deepcopy(result)
        del data['id']

        if not is_valid(distinct_id, data):
            data[distinct_id] = str(uuid.uuid4())

        if 'properties' in data:
            properties = data['properties']
            if not is_valid(distinct_id, properties):
                # copy distinct_id value from the parent object
                properties[distinct_id] = data[distinct_id]

        _results.append(data)

    return _results


def capture(data: [dict], count: int = 0):
    """
    Function to send data to the dest PostHog instance
    :param data:
    :param count:
    :return:
    """
    res = requests.post('https://app.posthog.com/capture', json={'api_key': dest_project_api_key, 'batch': data},
                        headers={'Content-type': 'application/json'})

    if res.status_code != 200:
        if count >= 100:
            print('retry exceeded')
            exit(1)
        time.sleep(3)
        print('Retry sending data (status code: {}) to dest PostHog with data {}'.format(res.status_code, data))
        return capture(data, count + 1)

    return res


def get_source_data() -> dict:
    headers = {'Authorization': 'Bearer {}'.format(source_api_key)}

    url = '{}/api/event'.format(source_posthog_scheme_and_host)

    with open('posthog_completed_urls', 'a') as f:
        while 1:
            query = urlparse(url).query
            if query:
                # PostHog return weird next URL with tons of 'before' params
                url = '{}/api/event?before={}'.format(source_posthog_scheme_and_host, query.split('=')[-1])

            res = requests.get(url, headers=headers)
            if res.status_code == 200:
                j_res = res.json()
                _data = clean_source_data(j_res['results'])

                yield _data

                f.write(url + '\n')
                url = j_res['next']
            else:
                print('Retry fetching events (status code: {}) from PostHog source with URL {}'.format(res.status_code, url))
                time.sleep(3)


if __name__ == '__main__':
    for data in get_source_data():
        capture(data)
        print('{} lines migrated'.format(len(data)))

    print('ok')

Before running the migration script, I strongly recommend making your apps send the data to the PostHog Cloud instance before and waiting for the old instance to stop receiving new events.

Now it is time to:

Copy this script.
Change the value of the variables source_posthog_scheme_and_host, source_api_key, dest_project_api_key
Run the script and wait until it is done 👌

Wrapping up

The migration went smoothly and took one day because we had more than 1 million events. We are super excited to use PostHog Cloud. It is fast and efficient for improving the developer experience on Qovery. Any questions about our usage of PostHog? Join our Discord to chat about it.

Resources:

PostHog: open-source product analytics tool.
Qovery: the simplest way to deploy your apps on AWS.

Share on :

Ready to rethink the way you do DevOps?

Qovery is a DevOps automation platform that enables organizations to deliver faster and focus on creating great products.

Book a demo

Suggested articles

DevOps

minutes

December 9, 2025

TPUs vs. GPUs: The DevOps Guide to AI Hardware Selection

Stop guessing on AI hardware. This DevOps guide details when to use TPUs vs. GPUs for optimal performance, cost, and framework compatibility in MLOps.

Mélanie Dallé

Senior Marketing Manager

Cloud

Business

minutes

December 8, 2025

The DevOps Guide to Docker Monitoring: Tools, Best Practices, and Unified Observability

Stop tool sprawl. Compare top Docker monitoring tools (Prometheus, Datadog, Qovery) and learn how unified observability simplifies K8s debugging and speeds up feature delivery.

Romaric Philogène

CEO & Co-founder

Cloud

Heroku

Internal Developer Platform

Platform Engineering

minutes

December 5, 2025

The Top 8 Tools to Build a Zero-Toil PaaS on Your Cloud

Stop managing K8s complexity. Discover the top 8 platform tools (Qovery, Rancher, Dokku) that let you build a customizable, zero-maintenance PaaS on your cloud.

Morgan Perry

Co-founder

Kubernetes

minutes

December 4, 2025

How to Deploy a Docker Container on Kubernetes: Step-by-Step Guide

Simplify Kubernetes Deployment. Learn the difficult 6-step manual process for deploying Docker containers to Kubernetes, the friction of YAML and kubectl, and how platform tools like Qovery automate the entire workflow.

Mélanie Dallé

Senior Marketing Manager

Observability

DevOps

minutes

December 4, 2025

Observability in DevOps: What is it, Observe vs. Monitoring, Benefits

Observability in DevOps: Diagnose system failures faster. Learn how true observability differs from traditional monitoring. End context-switching, reduce MTTR, and resolve unforeseen issues quickly.

Mélanie Dallé

Senior Marketing Manager

DevOps

Cloud

minutes

December 4, 2025

6 Best Practices to Automate DevSecOps in Days, Not Months

Integrate security seamlessly into your CI/CD pipeline. Learn the 6 best DevSecOps practices—from Policy as Code to continuous monitoring—and see how Qovery automates compliance and protection without slowing development.

Morgan Perry

Co-founder

Heroku

minutes

December 4, 2025

Top 10 Heroku Alternatives: When Simplicity Hits the Scaling Wall

Escape rising Heroku costs & outages. Compare top alternatives that deliver PaaS simplicity on your own cloud and scale without limits.

Mélanie Dallé

Senior Marketing Manager

Product

Infrastructure Management

Deployment

minutes

December 1, 2025

Stop tool sprawl - Welcome to Terraform/OpenTofu support

Provisioning cloud resources shouldn’t require a second stack of tools. With Qovery’s new Terraform and OpenTofu support, you can now define and deploy your infrastructure right alongside your applications. Declaratively, securely, and in one place. No external runners. No glue code. No tool sprawl.

Alessandro Carrano

Head of Product

It’s time to rethink the way you do DevOps

Say goodbye to DevOps overhead. Qovery makes infrastructure effortless, giving you full control without the trouble.

Book a demo Get Qovery free

How to migrate your PostHog self-hosted to PostHog Cloud

How to migrate

Before migrating

We can migrate now

Wrapping up

Suggested articles

It’s time to rethink the way you do DevOps

It’s time to rethink the way you do DevOps