Sync data from Kubernetes to Databricks
CloudQuery is the simple, fast data integration platform that can fetch your data from Kubernetes APIs and load it into Databricks
Trusted by
Self-hosted
Start locally, then deploy to a Virtual Machine, Kubernetes, or anywhere else. Full instructions on CLI setup are available in our documentation.
Cloud-hosted
Start syncing in a few clicks. No need to deploy your own infrastructure.
Fast and reliable
CloudQuery’s efficient design means our syncs are fast and a sync from Kubernetes to Databricks can be completed in a fraction of the time compared to other tools.
Easy to use, easy to maintain
Kubernetes syncing using CloudQuery is easy to set up and maintain thanks to its simple YAML configuration. Once synced, you can use normal SQL queries to work with your data.
A huge library of supported destinations
Databricks isn’t the only place we can sync your Kubernetes data to. Whatever you need to do with your Kubernetes data, CloudQuery can make it happen. We support a huge range of destinations, customizable transformations for ETL, and we regularly release new plugins.
Extensible and Open Source SDK
Write your own connectors in any language by utilizing the CloudQuery open source SDK powered by Apache Arrow. Get out-of-the-box scheduling, rate-limiting, transformation, documentation and much more.
Step by step guide for how to export data from Kubernetes to Databricks
Table of Contents
MacOS Setup
Step 1: Install CloudQuery
To install CloudQuery, run the following command in your terminal:
brew install cloudquery/tap/cloudquery
Step 2: Create a Configuration File
Next, run the following command to initialize a sync configuration file for Kubernetes to Databricks:
cloudquery init --source=k8s --destination=databricks
This will generate a config file named k8s_to_databricks.yaml. Follow the instructions to fill out the necessary fields to authenticate against your own environment.
Step 3: Log in to CloudQuery CLI
Next, log in to the CloudQuery CLI. If you have't already, you can sign up for a free account as part of this step:
cloudquery login
Step 4: Run a Sync
cloudquery sync k8s_to_databricks.yaml
This will start syncing data from the Kubernetes API to your Databricks database! 🚀
See the CloudQuery documentation portal for more deployment guides, options and further tips.