Sync data from GCP to Databricks

CloudQuery is the simple, fast data integration platform that can fetch your data from GCP APIs and load it into Databricks

Trusted by

https://cdn.cloudquery.io/hub/7msht52er/_next/static/media/zendesk.b9ccef9b.svg

https://cdn.cloudquery.io/hub/7msht52er/_next/static/media/palo_alto_networks.e10e2208.svg

https://cdn.cloudquery.io/hub/7msht52er/_next/static/media/instructure.d7553cf6.svg

https://cdn.cloudquery.io/hub/7msht52er/_next/static/media/ridgeline.40e37703.svg

Enterprise Ready

Customize & Extend

Query Assets with SQL

Non-invasive account access for better security and efficiency.

Import data with CloudQuery SDKs and build your own plugins.

Query cloud assets and security with a simple SQL-based UI.

Step by step guide for how to export data from GCP to Databricks

MacOS Windows Linux

MacOS Setup

Step 1: Install CloudQuery

To install CloudQuery, run the following command in your terminal:

brew install cloudquery/tap/cloudquery

Next, log in to the CloudQuery CLI. If you have't already, you can sign up for a free account as part of this step:

cloudquery login

Step 3: Create a Configuration File

Next, run the following command to initialize a sync configuration file for GCP to Databricks:

cloudquery init --source=gcp --destination=databricks

This will generate a config file named gcp_to_databricks.yaml. Follow the instructions to fill out the necessary fields to authenticate against your own environment.

Step 4: Run a Sync

cloudquery sync gcp_to_databricks.yaml

This will start syncing data from the GCP API to your Databricks database! 🚀

See the CloudQuery documentation portal for more deployment guides, options and further tips.

FAQs

What is CloudQuery?

CloudQuery is the operating platform for modern cloud infrastructure. It gives platform engineering and cloud operations teams a single control plane for cloud visibility, SQL-based policies, and intelligent automation across their entire cloud estate.

Why does CloudQuery require login?

Logging in allows CloudQuery to authenticate your access to the CloudQuery Hub and monitor usage for billing purposes. Data synced with CloudQuery remains private to your environment and is not shared with our servers or any third parties.

What data does CloudQuery have access to?

CloudQuery accesses only the metadata and configurations of your cloud resources that you specify without touching sensitive data or workloads.

How is CloudQuery priced?

CloudQuery offers flexible pricing based on the number of cloud accounts and usage. Visit our pricing page for detailed plans.

Is there a free version of CloudQuery?

Yes, CloudQuery offers a free plan that includes basic features, perfect for smaller teams or personal use. More details can be found on our pricing page.

What credentials are required to sync from GCP to Databricks?

CloudQuery uses application default credentials in order to sync from GCP to Databricks. The best option to use depends on the environment in which your sync is running, whether it is local or cloud based and your own preferences. Full details can be found in the GCP documentation.

Can I restrict the sync to an individual project within my GCP environment?

Yes, if you only want CloudQuery to use a particular project when syncing to Databricks, you can specify this in the project_ids field. If you leave this field blank, CloudQuery will use all projects that it has been granted access to by your chosen authentication method.

Can I use wildcards when selecting which projects to sync from?

Yes, CloudQuery supports wildcards when searching for projects within your GCP environment. For example, if you want to select all projects which begin with data, you would specify data* in the project_filter field.

CloudOps

Sync data from GCP to Databricks

Trusted by

Enterprise Ready

Customize & Extend

Query Assets with SQL

Step by step guide for how to export data from GCP to Databricks

Table of Contents

MacOS Setup

Step 1: Install CloudQuery

Step 3: Create a Configuration File

Step 4: Run a Sync

FAQs

What is CloudQuery?

Why does CloudQuery require login?

What data does CloudQuery have access to?

How is CloudQuery priced?

Is there a free version of CloudQuery?

What credentials are required to sync from GCP to Databricks?

Can I restrict the sync to an individual project within my GCP environment?

Can I use wildcards when selecting which projects to sync from?

CloudOps

Sync data from GCP to Databricks

Trusted by

Enterprise Ready

Customize & Extend

Query Assets with SQL

Step by step guide for how to export data from GCP to Databricks

Table of Contents

MacOS Setup

Step 1: Install CloudQuery

Step 2: Log in to CloudQuery CLI

Step 3: Create a Configuration File

Step 4: Run a Sync

FAQs

What is CloudQuery?

Why does CloudQuery require login?

What data does CloudQuery have access to?

How is CloudQuery priced?

Is there a free version of CloudQuery?

What credentials are required to sync from GCP to Databricks?

Can I restrict the sync to an individual project within my GCP environment?

Can I use wildcards when selecting which projects to sync from?