Sync data from GCP to Apache Arrow Flight
CloudQuery is the simple, fast data integration platform that can fetch your data from GCP APIs and load it into Apache Arrow Flight
Trusted by
Self-hosted
Start locally, then deploy to a Virtual Machine, Kubernetes, or anywhere else. Full instructions on CLI setup are available in our documentation.
Cloud-hosted
Start syncing in a few clicks. No need to deploy your own infrastructure.
Fast and reliable
CloudQuery’s efficient design means our syncs are fast and a sync from GCP to Apache Arrow Flight can be completed in a fraction of the time compared to other tools.
Easy to use, easy to maintain
GCP syncing using CloudQuery is easy to set up and maintain thanks to its simple YAML configuration. Once synced, you can use normal SQL queries to work with your data.
A huge library of supported destinations
Apache Arrow Flight isn’t the only place we can sync your GCP data to. Whatever you need to do with your GCP data, CloudQuery can make it happen. We support a huge range of destinations, customizable transformations for ETL, and we regularly release new plugins.
Extensible and Open Source SDK
Write your own connectors in any language by utilizing the CloudQuery open source SDK powered by Apache Arrow. Get out-of-the-box scheduling, rate-limiting, transformation, documentation and much more.
Step by step guide for how to export data from GCP to Apache Arrow Flight
Table of Contents
MacOS Setup
Step 1: Install CloudQuery
To install CloudQuery, run the following command in your terminal:
brew install cloudquery/tap/cloudquery
Step 2: Create a Configuration File
Next, run the following command to initialize a sync configuration file for GCP to Apache Arrow Flight:
cloudquery init --source=gcp --destination=arrowflight
This will generate a config file named gcp_to_arrowflight.yaml. Follow the instructions to fill out the necessary fields to authenticate against your own environment.
Step 3: Log in to CloudQuery CLI
Next, log in to the CloudQuery CLI. If you have't already, you can sign up for a free account as part of this step:
cloudquery login
Step 4: Run a Sync
cloudquery sync gcp_to_arrowflight.yaml
This will start syncing data from the GCP API to your Apache Arrow Flight database! 🚀
See the CloudQuery documentation portal for more deployment guides, options and further tips.
FAQs
What is CloudQuery?
Why does CloudQuery require login?
What data does CloudQuery have access to?
How is CloudQuery priced?
Is there a free version of CloudQuery?
What credentials are required to sync from GCP to Apache Arrow Flight?
Can I restrict the sync to an individual project within my GCP environment?
project_ids
field. If you leave this field blank, CloudQuery will use all projects that it has been granted access to by your chosen authentication method.Can I use wildcards when selecting which projects to sync from?
project_filter
field.