Sync data from AWS to Google Cloud Storage

CloudQuery is a simple, fast and extensible data movement platform that allows you to sync data from any source to any destination.

Why CloudQuery?

We took care of everything, so you can do your job easily and efficiently.

Fast and reliable

CloudQuery’s efficient design keeps syncs fast: a sync from AWS to Google Cloud Storage can complete in a fraction of the time other tools take.

Easy to get started, easy to maintain

AWS syncing using CloudQuery is easy to set up and maintain thanks to its simple YAML configuration. Once synced, you can use normal SQL queries to work with your data.

How to sync AWS data to Google Cloud Storage

CloudQuery is the simple, fast data integration platform that can fetch your data from AWS APIs and load it into Google Cloud Storage.

Source: AWS
Destination: Google Cloud Storage

Step 1: Install CloudQuery

Follow the steps below to start syncing data with CloudQuery.

Copy and paste the following command to install CloudQuery with Homebrew:

brew install cloudquery/tap/cloudquery

Sign in with CloudQuery

To sign in from the CLI, run the following command.

cloudquery login

A new browser window will open where you will complete the sign-in process.

Auto-generate sync configuration

Run the following command to create a configuration file:

cloudquery init --source aws --destination gcs --spec-path aws_to_gcs.yaml

Step 2: Additional source and destination configuration (optional)

AWS source plugin configuration

You can find more information about the configuration in the plugin documentation.

# aws.yml
kind: source
spec:
  name: aws
  path: cloudquery/aws
  spec:
    # per documentation at:

Google Cloud Storage destination plugin configuration

You can find more information about the configuration in the plugin documentation.

# gcs.yml
kind: destination
spec:
  name: gcs
  path: cloudquery/gcs
  spec:
    # per documentation at:
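For orientation, a fuller spec might look like the sketch below, which writes one file holding both a source and a destination block. The output file name, table name, bucket name, path template, and version placeholders are assumptions for illustration and do not come from this page; check each plugin's documentation for the exact options and current versions.

```shell
# Sketch: write a combined source + destination spec to an example file.
# Every value marked "assumption" is illustrative and must be replaced.
SPEC_FILE="aws_to_gcs.example.yaml"   # assumption: hypothetical file name

cat > "$SPEC_FILE" <<'EOF'
kind: source
spec:
  name: aws
  path: cloudquery/aws
  version: "vX.Y.Z"               # assumption: placeholder, pin a real release
  tables: ["aws_s3_buckets"]      # assumption: start with a single table
  destinations: ["gcs"]           # must match the destination name below
---
kind: destination
spec:
  name: gcs
  path: cloudquery/gcs
  version: "vX.Y.Z"               # assumption: placeholder, pin a real release
  spec:
    bucket: "my-example-bucket"   # assumption: hypothetical bucket name
    path: "cloudquery/{{TABLE}}/{{UUID}}.{{FORMAT}}"   # assumption
    format: "parquet"
EOF

echo "wrote $SPEC_FILE"
```

Starting with one table keeps the first sync small; the table list can be widened once the output in the bucket looks right.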

Step 3: Run the sync

Copy and paste the following command to trigger the sync:

cloudquery sync aws_to_gcs.yaml
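If the sync runs from a script or a scheduled job, a small wrapper can catch the two most common setup mistakes before the CLI is invoked. The sketch below is one way to do that; run_sync is a hypothetical helper name, not part of CloudQuery.

```shell
# Sketch: preflight checks before running the sync.
# run_sync is a hypothetical helper, not a CloudQuery command.
run_sync() {
  spec="${1:-aws_to_gcs.yaml}"

  # Fail early if the spec file is missing.
  if [ ! -f "$spec" ]; then
    echo "spec file not found: $spec" >&2
    return 1
  fi

  # Fail early if the CLI is not installed or not on PATH.
  if ! command -v cloudquery >/dev/null 2>&1; then
    echo "cloudquery CLI not found on PATH" >&2
    return 1
  fi

  cloudquery sync "$spec"
}

# Usage: run_sync aws_to_gcs.yaml
```

The function returns the exit status of cloudquery sync, so a scheduler can treat a non-zero status as a failed run.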

Frequently asked questions about plugins

Detailed answers are here to help you get started.

Google Cloud Storage FAQ

What file formats can CloudQuery load from AWS to Google Cloud Storage?

CloudQuery can write CSV, JSON and Parquet files to Google Cloud Storage. If you choose CSV, you can set a number of CSV-specific parameters before running the sync, such as the delimiter and whether the header row is skipped.
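As an illustration of the CSV parameters mentioned above, the sketch below writes a destination spec that selects CSV output. The option names format_spec, delimiter and skip_header, along with the file name, bucket name, path template and version placeholder, are assumptions to be checked against the plugin documentation.

```shell
# Sketch: a destination spec that selects CSV output.
# Option names and values marked "assumption" should be verified
# against the Google Cloud Storage plugin documentation.
cat > gcs_csv.example.yaml <<'EOF'
kind: destination
spec:
  name: gcs
  path: cloudquery/gcs
  version: "vX.Y.Z"               # assumption: placeholder, pin a real release
  spec:
    bucket: "my-example-bucket"   # assumption: hypothetical bucket name
    path: "cloudquery/{{TABLE}}/{{UUID}}.{{FORMAT}}"   # assumption
    format: "csv"
    format_spec:
      delimiter: ","              # assumption: field separator
      skip_header: false          # assumption: keep the header row
EOF

echo "wrote gcs_csv.example.yaml"
```

Switching to JSON or Parquet would presumably mean changing format and dropping the CSV-only options.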

The CloudQuery GCS integration authenticates using your Application Default Credentials, and all of the Application Default Credentials options are available. For testing purposes, you can point to your GCS access credentials via the GOOGLE_APPLICATION_CREDENTIALS environment variable, but this is not recommended for long-term usage.
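Before a sync, it can help to see which credential source is likely to be picked up on the local machine. The sketch below checks the GOOGLE_APPLICATION_CREDENTIALS variable first and then the file that gcloud auth application-default login writes on Linux and macOS. check_adc is a hypothetical helper, and it does not detect credentials supplied by an attached service account on Google Cloud infrastructure.

```shell
# Sketch: report which local credential source appears to be configured.
# check_adc is a hypothetical helper; it only inspects the local machine.
check_adc() {
  adc_file="${HOME:-}/.config/gcloud/application_default_credentials.json"

  if [ -n "${GOOGLE_APPLICATION_CREDENTIALS:-}" ]; then
    # The variable is set: it must point at an existing key file.
    if [ -f "$GOOGLE_APPLICATION_CREDENTIALS" ]; then
      echo "key-file"
    else
      echo "missing-key-file"
      return 1
    fi
  elif [ -f "$adc_file" ]; then
    # Created by: gcloud auth application-default login
    echo "gcloud-user-credentials"
  else
    echo "none-found-locally"
  fi
}

# Usage: check_adc
```

A result of none-found-locally is not necessarily an error, since credentials may still come from the environment the sync runs in.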

AWS FAQ

What permissions does the AWS integration require to run?

To sync your data to Google Cloud Storage, the CloudQuery AWS integration requires only read permissions. The integration will never need to make any changes to your AWS setup, and we recommend that you grant read permissions only.
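One way to grant read-only access is to attach the AWS managed ReadOnlyAccess policy to the role used for the sync. The sketch below assumes that policy is an acceptable fit and that the role already exists; attach_read_only and the role name are hypothetical. It prints the command by default and only calls the AWS CLI when DRY_RUN=0.

```shell
# Sketch: attach the AWS managed ReadOnlyAccess policy to an existing role.
# attach_read_only is a hypothetical helper; the choice of policy is an
# assumption, and a narrower custom policy may suit your account better.
attach_read_only() {
  role_name="$1"
  policy_arn="arn:aws:iam::aws:policy/ReadOnlyAccess"

  # Default to a dry run that only prints the command.
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "aws iam attach-role-policy --role-name $role_name --policy-arn $policy_arn"
    return 0
  fi

  aws iam attach-role-policy --role-name "$role_name" --policy-arn "$policy_arn"
}

# Usage (dry run): attach_read_only cloudquery-sync
# Usage (apply):   DRY_RUN=0 attach_read_only cloudquery-sync
```

Detaching the policy later revokes the access, which matches the point that permissions can be withdrawn at any time.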

CloudQuery does not have access to any of the data you sync from AWS. We collect only metadata about usage for billing purposes. The CloudQuery CLI can access only the data that you grant it permission to access. You always remain in control of the data that is synced from AWS to Google Cloud Storage and can revoke permissions at any time.

Anyone can try CloudQuery completely free of charge for 14 days, regardless of how much data you move from AWS to Google Cloud Storage. After your trial ends, you can continue to sync one million rows every month free of charge. Any further syncs are charged under a simple pay-as-you-go pricing model per million rows. The more you sync, the less you pay per row.

A huge library of supported destinations

Google Cloud Storage isn’t the only place we can sync your AWS data to. Whatever you need to do with your AWS data, CloudQuery can make it happen. We support a huge range of destinations, customizable transformations for ETL, and we regularly release new plugins.

Extensible and Open Source SDK

Write your own connectors in any language by utilizing the CloudQuery open source SDK powered by Apache Arrow. Get out-of-the-box scheduling, rate-limiting, transformation, documentation and much more.

Turn cloud chaos into clarity

Find out how CloudQuery can help you get clarity from a chaotic cloud environment with a personalized conversation and demo.

© 2024 CloudQuery, Inc. All rights reserved.
