Back to source plugin

Sync data from AWS to S3

CloudQuery is the simple, fast data integration platform that can fetch your data from AWS APIs and load it into S3
AWS
S3

Trusted by

Self-hosted

Start locally, then deploy to a Virtual Machine, Kubernetes, or anywhere else. Full instructions on CLI setup are available in our documentation.

Cloud-hosted

Start syncing in a few clicks. No need to deploy your own infrastructure.

Fast and reliable

CloudQuery’s efficient design means our syncs are fast and a sync from AWS to S3 can be completed in a fraction of the time compared to other tools.

Easy to use, easy to maintain

AWS syncing using CloudQuery is easy to set up and maintain thanks to its simple YAML configuration. Once synced, you can use normal SQL queries to work with your data.

A huge library of supported destinations

S3 isn’t the only place we can sync your AWS data to. Whatever you need to do with your AWS data, CloudQuery can make it happen. We support a huge range of destinations, customizable transformations for ETL, and we regularly release new plugins.

Extensible and Open Source SDK

Write your own connectors in any language by utilizing the CloudQuery open source SDK powered by Apache Arrow. Get out-of-the-box scheduling, rate-limiting, transformation, documentation and much more.

Step by step guide for how to export data from AWS to S3

MacOS Setup

Step 1: Install CloudQuery

To install CloudQuery, run the following command in your terminal:

brew install cloudquery/tap/cloudquery

Step 2: Create a Configuration File

Next, run the following command to initialize a sync configuration file for AWS to S3:

cloudquery init --source=aws --destination=s3

This will generate a config file named aws_to_s3.yaml. Follow the instructions to fill out the necessary fields to authenticate against your own environment.

Step 3: Log in to CloudQuery CLI

Next, log in to the CloudQuery CLI. If you have't already, you can sign up for a free account as part of this step:

cloudquery login

Step 4: Run a Sync

cloudquery sync aws_to_s3.yaml

This will start syncing data from the AWS API to your S3 environment! 🚀

See the CloudQuery documentation portal for more deployment guides, options and further tips.

FAQs

What is CloudQuery?
CloudQuery is an open-source tool that helps you extract, transform, and load cloud asset data from various sources into databases for security, compliance, and visibility.
Why does CloudQuery require login?
Logging in allows CloudQuery to authenticate your access to the CloudQuery Hub and monitor usage for billing purposes. Data synced with CloudQuery remains private to your environment and is not shared with our servers or any third parties.
What data does CloudQuery have access to?
CloudQuery accesses only the metadata and configurations of your cloud resources that you specify without touching sensitive data or workloads.
How is CloudQuery priced?
CloudQuery offers flexible pricing based on the number of cloud accounts and usage. Visit our pricing page for detailed plans.
Is there a free version of CloudQuery?
Yes, CloudQuery offers a free plan that includes basic features, perfect for smaller teams or personal use. More details can be found on our pricing page.
What permissions does the AWS integration require to run?
To sync your data to S3 the CloudQuery AWS integration only requires read permissions. The integration will never need to make any changes to your AWS setup, and we recommend that you only grant read permissions.
How will my data be kept secure?
CloudQuery does not have access to any of the data you sync from AWS. We only collect only metadata about usage for billing purposes. The CloudQuery CLI will only have access to the data that you grant it permission to access. You will always be in control of the data that is being synced from AWS to S3 and will have the ability to revoke permissions at any time.
How much does it cost to use CloudQuery to sync from AWS to S3?
Anyone can try CloudQuery completely free of charge for 14 days, regardless of how much data you move from AWS to S3. After your trial ends, you can continue to sync one million rows every month completely free of charge - any further syncs are charged based on a simple pay as you go pricing model per million rows. The more you sync, the less you pay per row.
Which permissions do I need to grant in S3 in order to sync from AWS?
The CloudQuery integration only requires PutObject permissions to sync to the S3 destination and we recommend that you grant only these permissions to CloudQuery in order to keep your setup as secure as possible. CloudQuery does not require access to your S3 cloud configuration settings, only permission to add objects to your S3 buckets.
Can CloudQuery read from my credentials and config S3 files?
Yes, if you want CloudQuery to use these credentials, then the files need to be located in the .aws directory of your home folder. The two files are almost identical in format but if there is a conflict, CloudQuery will prioritise the credential information that it reads from the credentials file over those found in config.
What formats can CloudQuery load from AWS to an S3 destination?
CloudQuery can load information in your choice of csv, parquet or json. This is specified in the format field.
Join our mailing list

Subscribe to our newsletter to make sure you don't miss any updates.

Legal

© 2024 CloudQuery, Inc. All rights reserved.

We use tracking cookies to understand how you use the product and help us improve it. Please accept cookies to help us improve. You can always opt out later via the link in the footer.