Export from MongoDB Atlas to Google Cloud Storage
CloudQuery is an open-source data integration platform that allows you to export data from any source to any destination.
The CloudQuery MongoDB Atlas plugin allows you to sync data from MongoDB Atlas to any destination, including Google Cloud Storage. It takes only minutes to get started.
MongoDB Atlas
The CloudQuery MongoDB Atlas plugin extracts information from your MongoDB Atlas API and loads it into any supported CloudQuery destination
cloudquery
v4.0.1
Source
Google Cloud Storage
This destination plugin lets you sync data from a CloudQuery source to remote GCS (Google Cloud Storage) storage in various formats such as CSV, JSON and Parquet
Table of Contents
MacOS Setup
Step 1. Install CloudQuery
brew install cloudquery/tap/cloudquery
Step 2. Log in to CloudQuery CLI
cloudquery login
Step 3. Configure MongoDB Atlas source plugin
You can find more information about the configuration in the plugin documentation
kind: source
# Common source-plugin configuration
spec:
name: mongodbatlas
path: cloudquery/mongodbatlas
registry: cloudquery
version: "v4.0.1"
tables: ["*"]
destinations: ["v5.2.2"]
# Plugin specific configuration
spec:
api_key: ${MONGODB_ATLAS_PUBLIC_KEY}
api_secret: ${MONGODB_ATLAS_PRIVATE_KEY}
# optional parameters
# base_url: https://cloud.mongodb.com
Step 4. Configure Google Cloud Storage destination plugin
You can find more information about the configuration in the plugin documentation
kind: destination
spec:
name: "gcs"
path: "cloudquery/gcs"
registry: "cloudquery"
version: "v5.2.2"
write_mode: "append"
spec:
bucket: "bucket_name"
path: "path/to/files/{{TABLE}}/{{UUID}}.{{FORMAT}}"
format: "parquet" # options: parquet, json, csv
format_spec:
# CSV specific parameters:
# delimiter: ","
# skip_header: false
# Parquet specific parameters:
# version: "v2Latest"
# root_repetition: "repeated"
# Optional parameters
# compression: "" # options: gzip
# no_rotate: false
# batch_size: 10000
# batch_size_bytes: 52428800 # 50 MiB
# batch_timeout: 30s
Step 5. Run Sync
cloudquery sync mongodbatlas.yml gcs.yml