Connector Details

Platform: Azure Blob Storage
Auth Type: API Keys
Direction: Bidirectional
Tap Repo: https://github.com/hotgluexyz/tap-blob-storage
Target Repo: https://github.com/hotgluexyz/target-blob-storage

Credentials Setup

Follow the steps below to get the credentials you need to use the Azure Blob Storage connector.

How to get your Blob Storage credentials

The Blob Storage connector requires only a connection string to connect to your storage account.

To find your connection string, log in to the Azure Portal and navigate to your Storage Accounts dashboard.

Next, select your Storage Account, and navigate to Security + Networking > Access Keys and copy one of your connection strings.

You’ll then paste this connection string directly into the Hotglue dashboard or into a config object, depending on how you are linking your connector.
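The connection string itself is a semicolon-delimited list of key=value pairs (AccountName, AccountKey, and so on). As a sanity check before pasting it in, you can parse it with a short sketch like this (the sample account name and key below are placeholders, not real credentials):

```python
def parse_connection_string(conn_str: str) -> dict:
    """Split an Azure connection string into its key/value parts."""
    parts = {}
    for segment in conn_str.split(";"):
        if not segment:
            continue
        # Partition on the first "=" only; AccountKey is base64 and may end in "="
        key, _, value = segment.partition("=")
        parts[key] = value
    return parts

sample = (
    "DefaultEndpointsProtocol=https;"
    "AccountName=mystorageaccount;"
    "AccountKey=abc123==;"
    "EndpointSuffix=core.windows.net"
)

parsed = parse_connection_string(sample)
print(parsed["AccountName"])  # mystorageaccount
```

If a key like AccountName or AccountKey is missing from the parsed result, you have likely copied an access key rather than the full connection string.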

Target Blob Storage

Config

Along with the connect_string parameter, specify the following fields when connecting:

{
    "connect_string": "...",
    "container": "...", // Container name to write to
    "path_prefix": "...", // Directory to insert files into
    "overwrite": true // Whether to overwrite existing files (defaults to false)
}
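If you are configuring the target outside the Hotglue dashboard, the config above is plain JSON, so it can be generated from a script. A minimal sketch (the container name and path prefix are placeholder values):

```python
import json

# Placeholder values; substitute your own connection string, container, and prefix
config = {
    "connect_string": "DefaultEndpointsProtocol=https;AccountName=...;AccountKey=...;EndpointSuffix=core.windows.net",
    "container": "exports",       # container name to write to
    "path_prefix": "etl-output",  # directory to insert files into
    "overwrite": False,           # do not overwrite existing files
}

# Write the config file the target reads
with open("config.json", "w") as f:
    json.dump(config, f, indent=2)
```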

Example ETL Script

import gluestick as gs
import os
import time

# Define standard Hotglue directories
ROOT_DIR = os.environ.get("ROOT_DIR", ".")
INPUT_DIR = f"{ROOT_DIR}/sync-output"
OUTPUT_DIR = f"{ROOT_DIR}/etl-output"


# Read sync output
input = gs.Reader()

# Get tenant id
tenant_id = os.environ.get('USER_ID', os.environ.get('TENANT', 'default'))

# Possible values: parquet, singer, csv, json, jsonl
EXPORT_FORMAT = "parquet"

# Iterate through the streams in the sync output;
# str(input) renders the available stream names as a Python list
for key in eval(str(input)):
    input_df = input.get(key)

    # Include tenant_id as a field if desired
    input_df["tenant"] = tenant_id

    # Create a unique file name
    timestamp = int(time.time()) # Unix timestamp
    file_name = f"{tenant_id}_{key}_{timestamp}"

    # Write tenantid_streamname_timestamp.parquet
    gs.to_export(input_df, file_name, OUTPUT_DIR, export_format=EXPORT_FORMAT)
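The file names produced by the script above follow a tenantid_streamname_timestamp pattern, with the extension added per the chosen export format. A quick stdlib-only illustration (the tenant and stream names are placeholders):

```python
import time

tenant_id = "acme"   # placeholder tenant id
key = "invoices"     # placeholder stream name
timestamp = int(time.time())  # Unix timestamp

# Same pattern as the ETL script above
file_name = f"{tenant_id}_{key}_{timestamp}"
print(file_name)  # e.g. acme_invoices_1700000000
```

Because the timestamp is part of the name, re-running the flow produces new files rather than colliding with earlier exports (unless overwrite is relevant for identical names).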

Target Changelog