Skip to content

Introduction

Python

Class

The dbtCloudClient class is the main interface through which you will interact with dbt Cloud API endpoints. The class accepts three optional arguments:

  • api_key
  • service_token
  • host

An api_key can be used to access endpoints from any version of the dbt Cloud API (v2 or v3). The service_token can be used for either the dbt Cloud API or the Metadata API. If you have the proper permissions, you would only need to pass a service token.

from dbtc import dbtCloudClient

client = dbtCloudClient(service_token='this-is-my-service-token')

Alternatively, you can set the following environment variables in place of passing the arguments to the class:

  • api_key --> DBT_CLOUD_API_KEY
  • service_token --> DBT_CLOUD_SERVICE_TOKEN
  • host --> DBT_CLOUD_HOST

If you have set environment variables, and have the proper permissions, you'll be able to instantiate the dbtCloudClient class as follows:

from dbtc import dbtCloudClient

client = dbtCloudClient()

Info

The host argument is only necessary for customers using dbt Cloud outside of the North America multi-tenant instance (see more info on dbt Cloud hosting)

Interfaces

The dbtCloudClient class contains two properties:

  • cloud - instance of the _AdminClient class, which contains methods to create, read, update, and delete dbt Cloud resources
  • metadata - instance of the _MetadataClient class, which contains methods to retrieve metadata generated from a dbt Cloud job run
  • sl - instance of the _SemanticLayerClient class, which contains methods to interact with dbt Cloud's Semantic Layer via a GraphQL API

cloud

from dbtc import dbtCloudClient

# Assuming we've set the `DBT_CLOUD_SERVICET_TOKEN` environment variable`
client = dbtCloudClient()

accounts = client.cloud.list_accounts()

metadata

from dbtc import dbtCloudClient

client = dbtCloudClient()
query = '''
query ($environmentId: BigInt!, $first: Int!) {
  environment(id: $environmentId) {
    definition {
      metrics(first: $first) {
        edges {
          node {
            name
            description
            type
            formula
            filter
            tags
            parents {
              name
              resourceType
            }
          }
        }
      }
    }
  }
}
'''
variables = {'environmentId': 1, 'first': 500}
data = client.metadata.query(query, variables)

# Data will be in the edges key, which will be a list of nodes
nodes = data['data']['definition']['metrics']['edges']
for node in nodes:
    # node is a dictionary
    node_name = node['name']
    ...

sl

from dbtc import dbtCloudClient

client = dbtCloudClient(environment_id=1)

query_result = client.sl.query(
    metrics=['total_revenue', 'total_profit', 'total_customers'],
    group_by=['customer__region', 'customer__nation', 'metric_time__quarter'],
)

# As a default the `result` attribute will be a pandas.DataFrame
query_result.result

# Also view the SQL
query_result.sql

# Or do something based on the status of the query
if query_result.status == 'SUCCESSFUL':
    ...

CLI

This package also comes with a command-line utility, dbtc. All of the methods available through the cloud or metadata properties on the dbtCloudClient class are available through the command line as well.

The command line interface also accepts additional environment variables:

  • DBT_CLOUD_ACCOUNT_ID
  • DBT_CLOUD_PROJECT_ID

Setting these will reduce the amount of arguments you'll need to pass.

dbtc get-project --account-id=1 --project-id=1

Or, if you've set the DBT_CLOUD_ACCOUNT_ID and DBT_CLOUD_PROJECT_ID environment variables.

dbtc get-project