Workloads are the primary way Amberflo tracks AI usage and spend. They represent the individual entities that consume AI resources and serve as the foundation for attribution, governance, and access control.

A workload can represent almost any consumer of LLMs, including:

A specific environment such as development, QA, or production

For example, you might create separate workloads for:

This level of separation allows you to understand usage patterns and cost at a much finer granularity.

Workloads serve two critical purposes in Amberflo:

All usage and cost flowing through the gateway is attributed to a workload. This makes it easy to understand who is consuming AI resources and how much they are using.

Each workload explicitly defines which models it is allowed to access. This allows you to enforce guardrails and prevent unintended usage of specific models.

Workloads sit between models and access keys. Models define what is available.

Go to Access Management in the left-hand navigation.

This page displays all previously created workloads, including:

This list represents all active consumers that can be granted access to LLMs through the gateway.

The name is a human-readable label designed to help you easily identify what the workload represents.

As you type the name, Amberflo automatically generates a workload ID. The ID must:

Use only letters, numbers, underscores, or hyphens

You can edit the ID if needed, as long as it follows these rules.

After setting the name and ID, you will see a list of available models. This list includes all models that have been configured in the gateway.

Select the models that this workload is allowed to access. You can choose one, many, or all available models depending on your use case.

This selection enforces model-level access control for the workload.

Once you have selected the models, click Create Workload.

The workload will appear in the workloads list and is now ready to be used for access and attribution.

Creating a workload does not automatically grant access to applications. Workloads define what is allowed, not how access happens.

The next step is to create access keys for a workload. Access keys are what applications use to authenticate with the gateway. Every request made with a key is automatically attributed to the associated workload.

After creating a workload, continue to the Access Keys section to learn how to generate keys that allow applications to access models through the gateway while preserving full attribution and governance.

LLM Models

AI Workloads

Virtual Keys

Amberflo Docs

What is Amberflo?