Use Case & Pricing Models
Billing for LLM tokens
7 min
bill your customers for ai token usage without the operational overhead amberflo tracks consumption per model, keeps prices current across providers, and handles all the metering logic — so you focus on your margin, not the infrastructure set your markup percentage, choose your models, and route calls through amberflo's ai gateway or a supported partner — or self report usage directly from your own integration amberflo syncs live token prices from major providers and configures usage based billing automatically overview say you're building an ai product and want a consistent 30% margin over raw llm token costs — regardless of provider or model instead of building and maintaining your own price tracking and billing logic, amberflo handles it end to end price sync amberflo continuously fetches the latest token prices from all major llm providers so your billing always reflects reality markup configuration set your margin once amberflo configures the underlying usage based billing resources — meters, prices, and rate cards — automatically automatic usage recording usage is captured through the amberflo ai gateway, a supported partner integration, or your own self reported ingest — whichever fits your stack flexible ai pricing models choose the model that fits how your customers buy — or combine multiple approaches into a custom tier 🪙 credit packs & top ups sell prepaid credit bundles that customers apply toward token usage as they go 📦 fixed fee with included usage charge a recurring monthly fee that covers a set allocation of token usage 📊 pure usage based bill customers only for what they consume — no base fee, no commitments 🔀 hybrid models mix and match any of the above to build custom pricing tiers for different customer segments when a customer's credit balance reaches zero, amberflo fires a webhook so you can decide whether to allow overages or pause usage immediately token prices in the dashboard the amberflo dashboard shows current token prices across all supported providers in one place prices update automatically — no manual maintenance required you can adjust your markup percentage at any time and changes take effect immediately for new usage automated price tracking when a provider changes their pricing or ships a new model, amberflo detects the change and notifies you you can also configure per rate card automation apply to new customers new customers are automatically placed on the updated pricing going forward apply to all customers updated pricing applies to both new and existing customers on that rate card one click billing setup enter your desired markup (e g 30%) and click submit amberflo provisions all the required usage based billing resources — prices, meters, and rate configuration — in a single step that's all you need to start billing for token usage three ways to track usage pick whichever integration approach fits your architecture ai gateway recommended route your llm calls through amberflo's ai gateway provide your prompt, model, and customer id — amberflo handles routing to the provider, returns the response, and records token usage broken down by model and type, all in one request no separate billing integration needed the ai gateway is available to all billing for llm tokens users integration partners already using a third party ai gateway? we've partnered with leading platforms to capture usage automatically — no extra api calls on your end openrouter litellm self report token usage manage your own llm provider connections? report token usage directly to amberflo using any of the following amberflo meter api