Consumption, Delivery and Metering
When you consume inference through The Grid's API, the platform tracks every request at the token level, recording which Unit and Lot the tokens were drawn from, which supplier served the request, and whether the response met the Instrument's quality benchmarks. This metering and delivery infrastructure is what makes The Grid an auditable market rather than a black-box proxy: every token is accounted for, and failed or out-of-spec deliveries are not billed.
The importance of accurate tracking
For each Instrument, buyers and suppliers rely on The Grid to:
Track exactly how many tokens were delivered against each Unit.
Confirm that performance stayed within the agreed Instrument Specifications.
Maintain records that support detailed usage metrics and the resolution of any disputes.
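As a rough illustration of the first two points, the sketch below tallies delivered tokens per Unit and counts only in-spec deliveries as billable. The `Delivery` type and its fields (`unit_id`, `tokens`, `in_spec`) are hypothetical names chosen for this example; they are not The Grid's actual schema or API.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Delivery:
    # Hypothetical fields for illustration; not The Grid's actual schema.
    unit_id: str
    tokens: int
    in_spec: bool  # True if the response met the Instrument's benchmarks


def billable_tokens_by_unit(deliveries):
    """Sum tokens per Unit, skipping failed or out-of-spec deliveries."""
    totals = defaultdict(int)
    for d in deliveries:
        if d.in_spec:
            totals[d.unit_id] += d.tokens
    return dict(totals)


deliveries = [
    Delivery("unit-a", 1200, True),
    Delivery("unit-a", 800, False),  # out of spec: not billed
    Delivery("unit-b", 500, True),
]
print(billable_tokens_by_unit(deliveries))
```

Here the out-of-spec delivery against `unit-a` is excluded from the billable total, mirroring the rule that failed or out-of-spec deliveries are not billed.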
Canonical Usage Records
For every inference call, The Grid records usage data that includes:
Timestamp
The times of the API call and of its response.
Unit and Lot Information
The Units from which tokens were deducted, and the Lots to which those Units belonged.
Token Counts
Number of input and output tokens per call.
Trade Details
The individual trade(s) in which the buyer purchased those Unit(s).
Supplier Information
The supplier that served the request.
Performance Metrics
Metrics such as Time to First Token and other measurements that are part of the Minimum Service Benchmark.
These records help us ensure that buyers receive inference that meets the agreed quality benchmarks, and that suppliers adhere to the Instrument Specifications at all times.
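One way to picture a usage record with the fields listed above is the following minimal sketch. Every name here (`UsageRecord`, `unit_ids`, `time_to_first_token_ms`, and so on) is an assumption made for illustration; the actual record format exposed by The Grid may differ.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class UsageRecord:
    # All field names are assumptions for illustration only.
    request_time: datetime         # Timestamp: when the call was made
    response_time: datetime        # Timestamp: when the response was returned
    unit_ids: tuple                # Units the tokens were deducted from
    lot_ids: tuple                 # Lots those Units belonged to
    input_tokens: int              # Token Counts
    output_tokens: int
    trade_ids: tuple               # Trades in which the Units were purchased
    supplier_id: str               # Supplier that served the request
    time_to_first_token_ms: float  # One example Performance Metric

    @property
    def total_tokens(self):
        return self.input_tokens + self.output_tokens

    @property
    def latency_seconds(self):
        return (self.response_time - self.request_time).total_seconds()


record = UsageRecord(
    request_time=datetime(2025, 1, 1, 12, 0, 0, tzinfo=timezone.utc),
    response_time=datetime(2025, 1, 1, 12, 0, 2, tzinfo=timezone.utc),
    unit_ids=("unit-a",),
    lot_ids=("lot-1",),
    input_tokens=100,
    output_tokens=250,
    trade_ids=("trade-42",),
    supplier_id="supplier-x",
    time_to_first_token_ms=180.0,
)
print(record.total_tokens, record.latency_seconds)
```

Keeping the record immutable (`frozen=True`) reflects its role as an audit entry: once written, it is read for metrics and dispute resolution rather than modified.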
User Action Histories
Users can log in to their accounts to view a history of their actions:
Order submissions and cancellations
Trades and associated price and size details
Transfers from Trading Account to Consumption Account
Sweep events and resulting Lots