Consumption, Delivery and Metering

When you consume inference through The Grid's API, the platform tracks every request at the token level, recording which Unit and Lot the tokens were drawn from, which supplier served the request, and whether the response met the Instrument's quality benchmarks. This metering and delivery infrastructure is what makes The Grid an auditable market rather than a black-box proxy: every token is accounted for, and failed or out-of-spec deliveries are not billed.

The importance of accurate tracking

For each Instrument, buyers and suppliers rely on The Grid to:

  • Track exactly how many tokens were delivered against each Unit.

  • Confirm that performance stayed within the agreed Instrument Specifications.

  • Maintain records that support detailed usage metrics and the resolution of any disputes.
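
The tracking described above, together with the rule that failed or out-of-spec deliveries are not billed, can be illustrated with a minimal sketch. This is not The Grid's implementation: the `Delivery` type, its field names (`unit_id`, `tokens`, `succeeded`, `within_spec`), and the `billable_tokens_per_unit` helper are all hypothetical, and the sketch assumes each delivery has already been flagged as successful and in-spec (or not).

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, Iterable


@dataclass(frozen=True)
class Delivery:
    """Hypothetical, simplified view of one metered delivery."""
    unit_id: str        # Unit the tokens were drawn from (assumed identifier)
    tokens: int         # tokens delivered for this request
    succeeded: bool     # assumed flag: the request completed
    within_spec: bool   # assumed flag: the response met the Instrument's benchmarks


def billable_tokens_per_unit(deliveries: Iterable[Delivery]) -> Dict[str, int]:
    """Sum tokens per Unit, skipping failed or out-of-spec deliveries."""
    totals: Dict[str, int] = defaultdict(int)
    for d in deliveries:
        if d.succeeded and d.within_spec:
            totals[d.unit_id] += d.tokens
    return dict(totals)


deliveries = [
    Delivery("unit-A", 1000, succeeded=True, within_spec=True),
    Delivery("unit-A", 500, succeeded=True, within_spec=False),   # out of spec: not counted
    Delivery("unit-B", 200, succeeded=False, within_spec=False),  # failed: not counted
    Delivery("unit-B", 300, succeeded=True, within_spec=True),
]
totals = billable_tokens_per_unit(deliveries)
print(totals)
```

In this example only the two successful, in-spec deliveries contribute to the per-Unit totals.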

Canonical Usage Records

For every inference call, The Grid records usage data that includes:

  • Timestamp

    The times at which the API received the call and returned the response.

  • Unit and Lot Information

    The Units from which tokens were deducted, and the Lots to which those Units belong.

  • Token Counts

    The number of input and output tokens per call.

  • Trade Details

    The individual trade(s) in which the buyer purchased those Unit(s).

  • Supplier Information

    The supplier that served the request.

  • Performance Metrics

    Metrics such as Time to First Token and other measurements that are part of the Minimum Service Benchmark.

These records help us ensure that buyers receive the best quality inference, and that suppliers adhere to the Instrument Specifications at all times.
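
As an illustration of how the fields listed above fit together, here is a minimal sketch of a usage record as a data structure. The class name `UsageRecord`, every field name, the identifier formats, and the choice of milliseconds for Time to First Token are assumptions made for this example; the actual record schema is not specified here.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import List


@dataclass(frozen=True)
class UsageRecord:
    """Hypothetical shape of one usage record; all names are illustrative."""
    request_time: datetime          # Timestamp: when the call was received
    response_time: datetime         # Timestamp: when the response was returned
    unit_ids: List[str]             # Units from which tokens were deducted
    lot_ids: List[str]              # Lots those Units belong to
    input_tokens: int               # Token Counts: input
    output_tokens: int              # Token Counts: output
    trade_ids: List[str]            # Trades in which the Units were purchased
    supplier_id: str                # Supplier that served the request
    time_to_first_token_ms: float   # Performance metric (unit is an assumption)

    @property
    def total_tokens(self) -> int:
        """Input plus output tokens for this call."""
        return self.input_tokens + self.output_tokens


record = UsageRecord(
    request_time=datetime(2025, 1, 1, 12, 0, 0, tzinfo=timezone.utc),
    response_time=datetime(2025, 1, 1, 12, 0, 2, tzinfo=timezone.utc),
    unit_ids=["unit-A"],
    lot_ids=["lot-1"],
    input_tokens=1200,
    output_tokens=350,
    trade_ids=["trade-42"],
    supplier_id="supplier-X",
    time_to_first_token_ms=180.0,
)
print(record.total_tokens)
```

Keeping the record immutable (`frozen=True`) reflects its role as an audit trail: once written, a usage record should not change.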

User Action Histories

Users can log in to their accounts to view a history of their actions, including:

  • Order submissions and cancellations

  • Trades and associated price and size details

  • Transfers from Trading Account to Consumption Account

  • Sweep events and resulting Lots
