Consumption, Delivery and Metering

The Grid’s infrastructure ensures that buyers of Instruments receive inference delivered in accordance with each Instrument’s benchmarks, and that tokens are accurately metered and accounted for as they are consumed. This section explains how the underlying mechanism works.

The importance of accurate tracking

For each Instrument, buyers and suppliers rely on The Grid to:

  • Track exactly how many tokens were delivered against each Unit.

  • Confirm that performance stayed within the agreed Instrument Specifications.

  • Maintain records that support detailed usage metrics and help resolve any disputes.
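The first two responsibilities above can be illustrated with a minimal sketch. It is not The Grid's implementation: the tuple layout, the identifiers such as `unit-1`, and the `MAX_TTFT_MS` threshold are all hypothetical, chosen only to show per-Unit token tallies and a benchmark check.

```python
from collections import defaultdict

# Hypothetical per-call entries: (unit_id, tokens_delivered, time_to_first_token_ms).
calls = [
    ("unit-1", 1200, 310.0),
    ("unit-1", 800, 295.0),
    ("unit-2", 500, 720.0),
]

# Assumed threshold for illustration; real limits come from the Instrument's specification.
MAX_TTFT_MS = 500.0


def tokens_per_unit(entries):
    """Sum the tokens delivered against each Unit."""
    totals = defaultdict(int)
    for unit_id, tokens, _ttft_ms in entries:
        totals[unit_id] += tokens
    return dict(totals)


def benchmark_breaches(entries, max_ttft_ms):
    """Return the entries whose Time to First Token exceeded the threshold."""
    return [entry for entry in entries if entry[2] > max_ttft_ms]


totals = tokens_per_unit(calls)
breaches = benchmark_breaches(calls, MAX_TTFT_MS)
```

Keeping the per-call entries, rather than only running totals, is what would allow a disputed figure to be recomputed later.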

Canonical Usage Records

For every inference call, The Grid records usage data that includes:

  • Timestamp

    The times of the API call and of its response.

  • Unit and Lot Information

    The Units from which tokens were deducted, and the Lots to which those Units belonged.

  • Token Counts

    The number of input and output tokens for each call.

  • Trade Details

    The individual trade(s) where the buyer purchased those Unit(s).

  • Supplier Information

    The supplier that served the request.

  • Performance Metrics

    Metrics such as Time to First Token and other measurements that are part of the Minimum Service Benchmark.

These records help us ensure that buyers receive the best-quality inference and that suppliers adhere to the Instrument Specifications at all times.
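As a rough illustration of the fields listed above, a usage record could be modelled as follows. This is a sketch under stated assumptions, not The Grid's actual schema: every class name, field name, and identifier here (`UsageRecord`, `UnitDeduction`, `supplier-a`, and so on) is hypothetical, and only Time to First Token is shown among the performance metrics.

```python
import json
from dataclasses import asdict, dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class UnitDeduction:
    """Tokens deducted from one Unit, and the Lot that Unit belonged to (hypothetical shape)."""
    unit_id: str
    lot_id: str
    tokens: int


@dataclass(frozen=True)
class UsageRecord:
    """One record per inference call; field names are assumptions for illustration."""
    called_at: datetime            # time of the API call
    responded_at: datetime         # time of the API response
    deductions: tuple              # UnitDeduction entries for this call
    input_tokens: int
    output_tokens: int
    trade_ids: tuple               # trades in which the Units were purchased
    supplier_id: str               # supplier that served the request
    time_to_first_token_ms: float  # one example performance metric

    @property
    def total_tokens(self) -> int:
        return self.input_tokens + self.output_tokens

    def to_json(self) -> str:
        """Serialize the record, writing timestamps as ISO 8601 strings."""
        data = asdict(self)
        data["called_at"] = self.called_at.isoformat()
        data["responded_at"] = self.responded_at.isoformat()
        return json.dumps(data)


record = UsageRecord(
    called_at=datetime(2025, 1, 1, 12, 0, 0, tzinfo=timezone.utc),
    responded_at=datetime(2025, 1, 1, 12, 0, 2, tzinfo=timezone.utc),
    deductions=(UnitDeduction("unit-1", "lot-9", 150),),
    input_tokens=100,
    output_tokens=50,
    trade_ids=("trade-42",),
    supplier_id="supplier-a",
    time_to_first_token_ms=310.0,
)
```

The record is frozen so that it cannot be altered after it is written, which suits data that may later be used as evidence in a dispute.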

User Action Histories

While The Grid itself stores the routing data records, users can log in to their accounts to find detailed histories of their own actions:

  • Order submissions and cancellations.

  • Trades and associated price and size details.

  • Transfers between Trading and Consumption.

  • Sweep events and resulting Lots.

  • Configuration changes and admin actions.
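The action categories above could be represented as a simple event log that a user filters by type. The sketch below is purely illustrative: `ActionType`, `ActionEvent`, `filter_history`, and the `sequence` field are hypothetical names, not part of any documented Grid API.

```python
from dataclasses import dataclass
from enum import Enum


class ActionType(Enum):
    """Hypothetical labels for the action categories listed above."""
    ORDER_SUBMITTED = "order_submitted"
    ORDER_CANCELLED = "order_cancelled"
    TRADE = "trade"
    TRANSFER = "transfer"
    SWEEP = "sweep"
    CONFIG_CHANGE = "config_change"
    ADMIN_ACTION = "admin_action"


@dataclass(frozen=True)
class ActionEvent:
    sequence: int       # assumed monotonically increasing event number
    action: ActionType
    detail: str


def filter_history(events, *wanted):
    """Return events of the requested types, ordered by sequence number."""
    wanted_set = set(wanted)
    ordered = sorted(events, key=lambda event: event.sequence)
    return [event for event in ordered if event.action in wanted_set]


history = [
    ActionEvent(3, ActionType.TRADE, "bought 2 Units"),
    ActionEvent(1, ActionType.ORDER_SUBMITTED, "bid for 2 Units"),
    ActionEvent(4, ActionType.TRANSFER, "Trading to Consumption"),
    ActionEvent(2, ActionType.ORDER_CANCELLED, "cancelled an older bid"),
]

orders = filter_history(history, ActionType.ORDER_SUBMITTED, ActionType.ORDER_CANCELLED)
```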
