Current Instruments : Chat Prime & Chat Fast
Overview
The initial Text-to-Text Instruments available on The Grid are:
Chat Fast: Optimized for speed and throughput.
Very low time to first token.
High streaming tokens per second.
Designed for short to medium outputs.
Intelligence floor that is good enough for many production workloads.
Chat Prime: Optimized for quality and long form coherence.
Higher minimum intelligence benchmark score.
Larger maximum output size.
Accepts somewhat slower time to first token.
Designed for deep reasoning, long context, and complex tool use.
Which Instrument is right for you?
Choose Chat Fast when:
You have tight latency budgets and need instant feeling UX.
You run many parallel calls where throughput and cost per token matter more than the last bit of reasoning quality.
Your prompts are short and you expect brief to moderate outputs.
You are doing routing, summarization, classification, or other transforms rather than complex planning.
Typical use cases:
Support chat assistants, help center bots, Slack and Discord helpers.
Email, meeting, and thread summarization, code diff explanations, document summaries.
RAG answers where context is already tight and responses should be concise.
Bulk text transforms such as classify, redact, extract fields, paraphrase, or translate short snippets.
Product surfaces that need instant feedback like autocomplete, inline suggestions, or hinting.
Choose Chat Prime when:
You need higher reasoning quality for multi step analysis or synthesis across long context.
You expect very long outputs that must remain coherent.
You rely heavily on tool calling, retrieval, or other agent style workflows.
You are handling high stakes user facing answers where wrong but fast is not acceptable.
You prefer fewer escalations and are willing to pay more per Unit to get higher quality by default.
Typical use cases:
Deep code reasoning narratives like architecture reviews, migration plans, and extensive refactors with verification steps.
Executive assistants and research copilots that read large corpora and produce detailed briefs.
Strategic and planning agents for OKRs, roadmaps, project plans, and complex workflow orchestration.
Long form documents such as RFCs, memos, policy drafts, or multi chapter documentation.
High stakes customer facing answers where correctness is more important than raw speed or cost.
Last updated
Was this helpful?