Question 1

What is AgentBox?

Accepted Answer

AgentBox is the production cloud for AI agents. Building a demo is easy; running an agent reliably for real users is not — that's what AgentBox handles. Builders publish their agent once and get the full stack underneath: 100+ frontier models through one API key, dedicated isolated runtime for every end-user session, built-in usage analytics and billing, and a Marketplace to reach Enterprise customers.

Because compute and inference run on the same network, agents in production don't pay the latency, egress, or operational cost of stitching together vendors.

Question 2

What does the Verified badge mean?

Accepted Answer

A hosted agent earns the Verified badge when it runs entirely on GMI infrastructure — GMI Cluster Engine for compute and GMI Models for inference, all on GMI's NVIDIA GPU fleet. Both layers share the same network, so execution and model calls don't leave GMI.

Question 3

How do I publish a hosted agent?

Accepted Answer

Complete the four-step Register wizard. You provide a container image URL, select a compute tier, choose a GMI model, and publish. Your template is stored on GMI; no container runs until an Enterprise provides an instance.

Question 4

Does GMI host my container image?

Accepted Answer

No. You host the image in your own registry. GMI stores only the URL and pulls from your registry every time an instance is provisioned.

Question 5

How do end users access a hosted agent?

Accepted Answer

The Enterprise calls POST /v1/containers with the agent's template_id and receives a dedicated container endpoint to route to its end user. Each end user gets their own isolated instance, with state and tool access scoped to that session.

Question 6

How am I billed for a hosted agent?

Accepted Answer

Two metered line items: container compute while an agent instance is running, plus GMI Models token usage if your agent calls GMI models. Both appear separately in Console → Settings → Usage & Billing.

Question 7

Am I charged for idle containers?

Accepted Answer

Each hosted agent runs on a dedicated container instance — the end user gets isolated state, no queue contention, and no cold starts. The container is billed for its full lifetime, from running until your application calls DELETE /v1/containers/{id}, including idle time between requests. We're exploring auto-pause for future releases.

Question 8

How do I track my spending?

Accepted Answer

Per-agent usage is visible in each agent's Analytics tab. Aggregate billing is in Console → Settings → Usage & Billing.

Capability	Self-Hosted / Stitched Stack	GMI Agentbox
Deployment + launch path	Manual	Included
Model + inference + compute	Separate setup	Included
Resource transparency	Manual	Included
Agentbox access layer	Separate system	Included
Usage and logs	Separate tools	Included
Go-live visibility	Limited	Included
Commercialization path	Custom build	Included

The full-stack platform for production-ready
Agents

The full-stack platform for production-ready Agents

Not an agent catalog
A launchpad for Agents

Access or deploy, in one platform

Use GMI your way

Deploy first. Launch when ready

Deploy your Agent

Connect models & compute

Validate and publish

Operate after launch

Everything you need to go from workflow to product

Client cases

Topify

SocratesLabs

NemoClaw

GMI Cloud Sales Ops Agent

TinyHumans

One platform
Not a patchwork of tools

FAQ

Your Agent is ready
Now make it launchable

The full-stack platform for production-readyAgents