Skip to main content

What This Checks

tm infer doctor is a local readiness check for the next inference request. It helps you confirm that the CLI has the inputs it needs, but it does not send a live inference request. Use tm infer chat for an end-to-end gateway check, and use tm auth whoami if you also need to confirm the current Control Plane bearer token.

Usage

tm infer doctor [OPTIONS]

Examples

Inspect On-Demand readiness for a direct request and for --model @latest.

tm infer doctor

Inspect Serverless readiness for a direct request that will provide --model explicitly.

tm infer doctor --surface serverless

Inspect the effective Serverless target after applying explicit request overrides.

tm infer doctor --surface serverless --model YOUR_SERVERLESS_MODEL_NAME --base-url https://serverless.tensormesh.ai

Use shell-friendly exit codes when direct gateway prerequisites are missing.

tm infer doctor --exit-status

Options

NameTypeRequiredDefaultDetails
--surface`choice[on-demandserverless]`no"on-demand"Inference surface to inspect.
--modeltextnoModel name to inspect for the next request.
--user-idtextnoX-User-Id header to inspect. Only used for —surface on-demand.
--api-keytextnoInference API key to inspect.
--base-urltextnoOverride the base URL for the selected surface.
--exit-statusbooleannofalseExit non-zero when checks report a not-ready state. Boolean flag.

Inherited Global Options

NameTypeRequiredDefaultDetails
--version, -VbooleannofalseShow the version and exit. Boolean flag.
--configpathno"~/.config/tensormesh/config.toml"Path to config TOML file
--output`choice[textjsonyamlrawtable]`no"text"Output format (text is human-readable; json is machine-friendly).
--quietbooleannofalseSuppress non-essential output. Boolean flag.
--debugbooleannofalsePrint debug logs to stderr (secrets redacted). Boolean flag.
--ca-bundlepathnoPath to a PEM CA bundle for TLS verification (overrides TENSORMESH_CA_BUNDLE).
--timeoutfloatnoDefault HTTP timeout in seconds (overrides TENSORMESH_TIMEOUT_SECONDS; subcommands may override).
--max-retriesintegernoMax retries for idempotent HTTP requests on transient errors (overrides TENSORMESH_MAX_RETRIES; subcommands may override).
--controlplane-basetextnoOverride the Control Plane base URL.
--gateway-providertextnoInference Gateway provider for built-in host selection (nebius, lambda, yotta).

Prerequisites

  • For the default On-Demand flow, run tm init --sync first so the CLI can reuse the synced managed gateway settings.

Caveats

  • Reports local readiness only. Use tm auth whoami to live-validate Control Plane auth and tm infer chat for an end-to-end inference check.
  • tm infer doctor accepts the same local targeting overrides as tm infer chat for --model, --user-id, --api-key, and --base-url.
  • --surface on-demand checks local prerequisites for X-User-Id and --model @latest resolution.
  • --surface serverless checks the shared inference API key (gateway_api_key) and selected host, but does not live-validate the upstream endpoint.
  • Use tm billing pricing serverless list to discover published serverless model names when you have Control Plane access for the same Tensormesh environment. tm infer doctor can check whether --model is present, but it does not enumerate them itself.
  • --model @latest readiness only applies to --surface on-demand and only checks for a local Control Plane token; it does not live-validate that token.

Parent Command