Reserved Deployments provide dedicated GPU clusters tailored to your infrastructure needs, with guaranteed capacity, consistent performance, and no resource contention. To submit a request, navigate to Deploy → Reserved.

When To Use Reserved

High-Volume Production

Workloads that require consistent throughput at a scale where serverless costs exceed a flat cluster rate.

Latency SLAs

Applications with strict latency requirements that need dedicated, non-shared GPU resources.

Enterprise Compliance

Deployments that require data isolation, custom networking, or specific compliance guarantees.

Tailored Pricing

A flat cluster rate replaces variable per-token billing — easier to budget at scale and priced to your specific workload and capacity requirements.
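
As a rough illustration of when a flat rate starts to win over per-token billing, the break-even volume can be estimated with simple arithmetic. The sketch below is not based on actual Tensormesh pricing: the flat monthly rate, the per-token price, and both function names are hypothetical placeholders, and real figures come from the quote you receive.

```python
def serverless_cost(tokens: float, price_per_million_tokens: float) -> float:
    """Monthly serverless spend for a given token volume."""
    return tokens / 1_000_000 * price_per_million_tokens


def breakeven_tokens(flat_monthly_rate: float, price_per_million_tokens: float) -> float:
    """Monthly token volume at which serverless spend equals the flat rate."""
    if flat_monthly_rate < 0 or price_per_million_tokens <= 0:
        raise ValueError("rates must be positive")
    return flat_monthly_rate / price_per_million_tokens * 1_000_000


# Hypothetical figures for illustration only, not real prices.
FLAT_MONTHLY_RATE = 20_000.0       # assumed flat cluster rate per month
PRICE_PER_MILLION_TOKENS = 0.50    # assumed serverless price per 1M tokens

threshold = breakeven_tokens(FLAT_MONTHLY_RATE, PRICE_PER_MILLION_TOKENS)
print(f"Break-even volume: {threshold:,.0f} tokens per month")

# Above the threshold, serverless spend exceeds the flat rate.
monthly_tokens = 60_000_000_000
print(serverless_cost(monthly_tokens, PRICE_PER_MILLION_TOKENS) > FLAT_MONTHLY_RATE)
```

With these assumed numbers, the break-even point is 40 billion tokens per month. A sustained volume above that would favor the flat rate, although latency and compliance requirements may justify Reserved at lower volumes.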

Requesting a Cluster

Submit a request through the form at Deploy → Reserved. Provide your cluster specifications:

GPU Selection: Choose between high-compute GPU options
Cluster Size: Define the total number of GPUs required for your workload
Timeline: Specify your deployment window
Use Case: Describe the intended workload
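
It can help to draft these four specifications before opening the form. The sketch below is only a local checklist, not an API: requests are submitted through the web form, and the class name, field names, and the start/end representation of the deployment window are all assumptions made for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class ReservedRequestDraft:
    """Hypothetical local draft of the four form fields; not a Tensormesh API."""

    gpu_selection: str   # the GPU option you intend to pick in the form
    cluster_size: int    # total number of GPUs
    window_start: date   # assumed representation of the deployment window
    window_end: date
    use_case: str        # short description of the intended workload

    def problems(self) -> list[str]:
        """Return a list of issues to resolve before filling in the form."""
        issues = []
        if not self.gpu_selection.strip():
            issues.append("GPU selection is empty")
        if self.cluster_size < 1:
            issues.append("cluster size must be at least 1 GPU")
        if self.window_end < self.window_start:
            issues.append("deployment window ends before it starts")
        if not self.use_case.strip():
            issues.append("use case is empty")
        return issues


draft = ReservedRequestDraft(
    gpu_selection="<GPU option from the form>",
    cluster_size=16,
    window_start=date(2025, 1, 1),
    window_end=date(2025, 6, 30),
    use_case="High-volume production inference with a latency SLA",
)
print(draft.problems())
```

An empty list means every field has a usable value. Having the details written down in one place also makes the consultation call shorter.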

Review and Launch

Once a request is submitted, our team reviews your requirements:
1. Consultation: Our team contacts you within 1 business day to discuss your requirements.
2. Capacity Planning: We provide a tailored pricing and hardware roadmap based on your specifications.
3. Provisioning: Dedicated GPU instances are provisioned and your cluster is ready to serve requests.

Not Ready for Reserved?

Start with Serverless Inference for instant access, pay-per-token billing, and no setup. Serverless is suitable for most development and production API workloads. Move to Reserved when your volume or latency requirements exceed what Serverless offers.
You can also reach us directly via Management → Contact Us to discuss your capacity needs before submitting a formal request.