Skip to main content
Tensormesh Lab is an experimental space for exploring platform capabilities hands-on. Run live demos against serverless models to visualize how KV caching accelerates inference. Navigate to Tensormesh Lab → Demos from the sidebar to get started.

How It Works

1

Choose a Demo

Browse the demo catalog and select one to run.
2

Select a Model

Pick from the available serverless models. No deployment setup required.
3

Run the Demo

Configure and click Run. The demo streams results in real time.

Available Demos

Ask the Document — Sends 20 questions against a shared long-document prefix to demonstrate KV cache speedup. The first request is cold; follow-ups reuse the cached state and skip the prefill step.
Try the same demo across different models to compare how architecture and model size affect cache speedup.