Skip to main content
POST
https://api.tensormesh.ai
/
v1
/
models:search-by-infra
List Models By Infra
curl --request POST \
  --url https://api.tensormesh.ai/v1/models:search-by-infra \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "infra": {
    "cloudProvider": "CLOUD_PROVIDER_UNSPECIFIED",
    "nebiusRegion": "NEBIUS_REGION_UNSPECIFIED",
    "lambdaRegion": "LAMBDA_REGION_UNSPECIFIED",
    "onpremRegion": "<string>"
  },
  "userId": "<string>"
}
'
{
  "models": [
    {
      "modelId": "<string>",
      "deploymentId": "<string>",
      "userId": "<string>",
      "description": "<string>",
      "modelPath": "<string>",
      "modelName": "<string>",
      "status": "MODEL_STATUS_UNSPECIFIED",
      "events": [
        {
          "createdAt": "2023-11-07T05:31:56Z",
          "log": "<string>",
          "eventType": "EVENT_TYPE_UNSPECIFIED"
        }
      ],
      "createdAt": "2023-11-07T05:31:56Z",
      "updatedAt": "2023-11-07T05:31:56Z",
      "modelSpec": {},
      "infra": {
        "cloudProvider": "CLOUD_PROVIDER_UNSPECIFIED",
        "nebiusRegion": "NEBIUS_REGION_UNSPECIFIED",
        "lambdaRegion": "LAMBDA_REGION_UNSPECIFIED",
        "onpremRegion": "<string>"
      },
      "gpuCount": 123,
      "gpuType": "GPU_TYPE_UNSPECIFIED",
      "replicas": 123,
      "endpoint": "<string>",
      "apiKey": "<string>"
    }
  ]
}

Authorizations

Authorization
string
header
required

Bearer authentication using an access token. Format: Bearer <access_token>

Body

application/json

ListModelsByInfraRequest contains the information needed to list models in a infrastructure.

infra
object

Infra specifies the infrastructure configuration for deploying and running models.

This message defines where a model deployment should run by specifying both the cloud provider and the specific region. It uses a oneof for region selection to ensure type-safe region specification based on the chosen provider.

See also: tensormesh/common/v1/cloud_provider.proto for provider and region enum definitions

userId
string

User ID for authorization. Must be a valid UUID.

Response

A successful response.

ListModelsByInfraResponse contains the list of models in the infrastructure.

models
object[]

List of deployments in the cluster.