GPU AI Service

List deployable Inventory items

get

Returns GPU offerings that are deployable right now based on live inventory, with pricing and availability classification.

Authorizations

AuthorizationstringRequired

JWT token for authentication

Query parameters

regionstringOptional

Filter by region name (e.g., as-south-1)

typestring · enumOptional

Filter by item type (GPU only)

Default: GPUPossible values:

gpu_modelstringOptional

Filter GPU items by model (e.g., A40, A100, L40S)

gpu_vendorstringOptional

Filter by GPU vendor (case-insensitive, e.g., NVIDIA)

vram_minintegerOptional

Minimum VRAM (GiB). 0 or missing means no lower bound.

vram_maxintegerOptional

Maximum VRAM (GiB). 0 or missing means no upper bound.

pageinteger · min: 1Optional

Page number for pagination (starts from 1)

Default: 1

limitinteger · min: 1 · max: 100Optional

Number of items per page (max 100)

Default: 20

Responses

200

Inventory retrieved

application/json

Responseany

400

Invalid query parameters

application/json

500

Internal server error

application/json

get

/api/v1beta1/inventory

GET /api/v1beta1/inventory HTTP/1.1
Authorization: Bearer YOUR_SECRET_TOKEN
Accept: */*

No content

Get GPU config details by config_id

get

Authorizations

AuthorizationstringRequired

JWT token for authentication

Query parameters

config_idstringRequired

Unique config identifier from gpu_devices

Responses

200

GPU config retrieved

application/json

Responseany

400

Invalid request

application/json

404

Config not found

application/json

500

Internal server error

application/json

get

/api/v1beta1/inventory/gpuconfig

GET /api/v1beta1/inventory/gpuconfig?config_id=text HTTP/1.1
Authorization: Bearer YOUR_SECRET_TOKEN
Accept: */*

No content

List GPU notification requests for the authenticated user

get

Authorizations

AuthorizationstringRequired

JWT token for authentication

Query parameters

statusstring · enumOptional

Filter requests by status

Possible values:

Responses

200

List of notification requests

application/json

ResponseGPUNotifyAvailabilityRequest[]

401

Unauthorized

application/json

500

Internal server error

application/json

get

/api/v1beta1/inventory/gpu/notify-availability

GET /api/v1beta1/inventory/gpu/notify-availability HTTP/1.1
Authorization: Bearer YOUR_SECRET_TOKEN
Accept: */*

[]

Register for email notification when a GPU becomes available

post

Authorizations

AuthorizationstringRequired

JWT token for authentication

Body

region_idstringRequired

Region ID (e.g. as-south-1)

config_idstringRequired

GPU Config ID to watch for

Responses

202

Request accepted

400

Invalid request

application/json

404

Region or Config not found

application/json

500

Internal server error

application/json

post

/api/v1beta1/inventory/gpu/notify-availability

POST /api/v1beta1/inventory/gpu/notify-availability HTTP/1.1
Authorization: Bearer YOUR_SECRET_TOKEN
Content-Type: application/json
Accept: */*
Content-Length: 39

{
  "region_id": "text",
  "config_id": "text"
}

No content

List AI Runtimes

get

Retrieves a list of AI Runtimes for a specific project.

Authorizations

AuthorizationstringRequired

JWT token for authentication

Path parameters

org_idstringRequired

The organization identifier.

project_idstringRequired

The project identifier.

Query parameters

pageinteger · min: 1Optional

Page number for pagination (starts from 1)

Default: 1

limitinteger · min: 1 · max: 100Optional

Number of items per page (max 100)

Default: 20

Responses

200

A paginated list of AI Runtimes.

application/json

Responseany

400

The request is invalid.

application/json

401

Unauthorized access.

application/json

404

The project was not found.

application/json

500

An unexpected internal server error occurred.

application/json

get

/api/v1beta1/orgs/{org_id}/projects/{project_id}/airuntimes

GET /api/v1beta1/orgs/{org_id}/projects/{project_id}/airuntimes HTTP/1.1
Authorization: Bearer YOUR_SECRET_TOKEN
Accept: */*

No content

Create an AI Runtime

post

Creates a new AI Runtime instance from a direct specification or a pre-defined template.

Authorizations

AuthorizationstringRequired

JWT token for authentication

Path parameters

org_idstringRequired

The organization identifier.

project_idstringRequired

The project identifier.

Body

anyOptional

Responses

201

The AI Runtime was created successfully.

application/json

400

The request payload is invalid.

application/json

401

Unauthorized access.

application/json

404

The project or template was not found.

application/json

409

An AI Runtime with the same name already exists in the project.

application/json

500

An unexpected internal server error occurred.

application/json

post

/api/v1beta1/orgs/{org_id}/projects/{project_id}/airuntimes

POST /api/v1beta1/orgs/{org_id}/projects/{project_id}/airuntimes HTTP/1.1
Authorization: Bearer YOUR_SECRET_TOKEN
Content-Type: application/json
Accept: */*

No content

Get AI Runtime by ID

get

Fetches the details of a specific AI Runtime by its unique identifier.

Authorizations

AuthorizationstringRequired

JWT token for authentication

Path parameters

org_idstringRequired

The organization identifier.

project_idstringRequired

The project identifier.

airuntime_idstring · uuidRequired

The unique identifier of the AI Runtime.

Responses

200

Detailed information about the AI Runtime.

application/json

Responseany

400

The request is invalid.

application/json

401

Unauthorized access.

application/json

404

An AI Runtime with the specified ID was not found.

application/json

500

An unexpected internal server error occurred.

application/json

get

/api/v1beta1/orgs/{org_id}/projects/{project_id}/airuntimes/{airuntime_id}

GET /api/v1beta1/orgs/{org_id}/projects/{project_id}/airuntimes/{airuntime_id} HTTP/1.1
Authorization: Bearer YOUR_SECRET_TOKEN
Accept: */*

No content

Delete an AI Runtime

delete

Permanently deletes a specific AI Runtime by its unique ID.

Authorizations

AuthorizationstringRequired

JWT token for authentication

Path parameters

org_idstringRequired

The organization identifier.

project_idstringRequired

The project identifier.

airuntime_idstring · uuidRequired

The unique identifier of the AI Runtime to delete.

Responses

204

The AI Runtime was deleted successfully.

400

The request is invalid.

application/json

401

Unauthorized access.

application/json

404

An AI Runtime with the specified ID was not found.

application/json

500

An unexpected internal server error occurred.

application/json

delete

/api/v1beta1/orgs/{org_id}/projects/{project_id}/airuntimes/{airuntime_id}

DELETE /api/v1beta1/orgs/{org_id}/projects/{project_id}/airuntimes/{airuntime_id} HTTP/1.1
Authorization: Bearer YOUR_SECRET_TOKEN
Accept: */*

No content

Get AI Runtime utilization metrics

get

Fetches real-time resource utilization metrics (CPU, RAM, GPU) for a specific AI Runtime.

Authorizations

AuthorizationstringRequired

JWT token for authentication

Path parameters

org_idstringRequired

The organization identifier.

project_idstringRequired

The project identifier.

airuntime_idstring · uuidRequired

The unique identifier of the AI Runtime.

Responses

200

Resource utilization metrics for the AI Runtime.

application/json

Responseany

400

The request is invalid.

application/json

401

Unauthorized access.

application/json

404

An AI Runtime with the specified ID was not found.

application/json

500

An unexpected internal server error occurred.

application/json

get

/api/v1beta1/orgs/{org_id}/projects/{project_id}/airuntimes/{airuntime_id}/metrics

GET /api/v1beta1/orgs/{org_id}/projects/{project_id}/airuntimes/{airuntime_id}/metrics HTTP/1.1
Authorization: Bearer YOUR_SECRET_TOKEN
Accept: */*

No content

List AI Runtime Templates

get

Retrieves a list of available AI Runtime Templates.

Authorizations

AuthorizationstringRequired

JWT token for authentication

Responses

200

A paginated list of AI Runtime Templates.

application/json

Responseany

401

Unauthorized access.

application/json

500

An unexpected internal server error occurred.

application/json

get

/api/v1beta1/airuntime-templates

GET /api/v1beta1/airuntime-templates HTTP/1.1
Authorization: Bearer YOUR_SECRET_TOKEN
Accept: */*

No content

Get an AI Runtime Template

get

Retrieves the details of a specific AI Runtime Template by its unique ID.

Authorizations

AuthorizationstringRequired

JWT token for authentication

Path parameters

template_idstringRequired

The unique identifier of the AI Runtime Template.

Pattern: ^tpl-[a-zA-Z0-9-]+$

Responses

200

Detailed information about the AI Runtime Template.

application/json

Responseany

401

Unauthorized access.

application/json

404

An AI Runtime Template with the specified ID was not found.

application/json

500

An unexpected internal server error occurred.

application/json

get

/api/v1beta1/airuntime-templates/{template_id}

GET /api/v1beta1/airuntime-templates/{template_id} HTTP/1.1
Authorization: Bearer YOUR_SECRET_TOKEN
Accept: */*

No content

PreviousTenant NextAI Inference

Last updated 20 days ago

Good night

hashtagList deployable Inventory items

hashtagGet GPU config details by config_id

hashtagList GPU notification requests for the authenticated user

hashtagRegister for email notification when a GPU becomes available

hashtagList AI Runtimes

hashtagCreate an AI Runtime

hashtagGet AI Runtime by ID

hashtagDelete an AI Runtime

hashtagGet AI Runtime utilization metrics

hashtagList AI Runtime Templates

hashtagGet an AI Runtime Template

List deployable Inventory items

Get GPU config details by config_id

List GPU notification requests for the authenticated user

Register for email notification when a GPU becomes available

List AI Runtimes

Create an AI Runtime

Get AI Runtime by ID

Delete an AI Runtime

Get AI Runtime utilization metrics

List AI Runtime Templates

Get an AI Runtime Template