AI Endpoint
Create a private AI Endpoint
A private AI Endpoint can be created in two ways:
- Using arguments
- Using a file
To create the endpoint using arguments, run:
initz create ai-endpoint --name=<ai-endpoint-name> --mode=<prod/test>
Flags
--name
- Indicates the name of the private AI Endpoint.
- Required
- Shorthand: n
--mode
- Value can be prod or test.
- Optional
--orgid
- Indicates the ID of the organization.
- Optional
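The flag rules above (--name is required, --mode is optional and limited to prod or test) can be checked locally before calling the CLI. The following is a minimal sketch, not part of initz: build_create_cmd is a hypothetical helper that only validates its inputs and prints the command line, so it can be reviewed before it is run.

```shell
#!/bin/sh
# Hypothetical helper (not part of initz): validates the arguments locally
# and prints the resulting command line instead of executing it.
build_create_cmd() {
    name="$1"
    mode="${2:-}"    # optional, as described in the flag list

    if [ -z "$name" ]; then
        echo "error: --name is required" >&2
        return 1
    fi

    case "$mode" in
        ""|prod|test) ;;    # empty means the flag is simply omitted
        *)
            echo "error: --mode must be prod or test" >&2
            return 1
            ;;
    esac

    cmd="initz create ai-endpoint --name=$name"
    if [ -n "$mode" ]; then
        cmd="$cmd --mode=$mode"
    fi
    echo "$cmd"
}
```

For example, build_create_cmd demo test prints the command for an endpoint named demo in test mode, while an unsupported mode is rejected with a non-zero status.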
To create the endpoint using a file, pass a JSON or YAML configuration file with -f:
initz create ai-endpoint -f <ai-endpoint.json/ai-endpoint.yaml>
Example Configuration (JSON and YAML):
ai-endpoint.json
{
"ai_svc_name": "example",
"apply_policy": false,
"hf_token": "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",
"instance_size": "Nvidia L40S - 1 GPU 40G | 16GB RAM | 4vCPU",
"max_replicas": 1,
"min_replicas": 0,
"mode": "prod",
"model_repository": "meta-llama/Llama-3.2-1B-Instruct",
"org_id": "65bd07d8ffa05c77d828eaaa",
"workspace_id": "66e2e80a8641db5e5c7b8833"
}
ai-endpoint.yaml
ai_svc_name: example
apply_policy: false
hf_token: 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
instance_size: Nvidia L40S - 1 GPU 40G | 16GB RAM | 4vCPU
max_replicas: 1
min_replicas: 0
mode: prod
model_repository: meta-llama/Llama-3.2-1B-Instruct
org_id: 65bd07d8ffa05c77d828eaaa
workspace_id: 66e2e80a8641db5e5c7b8833
You can either use the example configuration above to create the private AI Endpoint, or generate your own template by running one of the commands below.
Generate Configuration:
- JSON
initz create ai-endpoint --name=<ai-endpoint-name> --mode=<prod/test> -t json
- YAML
initz create ai-endpoint --name=<ai-endpoint-name> --mode=<prod/test> -t yaml
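When the endpoint is created from a file, the Hugging Face token and the IDs do not have to be typed into the file by hand. The following is a minimal sketch, assuming the values are available as environment variables; HF_TOKEN, ORG_ID, and WORKSPACE_ID are hypothetical names chosen for this illustration, and write_endpoint_config is not an initz command. The field names and the fixed values are taken from the example configuration.

```shell
#!/bin/sh
# Sketch only: writes a JSON config using the field names from the example
# configuration. HF_TOKEN, ORG_ID and WORKSPACE_ID are hypothetical
# environment variable names, not something initz itself reads.
write_endpoint_config() {
    out="$1"

    # Refuse to write the file if any required variable is missing.
    for var in HF_TOKEN ORG_ID WORKSPACE_ID; do
        eval "val=\${$var:-}"
        if [ -z "$val" ]; then
            echo "error: $var is not set" >&2
            return 1
        fi
    done

    # Values are inserted verbatim, so they must not contain double quotes.
    cat > "$out" <<EOF
{
  "ai_svc_name": "example",
  "apply_policy": false,
  "hf_token": "$HF_TOKEN",
  "instance_size": "Nvidia L40S - 1 GPU 40G | 16GB RAM | 4vCPU",
  "max_replicas": 1,
  "min_replicas": 0,
  "mode": "prod",
  "model_repository": "meta-llama/Llama-3.2-1B-Instruct",
  "org_id": "$ORG_ID",
  "workspace_id": "$WORKSPACE_ID"
}
EOF
}
```

After write_endpoint_config ai-endpoint.json succeeds, the resulting file can be passed to initz create ai-endpoint -f ai-endpoint.json. Keeping the token in the environment avoids committing it alongside the configuration.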
Important
- Private AI Endpoints can only be created in a GPU-enabled workspace.
- If you are not currently in a GPU-enabled workspace, do one of the following:
  - Set the GPU-enabled workspace by running the command below.
    initz set workspace --workspaceid=<workspaceID>
  - Or switch to the GPU-enabled workspace by passing it as an argument while creating the private AI Endpoint.
    initz create ai-endpoint --name=<ai-endpoint-name> --mode=<prod/test> --workspaceid=<workspaceID>