Manage and configure model resources
config.yaml
before running truss push
. Any changes after deployment will not impact previous deployments. Running truss push
again will create a new deployment using the resources specified in the config.yaml
.
Gi
in resources.memory
refers to Gibibytes, which are slightly larger
than Gigabytes.truss push
, the development deployment will be redeployed with the new specified instance type.Instance | $/min | vCPU | RAM |
---|---|---|---|
1×2 | $0.00058 | 1 | 2 GiB |
1×4 | $0.00086 | 1 | 4 GiB |
2×8 | $0.00173 | 2 | 8 GiB |
4×16 | $0.00346 | 4 | 16 GiB |
8×32 | $0.00691 | 8 | 32 GiB |
16×64 | $0.01382 | 16 | 64 GiB |
1x2
: Text classification (e.g., Truss quickstart)4x16
: LayoutLM Document QA4x16+
: Sentence Transformers embeddings on larger corporaInstance | $/min | vCPU | RAM | GPU | VRAM |
---|---|---|---|---|---|
T4x4x16 | $0.01052 | 4 | 16 GiB | NVIDIA T4 | 16 GiB |
T4x8x32 | $0.01504 | 8 | 32 GiB | NVIDIA T4 | 16 GiB |
T4x16x64 | $0.02408 | 16 | 64 GiB | NVIDIA T4 | 16 GiB |
L4x4x16 | $0.01414 | 4 | 16 GiB | NVIDIA L4 | 24 GiB |
L4:2x4x16 | $0.04002 | 24 | 96 GiB | 2 NVIDIA L4s | 48 GiB |
L4:4x48x192 | $0.08003 | 48 | 192 GiB | 4 NVIDIA L4s | 96 GiB |
A10Gx4x16 | $0.02012 | 4 | 16 GiB | NVIDIA A10G | 24 GiB |
A10Gx8x32 | $0.02424 | 8 | 32 GiB | NVIDIA A10G | 24 GiB |
A10Gx16x64 | $0.03248 | 16 | 64 GiB | NVIDIA A10G | 24 GiB |
A10G:2x24x96 | $0.05672 | 24 | 96 GiB | 2 NVIDIA A10Gs | 48 GiB |
A10G:4x48x192 | $0.11344 | 48 | 192 GiB | 4 NVIDIA A10Gs | 96 GiB |
A10G:8x192x768 | $0.32576 | 192 | 768 GiB | 8 NVIDIA A10Gs | 188 GiB |
V100x8x61 | $0.06120 | 16 | 61 GiB | NVIDIA V100 | 16 GiB |
A100x12x144 | $0.10240 | 12 | 144 GiB | 1 NVIDIA A100 | 80 GiB |
A100:2x24x288 | $0.20480 | 24 | 288 GiB | 2 NVIDIA A100s | 160 GiB |
A100:3x36x432 | $0.30720 | 36 | 432 GiB | 3 NVIDIA A100s | 240 GiB |
A100:4x48x576 | $0.40960 | 48 | 576 GiB | 4 NVIDIA A100s | 320 GiB |
A100:5x60x720 | $0.51200 | 60 | 720 GiB | 5 NVIDIA A100s | 400 GiB |
A100:6x72x864 | $0.61440 | 72 | 864 GiB | 6 NVIDIA A100s | 480 GiB |
A100:7x84x1008 | $0.71680 | 84 | 1008 GiB | 7 NVIDIA A100s | 560 GiB |
A100:8x96x1152 | $0.81920 | 96 | 1152 GiB | 8 NVIDIA A100s | 640 GiB |
H100x26x234 | $0.16640 | 26 | 234 GiB | 1 NVIDIA H100 | 80 GiB |
H100:2x52x468 | $0.33280 | 52 | 468 GiB | 2 NVIDIA H100s | 160 GiB |
H100:4x104x936 | $0.66560 | 104 | 936 GiB | 4 NVIDIA H100s | 320 GiB |
H100:8x208x1872 | $1.33120 | 208 | 1872 GiB | 8 NVIDIA H100s | 640 GiB |
H100MIG:3gx13x117 | $0.08250 | 13 | 117 GiB | Fractional NVIDIA H100 | 40 GiB |