Guided Path: Mid-Size Company

Goal: serve more users and more traffic than a single-model startup box can handle, with a clear path to add capacity later without starting over.

1. The model

Apache 2.0

Qwen3-235B-A22B — Alibaba

Total parameters235B

Minimum memory282.0 GB

Math: 235B × 1 GB + 20% = 282.0 GB.

2. The build

A single DGX H100 node has 640GB of NVLink-connected memory — 2.3x what this model needs at minimum. That headroom is the point: it's what lets many concurrent users hit the model at once without queueing, and leaves room to run a second model alongside it.

NVIDIA press image

1x NVIDIA DGX H100 node

GPU / systemNVIDIA DGX H100 (8x H100 node) × 1

Combined memory640 GB

Total price$350,000

⚡ 8,500 W sustained draw

🏠 = 7.08x an average home (~1,200W continuous)

🔋 = drains a 90 kWh EV battery in 10.59 hrs

3. Room to grow

When traffic outgrows one node, the honest next step is a second DGX H100 wired to the first over NVLink/InfiniBand — a real cluster (see the Clusters page), not more desktop cards crammed into a case.

NVIDIA press image

2x NVIDIA DGX H100 nodes (networked cluster)

GPU / systemNVIDIA DGX H100 (8x H100 node) × 2

Combined memory1,280 GB

Total price$700,000

⚡ 17,000 W sustained draw

🏠 = 14.17x an average home (~1,200W continuous)

🔋 = drains a 90 kWh EV battery in 5.29 hrs