Hardware Datasheet · GPU Server
Nexus Compute
Nexus Compute HGX B200 8-GPU EPYC Inference Server
EPYC-driven Blackwell density for high-throughput, low-latency model serving.
Overview
This server pairs the NVIDIA HGX B200 8-GPU platform with dual AMD EPYC processors and abundant PCIe lanes, and is tuned for high-concurrency inference where memory bandwidth and tokens per second matter most. Nexus Compute specifies, configures, and tests each system, supplies it through authorized channels with warranty backing, and tunes GPU partitioning and networking to your latency and volume targets.
Specifications
| Component | Specification |
| --- | --- |
| GPU | NVIDIA HGX B200 8-GPU (180GB HBM3e each, 1.4TB total) |
| GPU Interconnect | 5th-gen NVLink + NVSwitch, 1.8TB/s per GPU |
| CPU | Dual AMD EPYC (9004/9005 series) |
| System Memory | Up to 3TB DDR5 ECC |
| Storage | Hot-swap NVMe array + M.2 boot |
| Networking | 8x ConnectX-7 adapters, up to 400Gb/s each (1:1 GPU:NIC) |
| GPU Partitioning | Multi-instance partitioning for multi-tenant serving |
| Form Factor | 8U–10U rackmount (air-cooled) |
| Warranty | Enterprise warranty with support options |
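
For capacity planning, the aggregate figures implied by the table can be worked out with a few lines of arithmetic. The Python sketch below is illustrative only: the constants are copied from the specification rows above, while the function names, the `headroom_fraction` default, and the weights-only fit check are assumptions made for this example, not a Nexus Compute sizing tool. Real deployments also need room for KV cache, activations, and runtime overhead, which depend on the serving stack.

```python
# Figures copied from the specification table.
GPU_COUNT = 8
HBM_PER_GPU_GB = 180       # HBM3e per GPU
NIC_COUNT = 8
NIC_GBIT_PER_S = 400       # "up to" line rate per adapter


def total_hbm_gb(gpus: int = GPU_COUNT, per_gpu_gb: int = HBM_PER_GPU_GB) -> int:
    """Total GPU memory across the system, in GB."""
    return gpus * per_gpu_gb


def aggregate_network_gbyte_per_s(nics: int = NIC_COUNT,
                                  gbit_per_s: int = NIC_GBIT_PER_S) -> float:
    """Peak aggregate network bandwidth in GB/s (8 bits per byte)."""
    return nics * gbit_per_s / 8


def weights_fit(params_billion: float, bytes_per_param: float,
                headroom_fraction: float = 0.3) -> bool:
    """Hypothetical weights-only check.

    Reserves `headroom_fraction` of total HBM (an assumed value) for
    KV cache, activations, and runtime overhead.
    """
    weights_gb = params_billion * bytes_per_param  # 1e9 params * bytes = GB
    usable_gb = total_hbm_gb() * (1 - headroom_fraction)
    return weights_gb <= usable_gb


print(total_hbm_gb())                   # 1440 GB, listed as 1.4TB in the table
print(aggregate_network_gbyte_per_s())  # 400.0 GB/s across all eight adapters
print(weights_fit(params_billion=70, bytes_per_param=2))
```

The 1:1 GPU:NIC layout means each GPU has its own adapter, so the aggregate network figure scales linearly with GPU count; actual throughput depends on fabric and workload.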
Typical Use Cases
- Production LLM and generative model serving
- High-concurrency, latency-sensitive inference
- Multi-tenant GPU serving with partitioning
- Retrieval-augmented generation at scale
Warranty & Support
Supplied through authorized channels with full manufacturer warranty. On-site, next-business-day support options available. Every system is configured, tested, and documented before delivery, with asset and warranty records provided for enterprise audit requirements.
Request a tailored quote
Configurations are tailored per engagement; contact us for pricing and lead times.
sales@nexus-compute.com
+1 737 276 1016
nexus-compute.com
Specifications are indicative and configured to each engagement. All product names, logos, and trademarks are the property of their respective owners. Nexus Compute is an independent enterprise hardware supplier.