Home Solutions LenovoLenovo ThinkSystem SR860 V3 — 4-Socket with 4x NVIDIA L40S

Lenovo

Lenovo ThinkSystem SR860 V3 — 4-Socket with 4x NVIDIA L40S

Four L40S GPUs for cost-efficient inference, generative AI serving, and VDI.

Request Quote Download Datasheet

Full manufacturer warrantyAuthorized channel48-hour quote

We help you choose, configure, and deliver the right system — no obligation.

Lenovo ThinkSystem SR860 V3 — 4-Socket with 4x NVIDIA L40S — Lenovo enterprise hardware

Lenovo ThinkSystem SR860 V3 — 4-Socket with 4x NVIDIA L40S hardware detail 1

Lenovo ThinkSystem SR860 V3 — 4-Socket with 4x NVIDIA L40S hardware detail 2

Lenovo ThinkSystem SR860 V3 — 4-Socket with 4x NVIDIA L40S hardware detail 3

Configuration at a Glance

GPU / Accelerator4x NVIDIA L40S (48GB GDDR6 each)

CPU4x Intel Xeon Scalable (4th/5th Gen)

MemoryUp to 4TB DDR5 (configurable)

StorageNVMe / SAS hot-swap (configurable); M.2 boot

Tailored per engagement. Full technical overview below.

Configuration Options

Core specifications for this system. Every component is configurable to your workload — request a quote for a tailored build.

GPU / Accelerator

4x NVIDIA L40S (48GB GDDR6 each)

Processor

4x Intel Xeon Scalable (4th/5th Gen)

Memory

Up to 4TB DDR5 (configurable)

Storage

NVMe / SAS hot-swap (configurable); M.2 boot

Overview

This ThinkSystem SR860 V3 is configured with four NVIDIA L40S accelerators to balance inference throughput, generative AI serving, and graphics-rich VDI on a single four-socket node. It is integrated, validated under mixed GPU load, and delivered warranty-backed through Lenovo-authorized distribution.

Who This Solution Is For

Teams serving inference and generative AI at scale

VDI and virtual workstation administrators

Enterprises needing versatile GPU compute

Organizations optimizing GPU cost-per-stream

Business Benefits

Versatile acceleration

The L40S handles inference, fine-tuning, and graphics workloads, maximizing utilization of a single shared platform.

Efficient cost-per-stream

Four L40S GPUs deliver strong throughput per watt for serving and VDI without top-tier training-GPU cost.

Headroom to grow

The 4U four-socket chassis leaves room to add GPUs, memory, and storage as demand rises.

Typical Business Use Cases

LLM and generative AI inference serving

Virtual desktop and virtual workstation hosting

Real-time video and rendering pipelines

Mixed AI and visualization workloads

Industry Applications

Media & EntertainmentSaaS & SoftwareFinancial ServicesAI & Machine Learning

Technical Overview

The SR860 V3 four-socket platform supports multiple double-wide GPUs over PCIe Gen5; this build pairs four NVIDIA L40S accelerators with 4th/5th Gen Intel Xeon Scalable CPUs and DDR5 memory. The XClarity Controller, redundant power, and flexible NVMe storage make it a dependable shared inference and VDI host.

GPU / Accelerator	4x NVIDIA L40S (48GB GDDR6 each)
CPU	4x Intel Xeon Scalable (4th/5th Gen)
Memory	Up to 4TB DDR5 (configurable)
Storage	NVMe / SAS hot-swap (configurable); M.2 boot
Networking	OCP 3.0 + PCIe Gen5; 25/100GbE
Form Factor	4U four-socket rackmount
Management	Lenovo XClarity Controller
Power	Redundant hot-swap PSUs (N+1)
Warranty	Lenovo 3-year (upgradeable to 5-year, on-site)

Specifications are indicative and configured to each engagement. Request a quote for a configuration tailored to your requirements.

Warranty, Support & Fulfillment

Every system ships from an authorized channel, configured and tested, with the documentation enterprise buyers need — backed by warranty and a dedicated account team.

Enterprise Warranty

Full manufacturer warranty with optional on-site, next-business-day support and extended coverage.

Authorized Channel

Sourced through Tier-1 distribution and OEM partners — never grey market. Asset & warranty records included.

Lead Time & Deployment

48-hour quotes, then configured, burn-in tested, and delivered on a committed schedule.

Nationwide Fulfillment

Coordinated logistics, rack-and-stack, and delivery wherever your infrastructure lives.

Frequently Asked Questions

L40S or H100 for our workload?

For inference, generative AI serving, fine-tuning, and VDI, the L40S offers excellent value and versatility; the H100 is better for large-scale training. We match the GPU to your dominant workload.

How many VDI users can this support?

User density depends on profile (knowledge worker versus 3D designer) and vGPU sizing. Four L40S GPUs support substantial mixed VDI; we size vGPU profiles to your user mix.

Can we expand the GPU count later?

Yes — the SR860 V3 supports additional double-wide GPUs within the chassis, so capacity can grow with demand under power and cooling constraints.

Hardware Assistance

Configure the Lenovo ThinkSystem SR860 V3 — 4-Socket with 4x NVIDIA L40S with Nexus Compute

Tell us your requirements and a hardware specialist will help you specify, configure, and quote the right system — typically within two business days. No obligation.

Request Quote Speak to an Infrastructure Specialist