Intel Launches Sapphire Rapids Fourth-Gen Xeon CPUs and Ponte Vecchio Max GPU Series (2024)

After years of delays, Intel formally launched its fourth-gen Xeon Scalable Sapphire Rapids CPUs, in both regular and HBM-infused Max flavors, and its "Ponte Vecchio" Data Center GPU Max Series today. Intel's expansive portfolio of 52 new CPUs will face off with AMD's EPYC Genoa lineup that debuted last year. The company also slipped in a low-key announcement of its last line of Optane Persistent Memory DIMMs.

While AMD's chips maintain the core count lead with a maximum of 96 cores on a single chip, Intel's Sapphire Rapids chips bring the company up to a maximum of 60 cores, a 50% improvement over its previous peak of 40 cores with the third-gen Ice Lake Xeons. Intel claims this will lead to a 53% improvement in general compute over its prior-gen chips, but largely avoided making direct comparisons to AMD's chips during its presentations. However, Intel has provided samples to the press for unrestricted third-party reviews, so it isn't shying away from the competition.
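The core-count comparison above comes down to two ratios. As a quick check, here is a minimal Python sketch using only the figures quoted in this article; the function and constant names are illustrative, not anything Intel or AMD publishes.

```python
def percent_increase(old: float, new: float) -> float:
    """Return the percentage increase when going from old to new."""
    return (new - old) / old * 100.0


# Peak core counts as quoted in the article.
ICE_LAKE_MAX_CORES = 40          # third-gen Xeon
SAPPHIRE_RAPIDS_MAX_CORES = 60   # fourth-gen Xeon
GENOA_MAX_CORES = 96             # AMD EPYC Genoa

gen_on_gen = percent_increase(ICE_LAKE_MAX_CORES, SAPPHIRE_RAPIDS_MAX_CORES)
amd_lead = percent_increase(SAPPHIRE_RAPIDS_MAX_CORES, GENOA_MAX_CORES)

print(f"Sapphire Rapids over Ice Lake: +{gen_on_gen:.0f}% cores")
print(f"Genoa over Sapphire Rapids:    +{amd_lead:.0f}% cores")
```

Core count alone says nothing about per-core performance, so these ratios describe only the size of the gap, not the benchmark outcome.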

Sapphire Rapids leans heavily into new acceleration technologies that can either be purchased outright or bought through a new pay-as-you-go model. These new purpose-built accelerator regions of the chip are designed to radically boost performance in several types of work, like compression, encryption, data movement, and data analytics, that typically require discrete accelerators for maximum performance.

Despite having a clear core count lead, AMD doesn't have similar acceleration features for its Genoa processors. When employing the new accelerators, Intel claims an average 2.9X improvement in performance-per-watt over its own previous-gen models in some workloads. Intel also claims a 10X improvement in AI inference and training, and a 3X improvement in data analytics workloads.

Intel's Sapphire Rapids, which comes fabbed on the 'Intel 7' process, also brings a host of new connectivity technologies, like support for PCIe 5.0, DDR5 memory, and the CXL 1.1 interface (type 1 and 2 devices), giving the company a firmer footing against AMD's Genoa. We're hard at work benchmarking the chips for our full review that we will post in the coming days, but in the interim, here's a brief overview of the new lineup.

Intel 4th-Gen Xeon Scalable Sapphire Rapids Pricing and Specifications

Intel Xeon Sapphire Rapids Accelerators

Intel's new on-die accelerators are a key new component for its Sapphire Rapids processors. As mentioned above, you can either purchase chips with all of the accelerator options activated, or you can opt for less expensive models and purchase accelerator licenses as needed through the Intel On Demand service. Not all chips have the same accelerator options, which we'll cover below.

Intel hasn't provided a pricing guide for the accelerators yet, but the licenses will be provided through server OEMs and are activated via software and a licensing API. Instead of buying a full license outright, you can also opt for a pay-as-you-go feature with usage metering to measure how much of a service you use. This feature will likely be popular among CSPs.

The idea behind the Intel On Demand service is to allow customers to activate and pay for only the features they need, while also providing a future upgrade path that doesn't require buying new servers or processors. Instead, customers could opt to employ acceleration engines to boost performance. This also allows Intel and its partners to carve multiple types of SKUs from the same functional silicon, thus simplifying supply chains and reducing costs.

These functions continue Intel's long history of bringing fixed-function accelerators onto the processor die. Still, the powerful units on Sapphire Rapids will require software support to extract their full performance. Intel is already working with several software providers to enable support in a broad range of applications, many of which you can see in the album above.

Intel has four types of accelerators available with Sapphire Rapids. The Data Streaming Accelerator (DSA) improves data movement by offloading data-copy and data-transformation operations from the CPU. The Dynamic Load Balancer (DLB) accelerator provides packet prioritization and dynamically balances network traffic across the CPU cores as the system load fluctuates.

Intel also has an In-Memory Analytics Accelerator (IAA) that accelerates analytics performance and offloads the CPU cores, thus improving database query throughput and other functions.

Intel has also brought its Quick Assist Technology (QAT) accelerators onboard the CPU. This function used to reside on the chipset. This hardware offload accelerator augments cryptography and compression/decompression performance. Intel has employed QAT accelerators for quite some time, so this technology already enjoys broad software support.

Unfortunately, the chips have varying acceleration capabilities: you can't buy four 'devices' on all models. The Sapphire Rapids processors come in two types of designs (Die Chops), as listed in the SKU table. The XCC chips consist of four dies, and each die has one of each accelerator (IAA, QAT, DSA, DLB). That means you can activate a maximum of four accelerators of each type on these chips (for example, 4 IAA, 4 QAT, 4 DSA, 4 DLB).

In contrast, some of the chips use a single MCC die, so they only have one IAA and DSA accelerator and two each of the QAT and DLB accelerators (2 QAT, 2 DLB, 1 IAA, 1 DSA).
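The per-design accelerator limits described above can be written down as a small lookup. The following is a minimal Python sketch built only from the counts in this article; the `DieDesign` class and `can_activate` helper are hypothetical names for illustration and have nothing to do with Intel's actual On Demand licensing API.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class DieDesign:
    """A Sapphire Rapids die design with its accelerators per die."""
    name: str
    dies: int
    per_die: Dict[str, int]

    def max_devices(self) -> Dict[str, int]:
        """Maximum accelerator devices of each type on the whole chip."""
        return {acc: self.dies * count for acc, count in self.per_die.items()}


# XCC: four dies, one of each accelerator per die (per the article).
XCC = DieDesign("XCC", dies=4, per_die={"IAA": 1, "QAT": 1, "DSA": 1, "DLB": 1})
# MCC: a single die with 2 QAT, 2 DLB, 1 IAA, 1 DSA (per the article).
MCC = DieDesign("MCC", dies=1, per_die={"IAA": 1, "QAT": 2, "DSA": 1, "DLB": 2})


def can_activate(design: DieDesign, accelerator: str, count: int) -> bool:
    """Hypothetical check: does the silicon have enough devices of this type?"""
    return 0 < count <= design.max_devices().get(accelerator, 0)


for design in (XCC, MCC):
    print(design.name, design.max_devices())
```

Note that this captures only the silicon ceiling for each design. Individual SKUs may ship with fewer devices enabled, which is what the SKU table and the On Demand licenses determine.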

Intel Max CPU Series and Ponte Vecchio Max GPU Series

Intel recently announced details about its forthcoming Xeon Max Series of CPUs and the Intel Data Center GPU Max Series (Ponte Vecchio). Today marks the formal launch.

Intel's HBM2e-equipped Max CPU models come to market with 32 to 56 cores and are based on the standard Sapphire Rapids design. These chips are the first x86 processors to employ HBM2e memory on-package, thus providing a larger 64GB pool of local memory for the processor. The HBM memory will help with memory-bound workloads that aren't as sensitive to core counts, so the Max models come with fewer cores than standard models. Target workloads include computational fluid dynamics, climate and weather forecasting, AI training and inference, big data analytics, in-memory databases, and storage applications.

The Max CPUs can operate in several configurations: with the HBM memory used for all memory operations (HBM only, with no DDR5 memory required), in an HBM 'Flat Mode' that presents the HBM as a separate memory region (this requires extensive software support), or in an HBM 'Caching Mode' that uses the HBM2e as a DRAM-backed cache. The latter requires no code changes and will likely be the most frequently used mode of operation.
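To make the difference between the three modes concrete, here is a minimal Python sketch of how much memory software would see in each one. It is a simplified model, not Intel's configuration interface: the names are hypothetical, and it assumes that in Caching Mode the HBM is transparent (so only the DDR5 capacity is addressable) and that no DDR5 is installed in HBM-only mode. Where the article does not say whether code changes are needed, the sketch records `None`.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional

HBM_GB = 64  # on-package HBM2e capacity per processor, per the article


class HbmMode(Enum):
    HBM_ONLY = "HBM only"
    FLAT = "Flat Mode"
    CACHE = "Caching Mode"


@dataclass
class MemoryView:
    addressable_gb: int                   # capacity software can address
    regions: int                          # distinct memory regions exposed
    code_changes_needed: Optional[bool]   # None = not stated in the article


def memory_view(mode: HbmMode, ddr5_gb: int = 0) -> MemoryView:
    """Simplified model of what software sees in each HBM mode."""
    if mode is HbmMode.HBM_ONLY:
        # Assumption: no DDR5 installed, so HBM is the only memory.
        return MemoryView(HBM_GB, 1, None)
    if ddr5_gb <= 0:
        raise ValueError(f"{mode.value} needs DDR5 installed")
    if mode is HbmMode.FLAT:
        # HBM and DDR5 appear as separate regions; software must place data.
        return MemoryView(HBM_GB + ddr5_gb, 2, True)
    # Caching Mode (assumption): HBM fronts DDR5 and is not addressable.
    return MemoryView(ddr5_gb, 1, False)


for mode in HbmMode:
    ddr5 = 0 if mode is HbmMode.HBM_ONLY else 512
    print(mode.value, memory_view(mode, ddr5))
```

The trade-off the model highlights is that Flat Mode exposes the most addressable capacity but pushes data placement onto software, while Caching Mode gives up the HBM as addressable space in exchange for needing no code changes.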

The Xeon Max CPUs will square off with AMD's EPYC Milan-X processors, which come with a 3D-stacked L3 cache called 3D V-Cache. The Milan-X models have up to 768MB of total L3 cache per chip that delivers an incredible amount of bandwidth, but it doesn't provide as much capacity as Intel's approach with HBM2e. Both approaches have their respective strengths and weaknesses, so we're eager to put the Xeon Max processors to the test.

Notably, Fujitsu's A64FX Arm processor uses a similar HBM technique. The HBM-equipped A64FX processors power the Fugaku supercomputer, which was the fastest in the world for several years (until the AMD-powered exascale-class Frontier took over last year). Fugaku still maintains the second spot on the Top500.

Intel also launched its Max GPU Series, previously code-named Ponte Vecchio. Intel had previously unveiled the three different GPU models, which come in both standard PCIe and OAM form factors. You can read more about the Max GPU Series here.

Intel Optane Persistent Memory (PMem) 300

As part of its Sapphire Rapids launch, Intel quietly introduced the final series of Optane Persistent Memory DIMMs. The final generation, codenamed Crow's Pass but officially known as the Intel Optane Persistent Memory 300, will come in 128, 256, and 512 GB capacities and operate at DDR5-4400. That's a big improvement over the previous peak of DDR4-3200, but it also means that Sapphire Rapids systems will have to downclock the standard memory to DDR5-4400 from the supported DDR5-4800 if they plan on employing Optane.
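The cost of that downclock can be estimated from the transfer rates alone. The sketch below is a rough Python calculation of theoretical peak bandwidth per memory channel, assuming a 64-bit (8-byte) data path per channel; real-world throughput will be lower, and the function name is illustrative.

```python
BYTES_PER_TRANSFER = 8  # assumption: 64-bit data path per memory channel


def peak_gb_per_s(mega_transfers_per_s: int) -> float:
    """Theoretical peak bandwidth of one channel in GB/s."""
    return mega_transfers_per_s * BYTES_PER_TRANSFER / 1000


ddr4_3200 = peak_gb_per_s(3200)  # previous Optane generation's peak speed
ddr5_4400 = peak_gb_per_s(4400)  # Optane PMem 300 speed
ddr5_4800 = peak_gb_per_s(4800)  # Sapphire Rapids' supported DDR5 speed

# Bandwidth given up by running standard DDR5 at 4400 instead of 4800.
downclock_loss_pct = (ddr5_4800 - ddr5_4400) / ddr5_4800 * 100
# Gain of the new Optane interface speed over DDR4-3200.
gen_gain_pct = (ddr5_4400 - ddr4_3200) / ddr4_3200 * 100

print(f"DDR5-4800: {ddr5_4800:.1f} GB/s per channel")
print(f"DDR5-4400: {ddr5_4400:.1f} GB/s per channel")
print(f"Downclock cost: {downclock_loss_pct:.1f}%")
print(f"DDR5-4400 over DDR4-3200: +{gen_gain_pct:.1f}%")
```

Under these assumptions the downclock costs a little over 8% of theoretical peak bandwidth per channel, while the move from DDR4-3200 to DDR5-4400 is a gain of 37.5%.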

Intel claims that the 300-series offers 56% more sequential bandwidth and 214% more bandwidth in random workloads, along with support for up to 4TB of Optane per socket, or 6TB total for a system. Just like the previous-gen Optane 200 series, the DIMMs operate at 15W. However, they now step up to a DDR-T2 interface and AES-XTS 256-bit encryption.
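"Percent more" figures are easy to misread as multipliers, so a short conversion helps. This Python sketch turns Intel's claimed gains into speedup factors and works out how many of the largest DIMMs the per-socket limit implies. It assumes 1TB = 1024GB, and the names are illustrative.

```python
def multiplier_from_percent_more(percent_more: float) -> float:
    """Convert 'X% more' into a multiplier over the baseline."""
    return 1.0 + percent_more / 100.0


# Intel's claimed gains for the 300-series over the prior generation.
sequential_x = multiplier_from_percent_more(56)
random_x = multiplier_from_percent_more(214)

# Capacity: how many of the largest DIMMs fill the per-socket limit?
MAX_OPTANE_PER_SOCKET_GB = 4 * 1024  # 4TB per socket (assuming 1TB = 1024GB)
LARGEST_DIMM_GB = 512
dimms_per_socket = MAX_OPTANE_PER_SOCKET_GB // LARGEST_DIMM_GB

print(f"Sequential bandwidth: {sequential_x:.2f}x the previous generation")
print(f"Random bandwidth:     {random_x:.2f}x the previous generation")
print(f"512GB DIMMs needed for 4TB per socket: {dimms_per_socket}")
```

So "214% more" means roughly 3.14 times the previous random-workload bandwidth, not 2.14 times.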

At its debut in 2015, Intel and partner Micron touted the underlying tech, 3D XPoint, as delivering 1000x the performance and endurance of NAND storage in tandem with 10x the density of DRAM, but the technology is now winding its way to an end. Intel has already stopped producing its Optane storage products for client PCs, which makes sense because it is selling its NAND business to SK Hynix.

However, Intel has retained its memory business for the data center, including its persistent memory DIMMs that can function as an adjunct to main memory, a capability only Intel offers. Those products will also not see any future generations after the 300-series modules.

Intel cites an industry shift to CXL-based architectures as a reason for winding down the Optane business, mirroring Intel's ex-partner Micron's sentiments when it exited the business last year. Sapphire Rapids supports both Optane DIMMs and the CXL interface, but this will be one of the last times the two are seen together: CXL will be the industry's preferred method of connecting exotic memories to chips in the future.

Testing for our Sapphire Rapids review is currently underway, so stay tuned for the full performance breakdown and architectural details in the coming days.

Paul Alcorn

Managing Editor: News and Emerging Tech

Paul Alcorn is the Managing Editor: News and Emerging Tech for Tom's Hardware US. He also writes news and reviews on CPUs, storage, and enterprise hardware.

61 Comments: Comment from the forums

Giroro
So Intel is making lower performing, less efficient, higher priced processors with confusing product segmentation and a supremely unethical upgrade model.
Let's see how that works out works out for them.
Reply
Tech0000
My take: Intel is betting on an investment in very specific accelerator silicon (as opposed to using that silicon budget to add more general cores etc) will be competitive against AMD EPYC many more core products.
The bet is that the largest investments that their customers do will be in server capacity for AI (obviously), data traffic/ encryption (also obviously) etc. etc. (read their marketing above) and not in general server performance improvements (across all work loads - even though there is some of that as well with gen 4).
No doubt AI etc. is important but it is a relatively big and risky bet, trading potential general server volume (sales that may slip to AMD with their many more core general high performing EPYC servers) vs. accelerated workload server volume.
They said they listened to their customers and this is what they said so...
We'll see it intel got the market prediction right...
It also felt like they got some of their customers to even say (in the intel's video stream today) that with the new gen4 Xeons they do not need special AI accelerators (from NVidia etc) since the new Xeons are good enough. Intel quickly jumped on that nice set up (in the video) and (re)introduced the "Democratization of AI" slogan. So in some ways it's a NVidia compete as well...
Again, We'll see...
Strategically, it feels like Intel is getting squeezed and is fighting a two front war AMD on one side and NVidia & co on the other side... not a comfortable position to be in...
Reply
jkflipflop98
Giroro said:
So Intel is making lower performing, less efficient, higher priced processors with confusing product segmentation and a supremely unethical upgrade model.
Let's see how that works out works out for them.
Must be opposite day here on Tom's.
Reply
Tech0000
Forgot to say that intel mentioned, they are going to launch the workstation W790 series W2400 and W3400 chips on February 15th.
hopefully they will be priced at a level that we can build reasonable HEDTs out of them. W3400 looks great on paper, but priced wrong, it will be out of reach and Intel consumer HEDT will stay dead. sigh...
Reply
jkflipflop98
A 24 core 13900k is $599 at newegg. I'm not sure what you're going on about "consumer HEDT will stay dead". Looks like it's alive and kicking to me.
Reply
Samlebon2306
These chip are going to get Threadripped really hard.
Reply
rluker5
Giroro said:
So Intel is making lower performing, less efficient, higher priced processors with confusing product segmentation and a supremely unethical upgrade model.
Let's see how that works out works out for them.
Depends on the workload.
Gaming GPUs have a hard time competing with Asics when they can mine the same crypto.
Reply
truerock
This is question that is not specific to these Intel CPUs.
Why do workstation and server CPUs run slower than desktop PC CPUs?
I googled it and all I'm getting is: more cores means more heat which means server CPUs need to run slower. But that is obviously not true.
Xeon SKU 6434 has only 8 cores and max turbo is only 4.1GHz.
The 24 (8p/16e) core Intel ‘Raptor Lake’ Core i9-13900K will run 5.8GHz.
Is there any hope of a Xeon 8-core CPU running boost 5.8GHz someday?
Thank you for any insight.
Reply
Amdlova
And Iam trying to get an 7259cl :) 24 cores of raw power and some optanes to play. Epyc has some Market but have amd problems. Cannot guarantees if will work right.
Reply
Diogene7
I am curious to see what will be the next gen Non Volatile Memory (NVM) options that will connect through CXL will appear on the market in a couple of years ?
And also what will be the latency because it will be usung the PCIe5 or PCIe6 bus, but Intel Optane DC PMEM were pluggable on the DRAM bus ?
Any guess ? Nantero carbon nanotube NRAM ? Everspin / Avalanche magnetic MRAM (STT-MRAM, SOT-MRAM, VG-SOT-MRAM,…) ?
Reply
