AMD Archive

Turning off Zen 4’s op cache for curiosity and giggles

CPUs start executing instructions by fetching those instruction bytes from memory and decoding them into internal operations (micro-ops). Getting data from memory and operating on it consumes power and incurs latency. Micro-op caching is a popular technique to improve on both fronts, and involves caching micro-ops that correspond to frequently executed instructions. AMD’s recent CPUs have particularly large micro-op caches, or op caches for short. Zen 4’s op cache can hold 6.75K micro-ops, and has the highest capacity of any op cache across the past few generations of CPUs. This huge op cache enjoys high hitrates, and gives the feeling AMD is leaning harder on micro-op caching than Intel or Arm. That begs the question of how the core would handle if its big, beautiful op cache stepped out for lunch. ↫ Chester Lam at Chips and Cheese The results of turning off the op cache were far less dramatic than one would expect, and this mostly comes down to the processor having to wait on other bottlenecks anyway, like the memory, and a lot of tasks consisting of multiple types of operations which not all make use of op cache. While it definitely contributes to making Zen 4 cores faster overall, even without it, it’s still an amazing core that outperforms its Intel competition. As a sidenote, this is such a fun and weird thing to do and benchmark. It doesn’t serve much of a purpose, and the information gained isn’t very practical, but turning off specific parts of a processor and observing the consequences does create some insight into exactly how a modern processor works. There are so many different elements that make up a modern processor now, and just gigahertz or even the number of cores barely tells even half the story. Anyway, we need more of these weird benchmarks.

AMD deprioritizing flagship gaming GPUs

I had a chance to speak to Jack Huynh, AMD’s senior vice president and general manager of the Computing and Graphics Business Group, during IFA 2024 in a question and answer session. Due to speculation that AMD won’t launch flagship GPUs for its next-gen lineup, I pressed Huynh for information regarding the company’s plans for the high-end GPU market with the RDNA 4-powered Radeon RX 8000-series. His comments sketch out a plan focused specifically on gaining market share in the GPU market above all else, and this strategy deprioritizes chasing Nvidia’s highest-end gaming cards — at least for now. ↫ Paul Alcorn at Tom’s Hardware Reading through the actual comments, it seems that AMD is not going to chase the very, extreme high-end that NVIDIA serves, like the 4090 level of GPUs. Honestly, I’m completely okay with that – those high-end GPUs are insanely expensive, and unlike what YouTube and tech websites might suggest, nobody buys these GPUs. Consistently, for more than a decade now, it’s the xx60-xx70 levels of cards that dominate the market, and it’s smart of AMD (and Intel) to focus on that segment if you want to sell as many GPUs as possible. The very top of the GPU market just doesn’t make a lot price/performance sense. You pay considerably more for a 4090 compared to a 4080, but the price increase does not correspond to a similar increase in performance. It simply makes a lot more sense to save that money and spend it elsewhere, such as on a better CPU, more RAM, more storage, or a new display. I’d rather AMD not waste time and energy on making these high-end GPUs nobody buys, and instead focus on improving the GPUs people actually buy. And of course, AMD just hasn’t been able to match NVIDIA at the top end, and that’s probably not going to change any time soon. Releasing a high-end, expensive GPU, only to be trounced by your one competitor every single time is not a good look, so why even try?

AMD branch prediction optimization for Ryzen 9000 and 7000 CPUs now available in Windows 11 23H2

The Windows patch that’s supposed to improve Windows’ performance on AMD’s new Ryzen 9000 chips has been backported from Windows 11 24H2 to Windows 11 23H2. Now, AMD has confirmed that it’s bringing the same branch prediction optimizations to Windows 11 23H2 via backporting. The new update will be listed under the Windows Update menu in the “Optional Updates” section as “KB5041587” which makes things a lot easier for those who haven’t updated or opted into the 24H2 preview. ↫ Hassan Mujtaba at Wccftech If you’re using Windows on a Ryzen 9000 chip – or even other recent Ryzen chips, who may also see a benefit from this patch – you should strongly consider installing this patch.

AMD says Microsoft’s next big Windows 11 update will improve Zen 5 CPU performance

AMD says Microsoft’s upcoming Windows 11 version 24H2 update will improve performance for its new Zen 5 CPUs. The Ryzen 9000 series launched earlier this month, and failed to live up to AMD’s performance promises in most reviews. After rumors of a Windows bug, AMD has revealed that AMD-specific branch prediction code will be optimized in Windows 11 version 24H2, which is expected to ship next month. ↫ Tom Warren at The Verge It’s wild how seemingly small things can have a major impact on the launch of a new processor (or GPU) line these days. The main culprit behind the disappointing benchmarks upon launch of the Ryzen 9000 series turned out to be the 9000’s new branch prediction method, the code for which is not yet available in Windows. However, AMD ran their tests in “Admin mode”, which yielded results as if such code was actually present. AMD has said the branch prediction code needed to unlock the full potential of Ryzen 9000 chips in Windows and yield benchmark results comparable to AMD’s own internal tests and PR promiseswill be released next month as part of “Windows 11, version 24H2 in preview through the Windows Insider Program (Release Preview Channel – Build 26100) or by downloading the ISO here“. AMD claims this update will benefit users of previous Zen 5 and Zen 3 processors as well, but to a lesser degree. No word on if this issue affects Linux users in any way.

The AMD Zen 5 microarchitecture: powering Ryzen AI 300 series for mobile and Ryzen 9000 for desktop

Built around the new Zen 5 CPU microarchitecture with some fundamental improvements to both graphics and AI performance, the Ryzen AI 300 series, code-named Strix Point, is set to deliver improvements in several areas. The Ryzen AI 300 series looks set to add another footnote in the march towards the AI PC with its mobile SoC featuring a new XDNA 2 NPU, from which AMD promises 50 TOPS of performance. AMD has also upgraded the integrated graphics with the RDNA 3.5, which is designed to replace the last generation of RDNA 3 mobile graphics, for better performance in games than we’ve seen before. Further to this, during AMD’s recent Tech Day last week, AMD disclosed some of the technical details regarding Zen 5, which also covers a number of key elements under the hood on both the Ryzen AI 300 and the Ryzen 9000 series. On paper, the Zen 5 architecture looks quite a big step up compared to Zen 4, with the key component driving Zen 5 forward through higher instructions per cycle than its predecessor, which is something AMD has managed to do consistently from Zen to Zen 2, Zen 3, Zen 4, and now Zen 5. ↫ Gavin Bonshor at AnandTech Not the review and deep analysis quite yet, but a first thorough look at what Zen 5 is going to bring us, straight from AnandTech.

AMD unveils Ryzen 9000 CPUs for desktop, Zen 5 takes center stage at Computex 2024

In regards to performance, AMD is touting an average (geomean) IPC increase in desktop workloads for Zen 5 of 16%. And with the new desktop Ryzen chips’ turbo clockspeeds remaining largely identical to their Ryzen 7000 predecessors, this should translate into similar performance expectations for the new chips. The AMD Ryzen 9000 series will also launch on the AM5 socket, which debuted with AMD’s Ryzen 7000 series and marks AMD’s commitment to socket/platform longevity. Along with the Ryzen 9000 series will come a pair of new high-performance chipsets: the X870E (Extreme) and the regular X870 chipsets. The fundamental features that vendors will integrate into their specific motherboards remain tight-lipped. Still, we do know that USB 4.0 ports are standard on the X870E/X870 boards, along with PCIe 5.0 for both PCIe graphics and NVMe storage, with higher AMD EXPO memory profile support expected than previous generations. ↫ Gavin Bonshor at AnandTech I absolutely love that AMD maintains compatibility with its chipset and socket generations as well as it does. I’m currently running a Ryzen 9 7900X, and I see no reason to upgrade any time soon, but it’s good to know I’ll at least have otions once the time comes. Compare this to Intel, which broke compatibility pretty much intentionally almost every generation for years now, and this is a huge win for consumers. Of course, as AMD regains more and more of its foothold across the market, it will eventually also resort to the kind of tactics Intel has been using while it pretty much had the market to itself. It’s only a matter of time before we’ll see the first new Ryzen generation that mysteriously requires a new socket or chipset out of the blue.

AMD unveils their Embedded+ architecture, Ryzen Embedded with Versal together

One area of AMD’s product portfolio that doesn’t get as much attention as the desktop and server parts is their Embedded platform. AMD’s Embedded series has been important for on-the-edge devices, including industrial, automotive, healthcare, digital gaming machines, and thin client systems. Today, AMD has unveiled their latest Embedded architecture, Embedded+, which combines their Ryzen Embedded processors based on the Zen+ architecture with their Versal adaptive SoCs onto a single board. ↫ Gavin Bonshor at AnandTech Machines with these chips will flood the used market a few years from now, and they’re going to be great buys for all kinds of fun projects – and because the corporate world buys these machines by the truckload, they show up on eBay at impulse prices within years. Sometimes, you can even buy cheap whole lots of these kinds of boxes. They often tend to be a little weird, and come with features and trinkets normal computers don’t come with, which is always good for some weekend fun. Cathode Ray Dude is currently doing a series on these little things on YouTube, and there’s always something weird to discover about what kind of odd features and design choices these machines possess. If there’s interest from you, our lovely readers, I can see if I can snatch up a few weird ones from eBay and write about what kind of fun projects you can do with these. You can usually run Linux on these, the embedded versions of Windows, and if they’re not too weird, they could probably serve as a cheap Haiku box, too.

AMD says overclocking blows a hidden fuse on Ryzen Threadripper 7000 to show if you’ve overclocked the chip, but it doesn’t automatically void your CPU’s warranty

A recent discovery that overclocking AMD’s latest chips blows a fuse to denote the chip has been overclocked has led to slightly misleading claims that it will automatically void the chips’ warranty for any type of failure. However, AMD clarified to Tom’s Hardware that overclocking AMD’s Ryzen Threadripper Pro 7000 (Storm Peak) and non-Pro lineup, among the best workstation CPUs, doesn’t automatically void the processor’s warranty. ↫ Zhiye Liu at Tom’s Hardware Something about these fuses in processors doesn’t sit right with me.

AMD begins Polaris and Vega GPU retirement process, reduces ongoing driver support

As AMD is now well into their third generation of RDNA architecture GPUs, the sun has been slowly setting on AMD’s remaining Graphics Core Next (GCN) designs, better known by the architecture names of Polaris and Vega. In recent weeks the company dropped support for those GPU architectures in their open source Vulkan Linux driver, AMDVLK, and now we have confirmation that the company is slowly winding down support for these architectures in their Windows drivers as well. Under AMD’s extended driver support schedule for Polaris and Vega, the drivers for these architectures will no longer be kept at feature parity with the RDNA architectures. And while AMD will continue to support Polaris and Vega for some time to come, that support is being reduced to security updates and “functionality updates as available.” What’s odd is that AMD is still selling these as integrated GPUs to this day, and they, too, are getting this treatment. That’s a pretty shitty deal for people buying these products today.

AMD unveils Ryzen Threadripper 7000 family: 96 core Zen 4 for workstations and HEDT

Being announced today by AMD for a November 21st launch, this morning AMD is taking the wraps off of their Ryzen 7000 Threadripper CPUs. These high-end chips are being split up into two product lines, with AMD assembling the workstation-focused Ryzen Threadripper 7000 Pro series, as well as the non-pro Ryzen Threadripper 7000 series for the more consumer-ish high-end desktop (HEDT) market. Both chip lines are based on AMD’s tried and true Zen 4 architecture – derivatives of AMD’s EPYC server processors – incorporating AMD’s Zen 4 chiplets and a discrete I/O dies. As with previous generations of Threadripper parts, we’re essentially looking at the desktop version of AMD’s EPYC hardware. With both product lines, AMD is targeting customer bases that need CPUs more powerful than a desktop Ryzen processor, but not as exotic (or expensive) as AMD’s server wares. This means chips with lots and lots of CPU cores – up to 96 in the case of the Threadripper 7000 Pro series – as well as support for a good deal more I/O and memory. The amount varies with the specific chip lineup, but both leave Ryzen 7000 and its 16 cores and 24 PCIe lanes in the dust. I’m hoping these will eventually find their way to eBay, so that around five years from now, I can replace my dual-Xeon workstation with a Threadripper machine.

ECC RAM on AMD Ryzen 7000 desktop CPUs

ECC support has been standard on Ryzen processors, but with the recent introduction of the Ryzen 7000 series and the new AM5 socket, any mention of ECC was dropped from specification pages and similar documentation. It turns out, though, that there’s more to this story. A couple months ago I came across a topic on the ASRock forums talking about ECC support on AM5 motherboards, in which a user called ApplesOfEpicness said that they’d worked with an AMD engineer to get ECC RAM going within AMD’s AGESA firmware. They’d claimed to have tested it on an ASRock motherboard with an updated UEFI, by shorting ground and data pins, and seeing errors be reported up to the OS. I was intrigued by this! Even though I didn’t have the same motherboard that ApplesOfEpicness did, I had chosen an ASRock board (the B650E PG Riptide)—I had figured that if ECC was possible on any AM5 board at all, it would be supported on ASRock. So based on the forum post, last week I ordered a pair of 32 GB server-grade ECC sticks from v-color. I updated my motherboard’s UEFI to the latest version (version 1.28 with AGESA 1.0.0.7b), and then replaced my existing RAM with the new sticks. I started up the system, and after a very long link training process… it booted up! It boots, but does it actually work? This may seem like a simple question to answer, but it turns out it’s a lot harder to verify working ECC than you might think. Excellent investigative work by the author, Rain.

Hot Chips 2023: AMD’s Phoenix SoC

Phoenix is the latest addition to AMD’s long line of APUs (chips with integrated graphics). Ever since Picasso launched with Zen cores and Vega graphics, AMD’s APUs saw massive improvements from generation to generations. That’s largely because AMD started from so far behind. But Zen 2 and Zen 3 APUs were already very solid products, so Phoenix’s improvements make it a very dangerous competitor. AMD has put a lot of focus into reducing power consumption across every area of the chip. Zen 4 cores do an excellent job on the CPU side, while RDNA 3 provides strong graphics performance. Hardware offload helps power efficiency on specialized AI and audio processing workloads. To support all this, Infinity Fabric gets lower power states and very flexible clock behavior. Phoenix ends up being able to perform well across a wide range of form factors and power targets. These are the kinds of chips powering the current slew of mobile gaming devices like the Steam Deck and its various competitors. It’s great to see this market segment take off, mostly thanks to AMD and Valve, but I’m going to hold off just one or two generations more before jumping in. If AMD’s pace of improvement continues, these handheld devices are going to become even thinner and lighter. That being said, I’d still love to review a Steam Deck for OSNews, specifically because of its Linux base. Maybe I’ll run into an acceptable deal at some point soon.

AMD intros EPYC 97×4 “Bergamo” CPUs: 128 Zen 4c cores for servers

Kicking off a busy day of product announcements and updates for AMD’s data center business group, this morning AMD is finally announcing their long-awaited high density “Bergamo” server CPUs. Based on AMD’s density-optimized Zen 4c architecture, the new EPYC 97×4 chips offer up to 128 CPU cores, 32 more cores than AMD’s current-generation flagship EPYC 9004 “Genoa” chips. According to AMD, the new EPYC processors are shipping now, though we’re still awaiting further details about practical availability. There is so much competition in the processor space at the moment – it’s just great. Few of us will ever get to use or even see these processors, but eventually, technologies developed for the very high end of the today will make their way down to the attainable end of tomorrow.

Bulldozer, AMD’s crash modernization: frontend and execution engine

But as you might think, nobody at AMD envisioned it that way in the planning or design stages. No engineer would ever start working with the idea to “build a shit product”; a recent chat with an engineer who was at AMD during Bulldozer’s development gave us additional insight on what the original goals for the architecture were. AMD originally wanted Bulldozer to be like K10, but with a shared frontend and FPU. In one architecture, AMD would improve single threaded performance while massively increasing multithreaded performance, and move to a new 32 nm node at the same time. But those goals were too ambitious, and AMD struggled to keep clock frequency up on the 32 nm process. This resulted in cuts to the architecture, which started to stack up. The last AMD processor I used pre-Zen was a Phenom II, which was a fine processor for the price. However, after that, it quickly became clear that Intel had taken the lead. As such, I never experienced this era of AMD, and I think many of you will have had the same experience. This makes articles like these incredibly interesting.

AMD EPYC Genoa gaps Intel Xeon in stunning fashion

The AMD EPYC 9004 series, codenamed “Genoa” is nothing short of a game-changer. We use that often in the industry, but this is not a 15-25% generational improvement. The new AMD EPYC Genoa changes the very foundation of what it means to be a server. This is a 50-60% (or more) per-socket improvement, meaning we get a 3:2 or 2:1 consolidation just from a generation ago. If you are coming from 3-5 year-old Xeon Scalable (1st and 2nd Gen) servers to EPYC, the consolidation potential is even more immense, more like 4:1. This new series is about much more than just additional cores or a few new features. AMD EPYC Genoa is a game-changer, and we are going to go in-depth as to why in this article. These are absolutely monster processors, and widen the already existing gap between AMD and Intel in the server space even more.

AMD’s next-gen Radeon RX 7900 XTX and 7900 XT launch December 13, start at $999

AMD is gearing up to launch its next-generation Radeon RX 7000-series GPUs next month, and today the company shared more details about the cards’ pricing, performance levels, and the new RDNA 3 GPU architecture that will power all of its graphics cards for the next couple of years. The launch begins at the high end, with the Radeon RX 7900 XTX and RX 7900 XT. AMD will launch both of these GPUs on December 13, with the 7900 XTX starting at $999 and the XT starting at $899 (cards made by AMD’s partners will surely push these prices upward a bit). Both of these price tags undercut Nvidia’s RTX 4000 series, which starts at $1,599 for the top-tier GeForce RTX 4090 and $1,199 for the RTX 4080. Graphics cards have become insanely expensive. While AMD’s prices undercut NVIDIA, they’re still bonkers expensive. Assuming you’ll be able to even find them at these prices to begin with.

AMD gives details on EPYC Zen4: Genoa and Bergamo, up to 96 and 128 cores

Since AMD’s relaunch into high-performance x86 processor design, one of the fundamental targets for the company was to be a competitive force in the data center. By having a competitive product that customers could trust, the goal has always been to target what the customer wants, and subsequently grow market share and revenue. Since the launch of 3rd Generation EPYC, AMD is growing its enterprise revenue at a good pace, however questions always turn around to what the roadmap might hold. In the past, AMD has disclosed that its 4th Generation EPYC, known as Genoa, would be coming in 2022 with Zen 4 cores built on TSMC 5nm. Today, AMD is expanding the Zen 4 family with another segment of cloud-optimized processors called Bergamo. As part of AMD’s Data Center event today, the company is showcasing that its 4th Generation EPYC roadmap will consist of two segments: Genoa, with up to 96 Zen 4 cores, and Bergamo, with up to 128 Zen 4c cores. Not only are we getting official confirmation of core counts, but AMD is disclosing that Bergamo will be using a different type of core: the Zen 4c core. Imagine how much faster I could translate on one of these.

AMD: we stand ready to make Arm chips

AMD’s CFO Devinder Kumar recently commented that AMD stands ready to manufacture Arm chips if needed, noting that the company’s customers want to work with AMD on Arm-based solutions. Kumar’s remarks came during last week’s Deutsche Bank Technology Conference, building on comments from AMD CEO Lisa Su earlier in the year that underscored the company’s willingness to create custom silicon solutions for its customers, be they based on x86 or Arm architectures. Intel also intends to produce Arm and RISC-V chips, too, meaning that the rise of non-x86 architectures will be partially fueled by the stewards of the dominant x86 ecosystem. This is entirely unsurprising news. You don’t have to build Snapdragon or Apple-level ARM chips to make a lot of money with Arm, and companies like Intel and AMD would be stupid not to look into it.

AMD 3rd Gen EPYC Milan review: a peak vs per core performance balance

From a competitive standpoint, Milan continues to strengthen and maintain a very stark one-sided performance advantage against its biggest competitor, Intel. Rome had already offered more raw socket performance than the best Intel had to offer at the time, and the gap is currently quite large as Intel has not updated in that time. Intel has stated that its Ice Lake Xeon-SP family will come sometime soon, however unless Intel manages to close the core count gap, then AMD looks to be in very good shape. Meanwhile, as AMD is focused on Intel, the Arm competition has also entered the market with force through 2020, and designs such as the Ampere Altra are able to outperform the new top Milan SKUs in many throughput-bound workloads. AMD still has very clear advantages, such as much superior memory performance through huge caches, or vastly superior per-thread performance with specialised dedicated SKUs. Still, it leaves AMD in a spot as they can’t claim to be the outright performance leader under every scenario, and offers another generational target to consider as it develops future cores. Another monstrous CPU by AMD, and another case where Intel simply doesn’t even come close. There’s offerings on the ARM front, though, that are slowly starting to make their way into the data centre.

64 cores of rendering madness: the AMD Threadripper Pro 3995WX review

In the tests that matter, most noticeably the 3D rendering tests, we’re seeing a 3% speed-up on the Threadripper Pro compared to the regular Threadripper at the same memory frequency and sub-timings. The core frequencies were preferential on the 3990X, but the memory bandwidth of the 3995WX is obviously helping to a small degree, enough to pull ahead in our testing, along with the benefit of having access to 8x of the memory capacity as well as Pro features for proper enterprise-level administration. The downside of this comparison is the cost: the SEP difference is +$1500, or another 50%, for the Threadripper Pro 3995WX over the regular Threadripper 3990X. With this price increase, you’re not really paying +50% for the performance difference (ECC memory also costs a good amount), but the feature set. Threadripper Pro is aimed at the visual effects and rendering market, where holding 3D models in main memory is a key aspect of workflow speed as well as full-scene production. Alongside the memory capacity difference, having double the PCIe 4.0 lanes means more access to offload hardware or additional fast storage, also important tools in the visual effects space. Threadripper Pro falls very much into the bucket of ‘if you need it, this is the option to go for‘. AMD is entirely in a league of its own with these processors. I keep repeating it, but AMD’s comeback is one of the most remarkable stories in the history of technology.