Naples to Rome: New AMD EPYC CPUs



On August 7, the second-generation AMD EPYC ™ line was launched worldwide. The new processors are based on the Zen 2 microarchitecture and are built on a 7-nm process.

Features



“They’re greeted by clothes, escorted by the mind,” says folk wisdom. So we will start with the “clothes” of the new generation. The processor markings have undergone minor changes: the fourth digit, representing the generation, has changed from 1 to 2 . The first digit, as before, indicates the series, and the second and third - the model. AMD has not abandoned processors with the P index, which do not support work in multi-socket systems.

The second generation inherited the SP3 socket from the first generation without changes, which allows the use of new processors without updating the motherboard, but this will not be able to fully reveal their potential. Achieving full performance is possible with the use of new motherboards supporting 3200 MHz for DDR4 memory.

The processor "filling" has changed beyond recognition: the technological process has changed, a new Zen 2 microarchitecture has been applied and a new high-speed memory controller has appeared.

Performance


The transition to the 7-nm process led to the compaction of the crystal and an increase in the number of cores to 64 , which is twice as high as in the first generation. The base frequency of the second generation processors is in the range from 2.00 to 2.90 GHz . For comparison, the first-generation base did not exceed 2.30 GHz . An increase in the number of cores and the base frequency of the processor has led to an increase in heat dissipation up to 120 watts in budget versions and up to 225 watts in top versions.

Process improvement is not the only innovation. A new microarchitecture called Zen 2 has contributed to the improvement of processor specifications. Improvements are aimed at interacting with caches: the data exchange speed with L1 has doubled, the data transfer speed between caches has been increased, the size of the L3 cache has been increased.

Detailed specifications of the line are presented in the table.
ModelTDP, WattNumber of coresBase frequency, GHzMaximum frequency, GHzSize L3 Cache, MB
7742225642.253.40256
7702 / 7702P200642.003.35256
7642225482.303.30256
7552200482.203.30192
7542225322.903.40128
7502 / 7502P180322.503.35128
7452155322.353.35128
7402 / 7402P180242.803.35128
7352155242.303.20128
7302 / 7302P155sixteen3.003.30128
7282120sixteen2.803.2064
7272120122.903.2064
7262155eight3.203.40128
7252120eight3.103.2064
7232P120eight3.103.2032
Rome boasts an improvement in working with external devices: for the first time, it became possible to work with an external x2APIC interrupt controller , and also supports DDR4-3200 and 128 PCIe lanes . The new generation of processors includes support for Virtualized IOMMU technology, with which virtual machines gain direct access to physical peripherals.

Increasing capacity and improving conditions for virtualization leads to an increase in the number of simultaneously running applications. Applications work with sensitive data that needs enhanced protection.

Security


In the updated line of processors, security issues were not the last place. Inside the system, a processor is installed on a chip called the AMD Secure Processor based on ARM® Cortex® A5 , which stores keys and encrypts the contents of RAM using the AES-128 algorithm.

AMD Secure Processor offers two types of memory encryption (these methods require OS support):


Test servers


Now that the theory of the new product is known, we will carry out practical tests. AMD EPYC ™ 7452 will be the representative of the second generation. AMD EPYC ™ 7551 is chosen as the opponent from the first generation. The opponent from the “blue” camp is Intel® Xeon® Gold 5218 as part of a two-socket system. The choice of these systems is justified by the similarity of their technical characteristics.
EPYC 7452EPYC 7551Xeon 5218
Number of Cores3232sixteen
Number of threads646432
Base frequency, GHz2.352.002.30
Maximum frequency, GHz3.703.003.90
TDP, Watt155180125
RAM TypeDDR4-3200DDR4-2666DDR4-2666
L1 cache, KB204830721024
L2 cache, KB163841638416384
L3 cache, MB1286422
In tests, processors are placed on an equal footing: the same memory, the same drives, and identical operating systems with identical settings.

For maximum performance, all channels of the processor's memory controller must be enabled. Intel® processors have six channels, and AMD processors have eight. Given this difference, it is difficult to create identical conditions, so a compromise was found: systems with AMD processors have 8 modules of 16 GB each, and a dual-socket system with Intel® Xeon® has 12 modules of 8 GB each. All RAM modules operate at a frequency of 2666 MHz .

An operating system (OS) is hosted on SSDs to reduce the impact of the disk subsystem on testing. All tests were performed on CentOS version 7.
EPYC 7452EPYC 7551Xeon 5218
Processors1 x AMD EPYC 74521 x AMD EPYC 75512 x Intel Xeon 5218
RAM8 x 16GB DDR4-26668 x 16GB DDR4-266612 x 8 GB DDR4-2666
Disks1 x Samsung SSD 850 EVO 250GB1 x Intel SSD S4510 480 GB1 x Intel SSD S4510 480 GB

Tests


Testing should be as objective as possible, especially since we are talking about comparing Intel® processors and AMD. Therefore, we will not use optimizing compilers to build tests that are supplied as source codes.

GeekBench 4


GeekBench is a popular cross-platform processor performance test with its own online results database . The test comes in the form of ready-made executable files, which is why optimization for specific processors is not provided.

The general metrics of the GeekBench groups are important for us:


The indicated groups of tests are launched in two modes: in a single thread and when executed simultaneously on all cores. Based on the results, GeekBench puts the final ratings: Single-Core Score and Multi-Core Score .


The main battle is between AMD EPYC ™ 7452 and Intel® Xeon® Gold 5218, while the first generation EPYC ™ is inferior to the second in all tests.

Consider single-threaded tests. Rome shows excellent results when working with cryptographic tasks and memory, but loses when performing integer calculations. As a result, the second generation EPYC ™ scores 4893 points and becomes the winner in the Single-Core nomination. The second and third places are occupied by Xeon and EPYC of the first generation with 4695 and 3981 points respectively.


In multi-threaded tests, the balance of power changes significantly. EPYC ™ 7452 does a great job of computing, but loses ground in cryptographic tasks and working with memory , which does not prevent it from becoming a leader with 96009 points in the Multi-Core nomination.

SPEC CPU 2017


SPEC CPU 2017 is a set of performance tests recognized by processor manufacturers. Tests of this set are distributed in the form of source codes, which allows you to optimize them for specific equipment on a specific operating system.

SPEC CPU consists of four test suites:


The first part in the test name determines the type of computation on the processor: integers (int) or above floating-point numbers (fp). The second part determines the type of testing: single-core (rate) or multi-core (speed).

We conducted all four test suites. Tests are compiled at the third level of optimizations using the GNU 4.8.5 compiler suite. Multi-core tests ran in 64 threads, and single-core tests ran in 32 copies.


Tests compiled with optimizations show inconsistent results with GeekBench. The second generation AMD EPYC ™ is superior to the dual-socket system with Intel® processors in all tests except intspeed, but with a significantly smaller margin than in GeekBench.

Phoronix Test Suite


Phoronix Test Suite (PTS) - software that allows you to run tests from a large database of user performance tests. This solution allows you to automatically run the desired tests on several experimental servers at the same time as aggregating the results on the master server.

We have developed our own set of 21 tests, including:

EPYC 7452EPYC 75512 x Xeon 5218
RAMspeed SMP - Type: Add - Benchmark: Integer32476.926531.4928942.2
RAMspeed SMP - Type: Copy - Benchmark: Integer30325.7623419.8627165.75
RAMspeed SMP - Type: Scale - Benchmark: Integer30429,7622011.0828629.12
RAMspeed SMP - Type: Triad - Benchmark: Integer31482.618208.5828299.14
RAMspeed SMP - Type: Average - Benchmark: Integer31060.831745.7128432.31
RAMspeed SMP - Type: Add - Benchmark: Floating Point32434.2637939.528445.26
RAMspeed SMP - Type: Copy - Benchmark: Floating Point30386.9935209.9727119.9
RAMspeed SMP - Type: Scale - Benchmark: Floating Point30,097.1130509.0526508.4
RAMspeed SMP - Type: Triad - Benchmark: Floating Point32473.0438458.628385.89
RAMspeed SMP - Type: Average - Benchmark: Floating Point31295.534393.327,637.44
Stream - Type: Copy107,192.8110,996.94126,257.4
Stream - Type: Scale72,434.4287,300.88105633.7
Stream - Type: Triad77729,7297735.96115100.86
Stream - Type: Add77021.1697,204.36114907.6
MBW - Test: Memory Copy - Array Size: 8192 MiB16888.5212,402.324845.29
MBW - Test: Memory Copy, Fixed Block Size - Array Size: 8192 MiB10752.127410.172982.56
CacheBench - Test: Read2312.412079.623286,28
CacheBench - Test: Write24357.420329.2127520.75
CacheBench - Test: Read / Modify / Write24920.321,598.9828966.95
GNU MPC - Multi-Precision Benchmark714358108950
NAMD - ATPase Simulation - 327,506 Atoms0,800790.941190.77091
Botan - Test: KASUMI - Encrypt69.6961.7478.69
Botan - Test: KASUMI - Decrypt67.1658.5774.85
Botan - Test: AES-256 - Encrypt4575.944,173.763687.71
Botan - Test: AES-256 - Decrypt4552.924152.073704.3
Botan - Test: Twofish - Encrypt279.59247.38325.1
Botan - Test: Twofish - Decrypt281.87249.85333.97
Botan - Test: Blowfish - Encrypt247.76217.14282.4
Botan - Test: Blowfish - Decrypt249.19217.89283.61
Botan - Test: CAST-256 - Encrypt116,150101,470124,690
Botan - Test: CAST-256 - Decrypt116.68101.95125.75
John The Ripper - Test: Blowfish498514056839555
7-Zip Compression - Compress Speed ​​Test163202107009135458
C-Ray - Total Time - 4K, 16 Rays Per Pixel23.4126.6529.48
POV-Ray - Trace Time18.6723.320.46
Smallpt - Global Illumination Renderer; 128 samples15.065.5258.97
dav1d - Video Input: Summer Nature 4K17.6728.3720.98
FLAC Audio Encoding - WAV To FLAC12.2214.1711.43
FFmpeg - H.264 HD To NTSC DV9.1410.8511.46
Hackbench - Count: 1 - Type: Thread3.95.823.35
Hackbench - Count: 8 - Type: Thread10.918.118.73
Hackbench - Count: 1 - Type: Process3.785.433.15
Hackbench - Count: 8 - Type: Process9.6613.318.14
Hackbench - Count: 32 - Type: Process39.0232.2527.74
OpenSSL - RSA 4096-bit Performance6825.94351.476809.23
ctx_clock - Context Switch Time211220160
NGINX Benchmark - Static Web Page Serving26991.7917497.0228274.97
Schbench - Message Threads: 2 - Workers Per Message Thread: 242204101
Schbench - Message Threads: 8 - Workers Per Message Thread: 8462447047009
Schbench - Message Threads: 32 - Workers Per Message Thread: 32156416157952167509
Radiance Benchmark - Test: Serial807,01783.481102.24
Radiance Benchmark - Test: SMP Parallel260.47238.69333.32
For the first time in tests, the AMD EPYC ™ 7551 comes first. AMD processors, regardless of generation, are better at single-threaded work with memory, rendering and video transcoding. Intel® processors, in turn, do better with cryptographic tasks and multi-threaded work with memory, as was previously revealed in GeekBench tests.

findings


Despite the variety of tests, the choice between the first and second generation AMD EPYC ™ is obvious: Rome is superior to its predecessor in the vast majority of tests. However, the first generation does not lose ground in working with memory and rendering.

Comparison of Intel® processors and AMD processors - a real battle of the titans, requiring detailed consideration. Installed software solutions perform better on average with the second generation AMD EPYC ™. In particular, Rome shows excellent results in multi-threaded computing and single-threaded cryptographic tasks. When working with software that is compiled from source codes, preference is given to the new generation of EPYC ™ processors, which are leading in floating point tests and only slightly lagging behind in multi-threaded integer calculations.

Intel® Xeon® Gold, in turn, shows good results in multi-threaded cryptography, audio transcoding and memory handling. The dual-socket system with Intel® Xeon® processors performed well when processing web server requests.

To summarize, it should be noted that the tests performed are synthetic and the results on real tasks may vary. To obtain accurate results on specific tasks, additional testing is required. Now just a few words about the cost. The recommended price for AMD EPYC® 7452 is $ 2025, for Intel® Xeon® Gold 5218 - $ 1250, i.e. $ 2500 for a dual-socket system.

The new AMD EPYC ™ 7452 will be available soon at our Selectel Lab.

Be the first to know!


The second-generation EPYC ™ is just another round in the confrontation between AMD and Intel. You can read more about how the rivalry between the two corporations began in our blog .

Source: https://habr.com/ru/post/463401/


All Articles