Radeon Instinct platforms

Built primarily to accelerate deep learning, artificial neural networks, and HPC/GPGPU applications, Instinct GPUs were originally launched in December 2016 based on the GCN third- and fourth- generation microarchitectures.

The Instinct MI50 belongs to the AMD Vega 20 GCN 5.1 architecture. It has a 16 GB HBM2 memory, a 1,746 MHz boost clock speed, 3,840 stream processors, 60 compute units, and 1 TB/s memory bandwidth. It offers full FP64 performance at 6.7 TFLOPS (1:2).

AMD's Instinct GPUs are direct competitors to NVIDIA's Tesla GPUs and are also made with the purpose of accelerating computational tasks with unrestricted computing power.

Like NVIDIAs Tesla GPUs, the Instinct series can also immensely contribute to computations where floating-point calculations require double precision for greater accuracy. Apart from pure computations, high-end image generation for applications in professional and scientific fields is also possible with ROCm.

In general, Radeon Pro and Instinct GPUs have higher double-precision power compared to the consumer-level Radeon GPUs. Instinct GPUs cost around $10,000, while the Pro series are also available for mid-to-high range budgets, comparable to the Radeon series.

Reviewing all three GPU prices to performance ratios, a notable advantage for gamers who also happen to be equally passionate developers is the price point at which the new Radeon VII will be available at the time of writing this chapter. A margin of $699 USD really makes it easier for them to get involved with GPU computing, considering its 1:8 FP64 metric at 1.6 TFLOPS!

Let's compare the specifications of the three GPU models, which also includes all three FP64 metrics:

Radeon VII

Radeon Vega Frontier Edition

Radeon Instinct MI50

Microarchitecture

Vega second generation

Vega second Generation

Vega second generation

Memory size

16 GB

16 GB

16 GB

Memory type

HBM2

HBM2

HBM2

Memory bus

4,096-bit

2,048-bit

4,096-bit

Bandwidth

1 TB/s

484 GB/s

1 TB/s

xGMI connect support

No

No

No

DirectGMA support

No

Yes

No

Radeon VII

Radeon Vega Frontier Edition

Radeon Instinct MI50

Pixel rate

112.0 GPixel/s

102.4 GPixel/s

111.7 GPixel/s

Texture rate

420.0 GTexel/s

409.6 GTexel/s

419.0 GTexel/s

FP16 (half) performance

27.7 TFLOPS

26.2 TFLOPS

26.8 TFLOPS

FP32 (float) performance

13.8 TFLOPS

13.1 TFLOPS

13.4 TFLOPS

FP64 (double) performance

3.46 TFLOPS

0.8 TFLOPS

6.7 TFLOPS

Radeon VII

Radeon Vega Frontier Edition

Radeon Instinct MI50

GPU clock

1,400 MHz

1,382 MHz

1,200 MHz

Boost clock

1,750 MHz

1,600 MHz

1,746 MHz

Memory clock

1,000 MHz

945 MHz

1,000 MHz

Effective memory clock

2,000 MHz

1,890 MHz

2,000 MHz

 

For a deep learning (DL) performance comparison of GPUs without tensor cores, they can be equivalently compared to their conventional FLOPS to benchmark maximum DL performance (particularly FP16/FP32 mixed precision).

We can see that apart from the 1:4 FP64 metric, the Radeon VII is just a revised version of the Instinct MI50 card! The Vega Frontier edition has a much lower FP64 value!

If we revisit the affordability level that we spoke of in the Computing on NVIDIA GPUs section for NVIDIA, AMD GPU prices with the mentioned specifications can be considered a win-win situation both for organizations (academia/industry) and individuals who are independent researchers.

Just like NVIDIA has GPUDirect and NVLink, AMD has DirectGMA and xGMI Interconnect (upcoming). GMA stands for Graphic Memory Access and xGMI stands for inter-chip Global Memory Interconnect. DirectGMA is open source via AMD's GPUOpen initiative. GCN refers to AMD's Graphic Core Next architecture.