clpeak is a synthetic benchmarking tool to measure peak capabilities of opencl devices.
clpeak
returns:
Platform: AMD Accelerated Parallel Processing Device: gfx1100 Driver version : 3513.0 (HSA1.1,LC) (Linux x64) Compute units : 48 Clock frequency : 3220 MHz Global memory bandwidth (GBPS) float : 751.23 float2 : 789.32 float4 : 825.00 float8 : 856.98 float16 : 877.29 Single-precision compute (GFLOPS) float : 34524.21 float2 : 34385.53 float4 : 34120.14 float8 : 34467.64 float16 : 32858.38 Half-precision compute (GFLOPS) half : 34022.72 half2 : 65991.11 half4 : 66666.16 half8 : 62676.74 half16 : 64124.60 Double-precision compute (GFLOPS) double : 1155.41 double2 : 1171.97 double4 : 1174.89 double8 : 1169.35 double16 : 1160.33 Integer compute (GIOPS) int : 8587.24 int2 : 8392.64 int4 : 8364.48 int8 : 8410.87 int16 : 8421.63 Transfer bandwidth (GBPS) enqueueWriteBuffer : 19.29 enqueueReadBuffer : 19.92 enqueueMapBuffer(for read) : 249707.39 memcpy from mapped ptr : 19.80 enqueueUnmap(after write) : 363980.28 memcpy to mapped ptr : 19.32 Kernel launch latency : 15.75 us