Memory Benchmark [stream]

by mirhenge 2018. 1. 18.

Running the STREAM benchmark

Download the files you need from the following site:


[root@amd01 stream]# wget  http://www.cs.virginia.edu/stream/FTP/Code/stream.f
[root@amd01 stream]# wget  http://www.cs.virginia.edu/stream/FTP/Code/stream.c
[root@amd01 stream]# wget http://www.cs.virginia.edu/stream/FTP/Code/Versions/stream_omp.c
[root@amd01 stream]# ls
stream.c  stream.f stream_omp.c
[root@amd01 stream]#
[root@amd01 stream]# gcc -fopenmp -D OPENMP stream_omp.c -o stream_omp.exe
[root@amd01 stream]#
[root@amd01 stream]# export OMP_NUM_THREADS=4
[root@amd01 stream]# export GOMP_CPU_AFFINITY=0-7
[root@amd01 stream]#
[root@amd01 stream]# ./stream_omp.exe
-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 2000000, Offset = 0
Total memory required = 45.8 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
Number of Threads requested = 4
Number of Threads requested = 4
Number of Threads requested = 4
Number of Threads requested = 4
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 1510 microseconds.
   (= 1510 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function      Rate (MB/s)   Avg time     Min time     Max time
Copy:       20149.7865       0.0016       0.0016       0.0016
Scale:      18811.1742       0.0017       0.0017       0.0017
Add:        22707.7140       0.0021       0.0021       0.0021
Triad:      21866.6875       0.0022       0.0022       0.0022
-------------------------------------------------------------
Solution Validates
-------------------------------------------------------------
[root@amd01 stream]#
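
The figures in the output above can be cross-checked by hand. The following is a minimal sketch, not part of the STREAM distribution: the function names `stream_total_mb` and `stream_rate_mb` are hypothetical helpers introduced here for illustration. It assumes 8-byte doubles (as the output reports), three arrays of equal length, that Copy and Scale touch two arrays per iteration while Add and Triad touch three, and that rates are reported in units of 10^6 bytes per second.

```shell
#!/bin/sh
# Sketch: relate STREAM's reported figures to the array size.
# Helper names are hypothetical; 8-byte doubles are assumed.

# Total memory for the three arrays, in MiB (1048576 bytes).
stream_total_mb() {
    awk -v n="$1" 'BEGIN { printf "%.1f\n", 3 * 8 * n / 1048576 }'
}

# Bandwidth in MB/s (10^6 bytes/s) for a kernel that touches
# "arrays" arrays of n doubles in t seconds.
# Copy and Scale: arrays=2. Add and Triad: arrays=3.
stream_rate_mb() {
    awk -v arrays="$1" -v n="$2" -v t="$3" \
        'BEGIN { printf "%.1f\n", arrays * 8 * n / t / 1e6 }'
}

stream_total_mb 2000000            # 45.8
stream_rate_mb 2 2000000 0.0016    # Copy:  20000.0
stream_rate_mb 3 2000000 0.0022    # Triad: 21818.2
```

The results differ slightly from the reported 20149.7865 and 21866.6875 MB/s because the Min time column is rounded to four decimal places, while the benchmark computes the rate from the unrounded time.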


Reference:

IBM High Performance Computing Cluster Health Check (sg248168.pdf)