mflop
play

Mflop 300 200 100 0 6 8 10 12 14 16 18 20 22 Vector - PDF document

Performance of scalar product benchmark 600 R10000/195MHz R8000/75MHz Alpha/500Mhz IBM Power 2/66Mhz 500 IBM PPC604e/166Mhz IBM PPC604e/233Mhz PPro 200/Mhz, NAG F90 PPro 200/Mhz, PG F77 400 Mflop 300 200 100 0 6 8 10 12 14 16


  1. Performance of scalar product benchmark 600 R10000/195MHz R8000/75MHz Alpha/500Mhz IBM Power 2/66Mhz 500 IBM PPC604e/166Mhz IBM PPC604e/233Mhz PPro 200/Mhz, NAG F90 PPro 200/Mhz, PG F77 400 Mflop 300 200 100 0 6 8 10 12 14 16 18 20 22 Vector length 2^n

  2. 600 "sk2.a500" "sk1.a500" "sk2.R10000" 500 "sk1.R10000" 400 300 200 100 0 6 8 10 12 14 16 18 20 22

  3. 120 "sk3.a500" "sk3.r10" "sk3.rs6" 100 "sk3.r8" 80 60 40 20 0 6 8 10 12 14 16 18 20 22

  4. CPU 32 Registers 1000 W. Lev. 1 Cache 12000 W. Level 2 Cache 0.5 MW ext. Level 3 Cache 64 MW Main Memory 1 GW Disk Space

  5. 180 red black(1) red black(2) red black(4) 160 140 120 100 Mflop 80 60 40 20 0 4 16 64 256 1024 grid size

  6. 180 red black(4) fused (1, 1) fused (2, 2) 160 fused (4, 0) 140 120 100 Mflop 80 60 40 20 0 4 16 64 256 1024 grid size

  7. 180 optimised RB melt(2, 2) melt(3, 3) 160 melt(4, 4) 140 120 100 Mflop 80 60 40 20 0 4 16 64 256 1024 grid size

More recommend