Started Labbook 2010.

March 4, 2009

  • Tested perflab-rotate on ow125 with -02 option:
    Rotate: Version = naive_rotate: baseline implementation:
    Dim 64 128 256 512 1024 Mean
    Your CPEs 8.4 9.5 17.2 31.0 105.1
    Baseline CPEs 8.7 9.5 17.2 31.3 103.6
    Speedup 1.0 1.0 1.0 1.0 1.0 1.0
    Rotate: Version = rotate_b64: average optimalization using 64x64 blocking:
    Dim 64 128 256 512 1024 Mean
    Your CPEs 10.6 10.3 12.0 20.4 22.9
    Baseline CPEs 8.7 9.5 17.2 31.3 103.6
    Speedup 0.8 0.9 1.4 1.5 4.5 1.5
    Rotate: Version = rotate_b128: average optimalization using 128x128 blocking:
    Dim 64 128 256 512 1024 Mean
    Your CPEs 10.5 10.4 17.9 20.6 25.1
    Baseline CPEs 8.7 9.5 17.2 31.3 103.6
    Speedup 0.8 0.9 1.0 1.5 4.1 1.4
    Rotate: Version = rotate_block: maximum optimalization 112x112 block2 (nxn):
    Dim 128 256 512 1024 2048 Mean
    Your CPEs 10.0 17.7 23.2 26.0 58.4
    Baseline CPEs 9.5 17.2 31.3 103.6 107.4
    Speedup 0.9 1.0 1.4 4.0 1.8 1.6
    Rotate: Version = rotate_block: maximum optimalization nxn block(n=128):
    Dim 64 128 256 512 1024 Mean
    Your CPEs 7.9 8.0 12.5 16.6 23.3
    Baseline CPEs 8.7 9.5 17.2 31.3 103.6
    Speedup 1.1 1.2 1.4 1.9 4.5 1.7
    Rotate: Version = rotate_block: maximum optimalization nxn block2(n=128):
    Dim 128 256 512 1024 2048 Mean
    Your CPEs 8.1 12.2 16.6 23.1 62.4
    Baseline CPEs 9.5 17.2 31.3 103.6 107.4
    Speedup 1.2 1.4 1.9 4.5 1.72 1.7
    Rotate: Version = rotate_block: maximum optimalization 136x136 block (nxn):
    Dim 128 256 512 1024 2048 Mean
    Your CPEs 8.1 12.7 17.4 23.1 65.8
    Baseline CPEs 9.5 17.2 31.3 103.6 107.4
    Speedup 1.2 1.4 1.8 4.5 1.6 1.8
    Rotate: Version = rotate_block: maximum optimalization 140x140 block:
    Dim 128 256 512 1024 2048 Mean
    Your CPEs 8.1 12.4 16.4 22.7 65.2
    Baseline CPEs 9.5 17.2 31.3 103.6 107.4
    Speedup 1.2 1.4 1.9 4.6 1.6 1.9
    Rotate: Version = rotate_block: maximum optimalization 142x142 block:
    Dim 256 512 1024 1536 2048 Mean
    Your CPEs 12.7 16.5 25.1 18.7 66.9
    Baseline CPEs 17.2 31.3 103.6 98.1 107.4
    Speedup 1.4 1.9 4.1 5.6 1.6 2.5
    Rotate: Version = rotate_block: maximum optimalization 144x144 block (nxn):
    Dim 128 256 512 1024 2048 Mean
    Your CPEs 8.0 13.0 16.5 21.6 65.3
    Baseline CPEs 9.5 17.2 31.3 103.6 107.4
    Speedup 1.2 1.3 1.9 4.8 1.6 1.9
    Rotate: Version = rotate_block: maximum optimalization 152x152 block (nxn):
    Dim 128 256 512 1024 2048 Mean
    Your CPEs 8.0 12.5 16.4 22.9 70.7
    Baseline CPEs 9.5 17.2 31.3 103.6 107.4
    Speedup 1.2 1.4 1.9 4.5 1.5 1.8
    Rotate: Version = rotate_block: maximum optimalization 160x160 block2 (nxn):
    Dim 128 256 512 1024 2048 Mean
    Your CPEs 10.0 18.5 23.1 28.2 76.8
    Baseline CPEs 9.5 17.2 31.3 103.6 107.4
    Speedup 1.0 0.9 1.4 3.7 1.4 1.4
    Rotate: Version = rotate_b32_u2: Rotate using 32x32 blocking, 2x unrolling:
    Dim 256 512 1024 1536 2048 Mean
    Your CPEs 9.9 14.3 15.8 14.7 17.9
    Baseline CPEs 17.2 31.3 103.6 98.1 107.4
    Speedup 1.7 2.2 6.6 6.7 6.0 4.0
    Rotate: Version = rotate_hybrid: Hybrid of toggle32 and toggle16x2:
    Dim 256 512 1024 1536 2048 Mean
    Your CPEs 4.6 10.1 13.9 14.2 14.2
    Baseline CPEs 17.2 31.3 103.6 98.1 107.4
    Speedup 3.7 3.1 7.4 6.9 7.6 5.4
  • Can it much faster, can blocking help?

    February 25, 2009

    February 10, 2009

    January 28, 2009

    January 22, 2009

    January 21, 2009

    January 20, 2009

    January 7, 2009