[Numpy-discussion] Openmp support (was numpy's future (1.1 and beyond): which direction(s) ?)

Neal Becker ndbecker2@gmail....
Sat Mar 22 18:27:52 CDT 2008


gcc --version
gcc (GCC) 4.1.2 20070925 (Red Hat 4.1.2-33)
Copyright (C) 2006 Free Software Foundation, Inc.
This is free software; see the source for copying conditions.  There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.

[nbecker@nbecker1 ~]$ cat /proc/cpuinfo
processor       : 0
vendor_id       : GenuineIntel
cpu family      : 6
model           : 15
model name      : Intel(R) Core(TM)2 Duo CPU     T7500  @ 2.20GHz
stepping        : 11
cpu MHz         : 2201.000
cache size      : 4096 KB
physical id     : 0
siblings        : 2
core id         : 0
cpu cores       : 2
fpu             : yes
fpu_exception   : yes
cpuid level     : 10
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx lm constant_tsc arch_perfmon pebs bts rep_good pni monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr lahf_lm ida
bogomips        : 4393.14
clflush size    : 64
cache_alignment : 64
address sizes   : 36 bits physical, 48 bits virtual
power management:

processor       : 1
vendor_id       : GenuineIntel
cpu family      : 6
model           : 15
model name      : Intel(R) Core(TM)2 Duo CPU     T7500  @ 2.20GHz
stepping        : 11
cpu MHz         : 2201.000
cache size      : 4096 KB
physical id     : 0
siblings        : 2
core id         : 1
cpu cores       : 2
fpu             : yes
fpu_exception   : yes
cpuid level     : 10
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx lm constant_tsc arch_perfmon pebs bts rep_good pni monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr lahf_lm ida
bogomips        : 4389.47
clflush size    : 64
cache_alignment : 64
address sizes   : 36 bits physical, 48 bits virtual
power management:

[nbecker@nbecker1 ~]$ gcc -O2 vec_bench.c -o vec_bench
[nbecker@nbecker1 ~]$ ./vec_bench
Testing methods...
All OK

        Problem size              Simple              Intrin              Inline
                 100   0.0003ms (100.0%)   0.0003ms ( 78.3%)   0.0003ms ( 75.5%)
                1000   0.0029ms (100.0%)   0.0022ms ( 75.9%)   0.0026ms ( 87.0%)
               10000   0.0131ms (100.0%)   0.0085ms ( 65.0%)   0.0092ms ( 70.3%)
              100000   0.1210ms (100.0%)   0.0875ms ( 72.3%)   0.0932ms ( 77.0%)
             1000000   4.2518ms (100.0%)   7.5801ms (178.3%)   7.6278ms (179.4%)
            10000000  81.6962ms (100.0%)  79.8668ms ( 97.8%)  81.6365ms ( 99.9%)
[nbecker@nbecker1 ~]$ gcc -O3 -ffast-math vec_bench.c -o vec_bench
[nbecker@nbecker1 ~]$ ./vec_bench
Testing methods...
All OK

        Problem size              Simple              Intrin              Inline
                 100   0.0003ms (100.0%)   0.0002ms ( 68.4%)   0.0003ms ( 74.2%)
                1000   0.0029ms (100.0%)   0.0023ms ( 77.2%)   0.0025ms ( 86.9%)
               10000   0.0353ms (100.0%)   0.0086ms ( 24.5%)   0.0092ms ( 26.1%)
              100000   0.1497ms (100.0%)   0.1013ms ( 67.6%)   0.1146ms ( 76.6%)
             1000000   4.4004ms (100.0%)   7.5651ms (171.9%)   7.6200ms (173.2%)
            10000000  81.3631ms (100.0%)  83.3591ms (102.5%)  79.8199ms ( 98.1%)
[nbecker@nbecker1 ~]$ gcc -O3 -msse4a vec_bench.c -o vec_bench
[nbecker@nbecker1 ~]$ ./vec_bench
Testing methods...
All OK

        Problem size              Simple              Intrin              Inline
                 100   0.0001ms (100.0%)   0.0001ms ( 67.5%)   0.0001ms ( 74.8%)
                1000   0.0011ms (100.0%)   0.0008ms ( 78.0%)   0.0009ms ( 86.4%)
               10000   0.0116ms (100.0%)   0.0085ms ( 73.2%)   0.0092ms ( 79.1%)
              100000   0.1500ms (100.0%)   0.0873ms ( 58.2%)   0.0931ms ( 62.1%)
             1000000   4.2654ms (100.0%)   7.5623ms (177.3%)   7.5713ms (177.5%)
            10000000  79.4805ms (100.0%)  81.0649ms (102.0%)  81.1859ms (102.1%)


