[Numpy-discussion] Optimizing reduction loops (sum(), prod(), et al.)

Matthieu Brucher matthieu.brucher@gmail....
Thu Jul 9 04:02:01 CDT 2009


2009/7/9 David Cournapeau <david@ar.media.kyoto-u.ac.jp>:
> Matthieu Brucher wrote:
>>
>> Unfortunately, this is not possible. We've been playing with blocking
>> loops for a long time in finite difference schemes, and it is always
>> compiler dependent
>
> You mean CPU dependent, right ? I can't see how a reasonable optimizing
> compiler could make a big difference on cache effects ?

Yes, of course, CPU dependent...

> @ Pauli: if (optionally) knowing some cache information would help you, I
> could implement it. It should not be too difficult for most cases we
> care about.
>
> David



-- 
Information System Engineer, Ph.D.
Website: http://matthieu-brucher.developpez.com/
Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92
LinkedIn: http://www.linkedin.com/in/matthieubrucher
