[Numpy-discussion] how to select indices from boolean array

Pauli Virtanen ptvirtan@elisanet...
Thu Oct 25 07:44:30 CDT 2007


wBernhard Voigt <bernhard.voigt@gmail.com> kirjoitti: 
> is nonzero the appropriate function to get the indexes of a boolean array?
> 
> foo = numpy.random.random(10000)
> cut = foo > .5
> indexes = cut.nonzero()[0]

I think writing

    indices = where(foo > 5)[0]

might be more clear, although it does the same thing as .nonzero().

> Another question: Why is the the type returned by nonzero a tuple and not an
> ndarray of the same shape as the input (or self if used on an object)?

Because indexing with booleans in numpy works differently from indexing with numbers, especially for multidimensional arrays. You want to have

   ( foo[foo > 5] == foo[where(foo > 5)] ).all()

so where/nonzero returns a tuple that contains 1D-arrays, one per dimension, which together enumerate the relevant entries. (IIRC, in Matlab, find returns indices in the flattened array.)

-- 
Pauli Virtanen


-- 




More information about the Numpy-discussion mailing list