[SciPy-dev] percentileofscore
Robert Kern
robert.kern@gmail....
Sun Nov 16 23:58:36 CST 2008
On Sun, Nov 16, 2008 at 23:53, <josef.pktd@gmail.com> wrote:
> On Sun, Nov 16, 2008 at 10:56 PM, Robert Kern <robert.kern@gmail.com> wrote:
>
>>
>> Wikipedia says to use half of the frequency of the ties (x==score) in
>> addition to the cumulative frequency of strict x<score.
>>
>> http://en.wikipedia.org/wiki/Percentile_rank
>>
>
> The 0.5 weight looks pretty arbitrary to me
> percentilescore_wikip([1,2,3,4,4,4,5,6,7,8]) = 3 + 0.5*3 = 4.5
It's not arbitrary. It's the average of the x<score and x<=score cases.
> I guess the question is, whether this a commonly accepted convention,
> or maybe, which and whose convention should scipy follow.
>
> The proposed patch is pretty easy to adjust to any convention.
> Maybe percentileofscore should get a weight parameter for ties:
> 0 for strict inequality, 1 for weak inequality,
> 0.5 (default?) for wikipedia and -1 for mean.
I prefer strings, myself. 'strict', 'weak', 'mean'.
--
Robert Kern
"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
-- Umberto Eco
More information about the Scipy-dev
mailing list