[SciPy-user] normalizing data/distributions

A.J. Rossini rossini at blindglobe.net
Wed May 12 22:14:07 CDT 2004


"Scott Bray" <scottbray83 at hotmail.com> writes:

> i am working on a statistics type project for university study. i have
> a set of data, have built a discrete probability distribution from
> this data (using the cauchy distribution) and now want to normalize
> it. currently, the area of the distribution is not equal to one. i
> have been trying to find literature about how to normalize, but have
> been unsuccessful (and what i have found, i am unsure on the
> validity). some say to normalize the data points by:
>
> (data point - sample mean) / sample std

This is to "normalize" it in the sense of making it match a N(0,1)
distribution -- you explicitly said that you were using a cauchy as
the basis?!

> others say to multiply by a normalising constant that is "chosen" to
> make the area equal to one. i tried this by just scaling the area to
> equal one.

This is probably the most reasonable approach to making a
"distribution" of your discretized empirical observations.

best,
-tony

-- 
rossini at u.washington.edu            http://www.analytics.washington.edu/ 
Biomedical and Health Informatics   University of Washington
Biostatistics, SCHARP/HVTN          Fred Hutchinson Cancer Research Center
UW (Tu/Th/F): 206-616-7630 FAX=206-543-3461 | Voicemail is unreliable
FHCRC  (M/W): 206-667-7025 FAX=206-667-4812 | use Email

CONFIDENTIALITY NOTICE: This e-mail message and any attachments may be
confidential and privileged. If you received this message in error,
please destroy it and notify the sender. Thank you.



More information about the SciPy-user mailing list