[SciPy-User] kmeans

Keith Goodman kwgoodman@gmail....
Fri Jul 23 13:54:27 CDT 2010


On Fri, Jul 23, 2010 at 11:39 AM, Lutz Maibaum <lutz.maibaum@gmail.com> wrote:
> On Fri, Jul 23, 2010 at 11:33 AM, Keith Goodman <kwgoodman@gmail.com> wrote:
>> What needs to be fixed?
>>
>> - Switch code and doc to use rmse
>
> To be compatible with the (at least to me!) standard use of k-means, I
> think both code and doc should use the sum of squared distances as the
> cost function in the optimization, and also as the return value.

What about the thresh (threshold) input parameter? If the sum of
squares were used, the user would have to adjust the threshold for
the number of data points, since the sum grows with the number of
points while the rmse does not.
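
To illustrate the scaling concern, here is a minimal sketch in plain
Python. It is not the scipy.cluster.vq code; the helper names
(nearest_sq_dist, sse, rmse) and the 1-D toy data are made up for this
example. It compares the two candidate cost functions on a small data
set and on the same data repeated 100 times.

```python
import math

def nearest_sq_dist(x, centroids):
    """Squared distance from a 1-D point x to its nearest centroid."""
    return min((x - c) ** 2 for c in centroids)

def sse(points, centroids):
    """Sum of squared distances to the nearest centroid."""
    return sum(nearest_sq_dist(x, centroids) for x in points)

def rmse(points, centroids):
    """Root mean squared distance to the nearest centroid."""
    return math.sqrt(sse(points, centroids) / len(points))

# Hypothetical toy data: two fixed centroids, four points, then the
# same four points repeated 100 times.
centroids = [0.0, 10.0]
small = [-1.0, 1.0, 9.0, 11.0]
large = small * 100

sse_small, sse_large = sse(small, centroids), sse(large, centroids)
rmse_small, rmse_large = rmse(small, centroids), rmse(large, centroids)

print(sse_small, sse_large)    # 4.0 400.0
print(rmse_small, rmse_large)  # 1.0 1.0
```

The clustering quality is identical in both cases, yet the sum of
squares is 100 times larger for the bigger data set, so a fixed
threshold on the change in that cost would mean something different
for each. The rmse stays the same.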

