[Numpy-discussion] More loadtxt() changes

Ryan May rmay31@gmail....
Wed Nov 26 16:55:47 CST 2008


Manuel Metz wrote:
> Ryan May wrote:
>> 3) Better support for missing values.  The docstring mentions a way of
>> handling missing values by passing in a converter.  The problem with this is
>> that you have to pass in a converter for *every column* that will contain
>> missing values.  If you have a text file with 50 columns, writing this
>> dictionary of converters seems like ugly and needless boilerplate.  I'm
>> unsure of how best to pass in both what values indicate missing values and
>> what values to fill in their place.  I'd love suggestions
> 
> Hi Ryan,
>    this would be a great feature to have !!!

Thanks for the support!

> One question: I have a datafile in ASCII format that uses a fixed width 
> for each column. If no data if present, the space is left empty (see 
> second row). What is the default behavior of the StringConverter class 
> in this case? Does it ignore the empty entry by default? If so, what is 
> the value in the array in this case? Is it nan?
> 
> Example file:
> 
>    1| 123.4| -123.4| 00.0
>    2|      |  234.7| 12.2
> 

I don't think this is so much anything to do with StringConverter, but 
more to do with how to split lines.  Maybe we should add an option that, 
instead of simply specifying characters that delimit the fields, allows 
one to pass a custom function to split lines?  That could either be done 
by overriding `delimiter` or by adding a new option like `splitter`

I'll have to give that some thought.

Ryan

-- 
Ryan May
Graduate Research Assistant
School of Meteorology
University of Oklahoma


More information about the Numpy-discussion mailing list