无法理解NumPy loadtxt中的转换器行为

a, b, c, d, e = np.loadtxt(infile, delimiter = ',', usecols=(0, 2, 5, 8, 9), skiprows = 1, unpack = True, converters = dict(zip((0, 2, 5, 8, 9), (int, float, float, int, int))))

41 skiprows = 1, 42 unpack = True, ---> 43 dtype = (int,float,float,int,int)) 44 45 /Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/site-packages/numpy/lib/npyio.pyc in loadtxt(fname, dtype, comments, delimiter, converters, skiprows, usecols, unpack) 665 try: 666 # Make sure we're dealing with a proper dtype --> 667 dtype = np.dtype(dtype) 668 defconv = _getconv(dtype) 669 TypeError: data type not understood WARNING: Failure executing file: <forward_NDMMF.py>

2条回答

网友

1楼 · 编辑于 2024-05-08 22:10:39

要指定不同列的类型，可以使用参数dtype，而不是converters：

dtype=(int,float,float,int,int)

编辑：

显然，这种类型的dtype规范似乎不适用于loadtxt，但它适用于genfromtxt（有人知道为什么这不适用于loadtxt，或者这是genfromtxt的额外功能之一吗？）

如果您想使用loadtxt，一个带元组的结构化dtype规范可以工作，比如[('f0', int), ('f1', float)]，而不是(int, float)

但还有一个问题。当使用这样的结构化数据类型和这样的结构化数组（针对不同列的不同类型）时，unpack似乎不起作用。至少举个简单的例子。但这可能是一个已经解决的错误：http://projects.scipy.org/numpy/ticket/1458（但要做到这一点，您必须升级到甚至1.6）。

网友

2楼 · 编辑于 2024-05-08 22:10:39

loadtxt文档表明 converters应该包含以下函数specifically return floats：

converters : dict, optional
A dictionary mapping column number to a function that will convert that column to a float. E.g., if column 0 is a date string: converters = {0: datestr2num}. Converters can also be used to provide a default value for missing data: converters = {3: lambda s: float(s or 0)}. Default: None.

如果您需要整数，则需要使用dtype关键字来转换浮点数。

>>> numpy.loadtxt('th.txt', delimiter=',', usecols=(0, 2, 3), converters=dict(zip((0, 2, 3), (float, float, float))), dtype=([('i1', '<i4'), ('i2', '<f4'), ('i3', '<i4')]))
array([(1, 3.2000000476837158, 4), (1, 3.2000000476837158, 4),
       (1, 3.2000000476837158, 4), (1, 3.2000000476837158, 4),
       (1, 3.2000000476837158, 4), (1, 3.2000000476837158, 4),
       (1, 3.2000000476837158, 4), (1, 3.2000000476837158, 4),
       (1, 3.2000000476837158, 4)],
      dtype=[('i1', '<i4'), ('f1', '<f4'), ('i2', '<i4')])

当然，在这种情况下实际上不需要converters——这实际上是为了将'True'这样的任意字符串值转换为数值。此外，如果您实际上需要一个简单的二维数组而不是记录数组，则不要传递记录格式：

>>> numpy.loadtxt('th.txt', delimiter=',', usecols=(0, 2, 3), dtype=int)
array([[1, 3, 4],
       [1, 3, 4],
       [1, 3, 4],
       [1, 3, 4],
       [1, 3, 4],
       [1, 3, 4],
       [1, 3, 4],
       [1, 3, 4],
       [1, 3, 4]])

但如果这样做，就不能按列指定格式。

相关问题更多 >

编程相关推荐

热门问题

热门文章