python - numpy append_field 为具有 2d 形状的新字段提供形状错误

Question

我有一个结构化的 numpy 数组，我想使用 recfunctions 库 http://pyopengl.sourceforge.net/pydoc/numpy.lib.recfunctions.html 函数 append_fields() 或 rec_append_fields() 附加一个具有某种形状的字段. 但是，我收到一个错误：

ValueError：操作数无法与形状一起广播 (10) (10,3)

其中10是我现有数组的长度，(3,)是我要附加的字段的形状。

例如：

import numpy as np
from numpy.lib.recfunctions import append_fields


my_structured_array = np.array(
    zip([0,1,2,3],[[4.3,3.2],[1.4,5.6],[6.,2.5],[4.5,5.4]]),
    dtype=[('id','int8'),('pos','2float16')]
    )
my_new_field = np.ones(
    len(my_structured_array),
    dtype='2int8'
    )
my_appended_array = append_fields(
    my_structured_array,
    'new',
    data=my_new_field
    )

ValueError：操作数无法与形状一起广播 (4) (4,2)

有任何想法吗？我尝试制作my_new_field一个元组列表并将具有适当形状的 dtype 参数放入 append_fields()：

my_new_field = len(my_structured_array)*[(1,1)]

my_appended_array = append_fields(
    my_structured_array,
    'new',
    data=my_new_field,
    dtype='2int8'
    )

但是一旦它被转换为一个 numpy 数组，结果似乎是一样的。

当我使用 rec_append_fields() 而不是简单的 append_fields() 时，这一切似乎都没有改变

编辑：鉴于我的新字段与我的数组的形状不同，我想我想要的附加是不可能的，@radicalbiscuit建议。

In : my_new_field.shape
Out: (4, 2)

In : my_structured_array.shape
Out: (4,)

但是，我在数组中包含了一个形状与原始数组不同的原始字段来说明我的观点，即字段不必具有与结构化数组相同的形状。我怎样才能附加这样的字段？

In : my_structured_array['pos'].shape
Out: (4, 2)

In : my_new_field.shape
Out: (4, 2)

我应该注意，对于我的应用程序，只要以后可以以某种方式更改形状，我就可以附加一个空字段。谢谢！

score 6 · Accepted Answer

append_fields()确实要求两个数组的形状相同。话虽如此，正如您在中所意识到的那样my_structured_array，numpy 确实支持子数组（即，字段本身可以是具有形状的数组）。

在您的情况下，我认为您可能不想my_new_field成为二维数组，而是成为具有shape(my_structured_array)dtype 元素的一维数组（形状），例如dtype([('myfield', '<i8', (2,))]). 例如，

import numpy as np
from numpy.lib.recfunctions import append_fields

my_structured_array = np.array(
    zip([0,1,2,3],[[4.3,3.2],[1.4,5.6],[6.,2.5],[4.5,5.4]]),
    dtype=[('id','int8'),('pos','2float16')]
    )

my_new_field = np.ones(
    len(my_structured_array),
    dtype=[('myfield', 'i8', 2)]
    )

my_appended_array = append_fields(
    my_structured_array,
    'new',
    data=my_new_field
    )

会产生，

>>> my_appended_array[0]
(0, [4.30078125, 3.19921875], ([1, 1],))

尽管数据类型myfield嵌套在中，但稍有不便new，

>>> my_appended_array.dtype
dtype([('id', '|i1'), ('pos', '<f2', (2,)), ('new', [('myfield', '<i8', (2,))])])

然而，这很容易被强制删除，

>>> np.asarray(my_appended_array, dtype=[('id', '|i1'), ('pos', '<f2', (2,)), ('myfield', '<i8', (2,))])
array([(0, [4.30078125, 3.19921875], [0, 0]),
       (1, [1.400390625, 5.6015625], [0, 0]), (2, [6.0, 2.5], [0, 0]),
       (3, [4.5, 5.3984375], [0, 0])], 
      dtype=[('id', '|i1'), ('pos', '<f2', (2,)), ('myfield', '<i8', (2,))])

尽管如此，我们不得不重复my_structured_array这里的 dtype 有点不幸。虽然乍一看似乎numpy.lib.recfunctions.flatten_descr可以完成扁平化 dtype 的繁琐工作，但不幸的是，它提供了一个元组而不是np.dtype. 但是，将其输出强制到列表中可以解决此问题，

>>> np.dtype(list(np.lib.recfunctions.flatten_descr(my_appended_array.dtype)))
dtype([('id', '|i1'), ('pos', '<f2', (2,)), ('myfield', '<i8', (2,))])

这可以作为 dtype 传递给np.asarray，使事情对my_structured_array.dtype.

事实上，诸如此类的细微不一致会使处理记录阵列的工作变得一团糟。人们会觉得事情可以更加连贯地结合在一起。

编辑： 事实证明，该np.lib.recfunctions.merge_arrays功能更适合这种合并，

 >>> my_appended_array = merge_arrays([my_structured_array, my_new_field], flatten=True)
 array([(0, [4.30078125, 3.19921875], [1, 1]),
        (1, [1.400390625, 5.6015625], [1, 1]), (2, [6.0, 2.5], [1, 1]),
        (3, [4.5, 5.3984375], [1, 1])], 
       dtype=[('id', '|i1'), ('pos', '<f2', (2,)), ('myfield', '<i8', (2,))])

score 2 · Accepted Answer

append_fields()要求两个数组的形状相同，在这种情况下它们不是。打印出这两个数组将有助于它变得明显：

>>> my_structured_array
array([(0, [4.30078125, 3.19921875]), (1, [1.400390625, 5.6015625]),
       (2, [6.0, 2.5]), (3, [4.5, 5.3984375])], 
      dtype=[('id', '|i1'), ('pos', '<f2', (2,))])
>>> my_new_field
array([[1, 1],
       [1, 1],
       [1, 1],
       [1, 1]], dtype=int8)

如您所见，my_structured_array它是一个长度为 4 的数组，其中每个元素是一个包含两个对象的元组，一个 int 和一个包含两个浮点数的列表。

my_new_field另一方面，它是一个长度为 4 的数组，其中每个元素都是两个整数的列表。这就像尝试添加苹果和橙子。

使您的阵列具有相同的形状，它们将加在一起。

python - numpy append_field 为具有 2d 形状的新字段提供形状错误

2 回答 2

Related

Reference