python - Python Pandas：多列的布尔索引

Question

尽管至少有两个关于如何在 Python 库中索引 DataFrame 的好pandas教程，但我仍然无法找到一种优雅的方式来SELECT处理多个列。

>>> d = pd.DataFrame({'x':[1, 2, 3, 4, 5], 'y':[4, 5, 6, 7, 8]})
>>> d
   x  y
0  1  4
1  2  5
2  3  6
3  4  7
4  5  8
>>> d[d['x']>2] # This works fine
   x  y
2  3  6
3  4  7
4  5  8
>>> d[d['x']>2 & d['y']>7] # I had expected this to work, but it doesn't
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ValueError: The truth value of an array with more than one element is ambiguous. Use a.any() or a.all()

我发现（我认为是）一种相当不雅的方式，就像这样

>>> d[d['x']>2][d['y']>7]

但它并不漂亮，而且它的可读性得分相当低（我认为）。

有没有更好、更符合 Python 的方式？

score 87 · Accepted Answer

这是一个优先运算符问题。

您应该添加额外的括号以使您的多条件测试正常工作：

d[(d['x']>2) & (d['y']>7)]

您提到的教程的这一部分显示了一个带有几个布尔条件的示例，并使用了括号。

score 1 · Accepted Answer

可能还有更好的方法，但是

In [56]: d[d['x'] > 2] and d[d['y'] > 7]
Out[56]: 
   x  y
4  5  8

作品。

python - Python Pandas：多列的布尔索引

2 回答 2

Related

Reference