python - 剖析一行（混淆的？）Python

Question

我正在阅读关于 Stack Overflow 的另一个问题（Zen of Python），我在 Jaime Soriano 的回答中遇到了这一行：

import this
"".join([c in this.d and this.d[c] or c for c in this.s])

在 Python shell 中输入上述内容会打印：

"The Zen of Python, by Tim Peters\n\nBeautiful is better than ugly.\nExplicit is
better than implicit.\nSimple is better than complex.\nComplex is better than 
complicated.\nFlat is better than nested.\nSparse is better than dense.
\nReadability counts.\nSpecial cases aren't special enough to break the rules.
\nAlthough practicality beats purity.\nErrors should never pass silently.
\nUnless explicitly silenced.\nIn the face of ambiguity, refuse the temptation to
guess.\nThere should be one-- and preferably only one --obvious way to do it.
\nAlthough that way may not be obvious at first unless you're Dutch.\nNow is 
better than never.\nAlthough never is often better than *right* now.\nIf the 
implementation is hard to explain, it's a bad idea.\nIf the implementation is
easy to explain, it may be a good idea.\nNamespaces are one honking great idea 
-- let's do more of those!"

所以当然，我不得不用整个上午的时间来试图理解上面的列表……理解……事情。我不敢断然宣布它被混淆了，但这只是因为我已经编程了一个半月，所以不确定这种结构在 python 中是否司空见惯。

this.s包含上述打印输出的编码版本：

"Gur Mra bs Clguba, ol Gvz Crgref\n\nOrnhgvshy vf orggre guna htyl.\nRkcyvpvg vf orggre guna vzcyvpvg.\nFvzcyr vf orggre guna pbzcyrk.\nPbzcyrk vf orggre guna pbzcyvpngrq.\nSyng vf orggre guna arfgrq.\nFcnefr vf orggre guna qrafr.\nErnqnovyvgl pbhagf.\nFcrpvny pnfrf nera'g fcrpvny rabhtu gb oernx gur ehyrf.\nNygubhtu cenpgvpnyvgl orngf chevgl.\nReebef fubhyq arire cnff fvyragyl.\nHayrff rkcyvpvgyl fvyraprq.\nVa gur snpr bs nzovthvgl, ershfr gur grzcgngvba gb thrff.\nGurer fubhyq or bar-- naq cersrenoyl bayl bar --boivbhf jnl gb qb vg.\nNygubhtu gung jnl znl abg or boivbhf ng svefg hayrff lbh'er Qhgpu.\nAbj vf orggre guna arire.\nNygubhtu arire vf bsgra orggre guna *evtug* abj.\nVs gur vzcyrzragngvba vf uneq gb rkcynva, vg'f n onq vqrn.\nVs gur vzcyrzragngvba vf rnfl gb rkcynva, vg znl or n tbbq vqrn.\nAnzrfcnprf ner bar ubaxvat terng vqrn -- yrg'f qb zber bs gubfr!"

并this.d包含一个带有解码密码的字典this.s：

{'A': 'N', 'C': 'P', 'B': 'O', 'E': 'R', 'D': 'Q', 'G': 'T', 'F': 'S', 'I': 'V', 'H': 'U', 'K': 'X', 'J': 'W', 'M': 'Z', 'L': 'Y', 'O': 'B', 'N': 'A', 'Q': 'D', 'P': 'C', 'S': 'F', 'R': 'E', 'U': 'H', 'T': 'G', 'W': 'J', 'V': 'I', 'Y': 'L', 'X': 'K', 'Z': 'M', 'a': 'n', 'c': 'p', 'b': 'o', 'e': 'r', 'd': 'q', 'g': 't', 'f': 's', 'i': 'v', 'h': 'u', 'k': 'x', 'j': 'w', 'm': 'z', 'l': 'y', 'o': 'b', 'n': 'a', 'q': 'd', 'p': 'c', 's': 'f', 'r': 'e', 'u': 'h', 't': 'g', 'w': 'j', 'v': 'i', 'y': 'l', 'x': 'k', 'z': 'm'}

据我所知，Jaime 代码中的执行流程是这样的：
1. 循环c for c in this.s为 c 赋值
2. 如果语句的c in this.d计算结果为 True，则“and”语句会执行恰好在其右侧的任何内容，在这种情况下this.d[c]。
3. 如果该语句的c in this.d计算结果为 False（这在 Jaime 的代码中从未发生过），“或”语句将执行恰好位于其右侧的任何内容，在本例中为循环c for c in this.s。

我对那个流程是否正确？

即使我对执行顺序的看法是正确的，这仍然给我留下了很多问题。为什么 <1> 是第一个执行的，即使它的代码在几个条件语句之后出现在最后一行？换句话说，为什么for循环开始执行并赋值，但实际上只是在代码执行的后期才返回一个值，如果有的话？

此外，对于奖励积分，Zen 文件中关于荷兰语的奇怪行是什么？

编辑：虽然现在说出来让我感到羞耻，但直到三秒钟前，我还以为 Guido van Rossum 是意大利人。在阅读了他的 Wikipedia 文章后，我至少掌握了（如果不是完全理解的话）为什么会有那条线。

score 11 · Accepted Answer

列表理解行中的运算符关联如下：

"".join([(((c in this.d) and this.d[c]) or c) for c in this.s])

删除列表理解：

result = []
for c in this.s:
   result.append(((c in this.d) and this.d[c]) or c)
print "".join(result)

删除用于模拟-语句的and/布尔技巧：orifelse

result = []
for c in this.s:
   if c in this.d:
      result.append(this.d[c])
   else:
      result.append(c)
print "".join(result)

score 2 · Accepted Answer

您对流程是正确的。

循环是一种[dosomething(c) for c in this.s]列表理解，应该被视为 this.s 中所有 c 的内容。

荷兰语部分是关于 Python 的 Guido Van Rossum 的创造者是荷兰人。

score 2 · Accepted Answer

你的分析很接近。这是一个列表理解。（顺便说一句，如果消除外部方括号，将产生相同的输出，这将被称为生成器理解）

这里有一些文档。

列表推导的基本形式是

[expression for var in enumerable if condition]

它们按以下顺序进行评估：

可枚举的被评估
每个值依次分配给 var
检查条件
表达式被评估

结果是条件为真的枚举中每个元素的表达式值列表。

这个例子没有使用条件，所以在添加一些括号后剩下的是：

[(c in this.d and this.d[c] or c) for c in (this.s)]

this.s是可枚举的。c是迭代变量。c in this.d and this.d[c] or c是表达式。

c in this.d and this.d[c] or c使用 python 逻辑运算符的短路特性来实现与this.d[c] if c in this.d else c.

总而言之，我根本不会称之为混淆。一旦你理解了列表推导的威力，它就会看起来很自然。

score 2 · Accepted Answer

通常，列表推导具有以下形式：

[ expression for var in iterator ]

当我写下一个列表理解时，我通常从写作开始

[ for var in iterator ]

因为多年的过程编程已经将 for-loop 方面作为第一部分灌输到我的脑海中。

而且，正如您正确指出的那样，for 循环是似乎首先“执行”的部分。

对于每次通过循环，都会计算表达式。（小点：计算表达式，执行语句。）

所以在这种情况下，我们有

[ expression for c in this.s ]

this.s 是一个字符串。在 Python 中，字符串是迭代器！当你写

for c in some_string:

循环遍历字符串中的字符。因此c，按顺序处理 this.s 中的每个字符。

现在表达式是

c in this.d and this.d[c] or c

这就是所谓的三元运算。该链接解释了逻辑，但基本思想是

if c in this.d:
    the expression evaluates to this.d[c]
else:
    the expression evaluates c

因此，条件c in this.d只是检查 dictthis.d是否有一个带有 value 的键c。如果是，则返回this.d[c]，如果不是，则返回c自身。

另一种写法是

[this.d.get(c,c) for c in this.s]

（get 方法的第二个参数是第一个参数不在字典中时返回的默认值）。

PS。三元形式

condition and value1 or value2

容易出错。（考虑如果 is True，但是 is 会发生什么。condition由于value1是True None，您可能condition期望三元形式的计算结果为意识到这个陷阱，三元形式可能会引入错误。）value1NoneNoneFalsevalue2

对于现代版本的 Python，更好的编写方法是

value1 if condition else value2

它不易受上述陷阱的影响。如果condition为 True，则表达式的计算结果始终为value1。

但在上述特定情况下，我更喜欢this.d.get(c,c).

score 2 · Accepted Answer

"".join([c in this.d and this.d[c] or c for c in this.s])肯定是混淆了。这是一个禅宗版本：

this.s.decode('rot13')

score 0 · Accepted Answer

我的现代 if else 和生成器版本：

import this ## prints zenofpython
print '-'*70
whatiszenofpython = "".join(this.d[c] if c in this.d else c for c in this.s)
zen = ''
for c in this.s:
    zen += this.d[c] if c in this.d else c
print zen

口头版本：导入 this，它的主程序解扰并打印消息 this.s 要解扰消息，将在 dict this.d 中找到的那些字母替换为它们的解码对应部分（大写/小写不同）。其他字母不需要更改，而是按原样打印。

python - 剖析一行（混淆的？）Python

6 回答 6

Related

Reference