python - Pyparsing：空间作为有效令牌

Question

我正在使用 pyparser 处理十六进制到文本转换器的输出。它每行打印 16 个字符，以空格分隔。如果十六进制值是 ASCII 可打印字符，则打印该字符，否则转换器输出句点 (.)

大多数情况下，输出如下所示：

. a . v a l i d . s t r i n g .
. a n o t h e r . s t r i n g .
. e t c . . . . . . . . . . . .

我描述这一行的 pyparsing 代码是：

dump_line = 16 * Word(printables, exact=1)

这工作正常，直到十六进制到文本转换器达到 0x20 的十六进制值，这导致它输出一个空格。

l i n e . w . a .   s p a c e .

在这种情况下，pyparsing 会忽略输出的空格并占用下一行中的字符以形成 16 个字符的“配额”。

有人可以建议我如何告诉 pyparsing 期望 16 个字符，每个字符用空格分隔，其中空格也可以是有效字符？

提前致谢。Ĵ

score 6 · Accepted Answer

由于这有很大的空白，您需要告诉您的角色表达式不要单独留下前导空白。在下面的 dumpchar 定义中查看这是如何完成的：

hexdump = """\
. a . v a l i d . s t r i n g . 
. a n o t h e r . s t r i n g . 
. e t c . . . . . . . . . . . . 
l i n e . w . a .   s p a c e . 
. e t c . . . . . . . . . . . . 
"""

from pyparsing import oneOf, printables, delimitedList, White, LineEnd

# expression for a single char or space
dumpchar = oneOf(list(printables)+[' ']).leaveWhitespace()

# convert '.'s to something else, if you like; in this example, '_'
dumpchar.setParseAction(lambda t:'_' if t[0]=='.' else None)

# expression for a whole line of dump chars - intervening spaces will
# be discarded by delimitedList
dumpline = delimitedList(dumpchar, delim=White(' ',exact=1)) + LineEnd().suppress()

# if you want the intervening spaces, use this form instead
#dumpline = delimitedList(dumpchar, delim=White(' ',exact=1), combine=True) + LineEnd().suppress()

# read dumped lines from hexdump
for t in dumpline.searchString(hexdump):
    print ''.join(t)

印刷：

_a_valid_string_
_another_string_
_etc____________
line_w_a_ space_
_etc____________

score 1 · Accepted Answer

考虑使用另一种方法来删除空格

>>> s=". a . v a l i d . s t r i n g ."
>>> s=s[::2]
>>> s
'.a.valid.string.'

python - Pyparsing：空间作为有效令牌

2 回答 2

Related

Reference