以下是我试图为其生成解析器的 EBNF 格式(主要是 -此处记录了实际语法)语法:
expr = lambda_expr_list $;
lambda_expr_list = [ lambda_expr_list "," ] lambda_expr;
lambda_expr = conditional_expr [ "->" lambda_expr ];
conditional_expr = boolean_or_expr [ "if" conditional_expr "else" conditional_expr ];
boolean_or_expr = [ boolean_or_expr "or" ] boolean_xor_expr;
boolean_xor_expr = [ boolean_xor_expr "xor" ] boolean_and_expr;
boolean_and_expr = [ boolean_and_expr "and" ] boolean_not_expr;
boolean_not_expr = [ "not" ] relation;
relation = [ relation ( "=="
| "!="
| ">"
| "<="
| "<"
| ">="
| [ "not" ] "in"
| "is" [ "not" ] ) ] bitwise_or_expr;
bitwise_or_expr = [ bitwise_or_expr "|" ] bitwise_xor_expr;
bitwise_xor_expr = [ bitwise_xor_expr "^" ] bitwise_and_expr;
bitwise_and_expr = [ bitwise_and_expr "&" ] bitwise_shift_expr;
bitwise_shift_expr = [ bitwise_shift_expr ( "<<"
| ">>" ) ] subtraction_expr;
subtraction_expr = [ subtraction_expr "-" ] addition_expr;
addition_expr = [ addition_expr "+" ] division_expr;
division_expr = [ division_expr ( "/"
| "\\" ) ] multiplication_expr;
multiplication_expr = [ multiplication_expr ( "*"
| "%" ) ] negative_expr;
negative_expr = [ "-" ] positive_expr;
positive_expr = [ "+" ] bitwise_not_expr;
bitwise_not_expr = [ "~" ] power_expr;
power_expr = slice_expr [ "**" power_expr ];
slice_expr = member_access_expr { subscript };
subscript = "[" slice_defn_list "]";
slice_defn_list = [ slice_defn_list "," ] slice_defn;
slice_defn = lambda_expr
| [ lambda_expr ] ":" [ [ lambda_expr ] ":" [ lambda_expr ] ];
member_access_expr = [ member_access_expr "." ] function_call_expr;
function_call_expr = atom { parameter_list };
parameter_list = "(" [ lambda_expr_list ] ")";
atom = identifier
| scalar_literal
| nary_literal;
identifier = /[_A-Za-z][_A-Za-z0-9]*/;
scalar_literal = float_literal
| integer_literal
| boolean_literal;
float_literal = point_float_literal
| exponent_float_literal;
point_float_literal = /[0-9]+?\.[0-9]+|[0-9]+\./;
exponent_float_literal = /([0-9]+|[0-9]+?\.[0-9]+|[0-9]+\.)[eE][+-]?[0-9]+/;
integer_literal = dec_integer_literal
| oct_integer_literal
| hex_integer_literal
| bin_integer_literal;
dec_integer_literal = /[1-9][0-9]*|0+/;
oct_integer_literal = /0[oO][0-7]+/;
hex_integer_literal = /0[xX][0-9a-fA-F]+/;
bin_integer_literal = /0[bB][01]+/;
boolean_literal = "true"
| "false";
nary_literal = tuple_literal
| list_literal
| dict_literal
| string_literal
| byte_string_literal;
tuple_literal = "(" [ lambda_expr_list ] ")";
list_literal = "[" [ ( lambda_expr_list
| list_comprehension ) ] "]";
list_comprehension = lambda_expr "for" lambda_expr_list "in" lambda_expr [ "if" lambda_expr ];
dict_literal = "{" [ ( dict_element_list
| dict_comprehension ) ] "}";
dict_element_list = [ dict_element_list "," ] dict_element;
dict_element = lambda_expr ":" lambda_expr;
dict_comprehension = dict_element "for" lambda_expr_list "in" lambda_expr [ "if" lambda_expr ];
string_literal = /[uU]?[rR]?(\u0027(\\.|[^\\\r\n\u0027])*\u0027|\u0022(\\.|[^\\\r\n\u0022])*\u0022)/;
byte_string_literal = /[bB][rR]?(\u0027(\\[\u0000-\u007F]|[\u0000-\u0009\u000B-\u000C\u000E-\u0026\u0028-\u005B\u005D-\u007F])*\u0027|\u0022(\\[\u0000-\u007F]|[\u0000-\u0009\u000B-\u000C\u000E-\u0021\u0023-\u005B\u005D-\u007F])*\u0022)/;
我用来生成解析器的工具是Grako,它生成了一个修改过的 Packrat 解析器,声称支持直接和间接左递归。
当我在这个字符串上运行生成的解析器时:
input.filter(e -> e[0] in ['t', 'T']).map(e -> (e.len().str(), e)).map(e -> '(Line length: ' + e[0] + ') ' + e[1]).list()
我收到以下错误:
grako.exceptions.FailedParse: (1:13) Expecting end of text. :
input.filter(e -> e[0] in ['t', 'T']).map(e -> (e.len().str(), e)).map(e -> '(Line length: ' + e[0] + ') ' + e[1]).list()
^
expr
调试表明解析器似乎到达了 first 的末尾e[0]
,然后永远不会回溯到/到达它将尝试匹配in
令牌的点。
我的语法是否存在一些问题,以至于支持左递归的 Packrat 解析器会失败?或者我应该在 Grako 问题跟踪器上提交问题吗?