regex - 为 LEX 完成评论正则表达式

Question

我正在使用 Lex 和 Yacc 构建一个计算器编译器。这个想法基于以下资源：http ://epaperpress.com/lexandyacc/index.html 。

对于给定的输入文件，我需要识别所有注释：

//.TEST -- JWJ
//.Step final  -- testing all requirements
//.source: test-1m.cal
//.expected output: test-1m_expected.out

/**
 *  This program will use Newton's method to estimate the roots of


 This should be a comment as well, but does not get picked up


 *  f(x) = x^3 - 3*x 
 */
 float xn;
 float xo;
// int num_iterations;
 xo = 3.0;
 xn = 3.0;
 num_iterations = 1;

 /* A do-while loop */
 do {
  print xo;
  xo = xn;
  xn = xo - ( xo * xo * xo - 3.0 * xo  ) / ( 3.0 * xo * xo - 3.0);
  num_iterations = num_iterations + 1;
} while ( num_iterations <= 6 )

print xn; // The root found using Newton's method.
print (xo * xo * xo - 3.0 * xo ); // Print f(xn), which should be 0.

我在我的 lex 文件中使用以下正则表达式：

"//"[^\n]*|"\/\*".*"\*\/"
"\/\*"([^\n])*  
(.)*"\*\/"

我不明白为什么不匹配多行注释？有人可以提供一些见解吗？

score 6 · Accepted Answer

flex 中的.字符匹配任何字符，除了换行符（因此它与相同[^\n]）。因此，您的正则表达式都不匹配任何包含换行符的注释。

C 风格注释的常用正则表达式是：

"/*"([^*]|\*+[^*/])*\*+"/"

这匹配注释标记内的 0 个或多个“除 * 之外的任何内容”或“1 个或多个 *s 后不跟 * 或 /”。

score -1 · Accepted Answer

-1

The regular expression for comments in C or C++ program is like:

"//".*|"/*"(.*[\n].*)*"*/"

于 2017-07-06T05:15:46.503 回答

regex - 为 LEX 完成评论正则表达式

2 回答 2

Related

Reference