-1

实际上,我构建了一个 Java 代码来解析以下文本文件:

     (FAMIX.Attribute (id: 22)
(name 'obj_I')
(parentType (ref: 11))
(declaredType (ref: 27))
(isPrivate true)
   )

   (FAMIX.Attribute (id: 38)
(name 'obj_k')
(parentType (ref: 34))
(declaredType (ref: 43))
(isPrivate true)
   )

  (FAMIX.Attribute (id: 56)
(name 'obj_K')
(parentType (ref: 46))
(declaredType (ref: 43))
(isPrivate true)
    )

  (FAMIX.Attribute (id: 73)
(name 'obj_L')
(parentType (ref: 64))
(declaredType (ref: 45))
(isPrivate true)
    )

 (FAMIX.Attribute (id: 67)
(name 'obj_G')
(parentType (ref: 64))
(declaredType (ref: 46))
(isPrivate true)
    )

 (FAMIX.Attribute (id: 93)
(name 'classD')
(parentType (ref: 85))
(declaredType (ref: 94))
(isPrivate true)
   )

  (FAMIX.Attribute (id: 99)
(name 'classC')
(parentType (ref: 86))
(declaredType(ref: 86))
(isPackage true)
    )

 (FAMIX.Attribute (id: 114)
(name 'classB')
(parentType (ref: 94))
(declaredType (ref: 11))
(isPrivate true)
    )

  (FAMIX.Attribute (id: 107)
(name 'obj_c')
(parentType (ref: 94))
(declaredType (ref: 86))
(isPrivate true)
     )

Java 代码:

// Find Attributes

Pattern p111 = Pattern.compile("FAMIX.Attribute");

Matcher m111 = p111.matcher(line);
while (m111.find()) {

    FAMIXAttribute obj = new FAMIXAttribute();              
    Pattern p222 = Pattern.compile("id:\\s*([0-9]+)");
    Matcher m222 = p222.matcher(line);

    while (m222.find()) {
        System.out.print(m222.group(1));
    }

    while ((line = br.readLine()) != null && !(line.contains("FAMIX"))) {

        Pattern p333 = Pattern.compile("name\\s*'([\\w]+)\\s*'");
        Matcher m333 = p333.matcher(line);

        while (m333.find()) {       

            System.out.print(m333.group(1));
        }

        Pattern p555 = Pattern.compile("parentType\\s*\\(ref:\\s*([0-9]+)\\)");
        Matcher m555 = p555.matcher(line);
        while (m555.find()) {
           System.out.print(m555.group(1));
        }

        Pattern p666 =   Pattern.compile("declaredType\\s*\\(ref:\\s*([0-9]+)\\)");
        Matcher m666 = p666.matcher(line);
        while (m666.find()) {
           System.out.print(m666.group(1));
        } 

    }

} // exit from finding Attribute

输出:

     ***************** Attributes *****************
       obj_k    38   34   43
       obj_L    73   64   45
       classD   93   85   94
       classB   114  94   11   

根据输出,问题是解析器跳过一些输出(跳转)

如果问题不清楚,请告诉我,我会尽力进一步澄清。

4

2 回答 2

0

您忘记了检查IsPrivateorIsPackage部分的正则表达式

编辑:几个步骤将告诉您出了什么问题添加该行的打印输出以准确查看哪些行失败以及 Pattern 如何看到它们

     // Find Attributes
                System.out.print("***"+line+"***"); 
                Pattern p111 = Pattern.compile("FAMIX.Attribute");
                Matcher m111 = p111.matcher(line);
                while (m111.find()) {

"***"将让您了解关于 java.util 的确切开头和结尾。有时,看起来与眼睛相同的字符对于匹配器来说是不同的。

编辑 2:您的代码缺少外循环,其中 line 被第一次读取。您是否意识到代码:

                  while ((line = br.readLine()) != null && !(line.contains("FAMIX"))) {

消耗出现“FAMIX.Attribute”的下一行?如果您在(丢失的)外部循环中再次读取,您将丢失所有其他记录。

于 2013-02-18T18:20:18.667 回答
0

如果您确定该文件包含指定格式的行:

  • id, name, parentType, 中的每一个都declaredType必须在一行上完全声明。即你没有这样的输入:

    (FAMIX.Attribute (id:
    38)
    (name 
    'obj_k')
    (parentType 
      (ref: 34))
    (declaredType (ref: 43))
    (isPrivate true)
    )
    

    但这是允许的:

    (FAMIX.Attribute (id: 38)
    (name 'obj_k') (parentType (ref: 34)) (declaredType (ref: 43)) (isPrivate true))
    

这是下面的修改生效的前提条件。此假设源自您当前的代码。

String line;

FAMIXAttribute obj = new FAMIXAttribute();
boolean isModified = false;

while ((line = br.readLine()) != null) {
    if (line.contains("FAMIX.Attribute")) {
        if (isModified) {
            // TODO: Save the previous obj

            obj = new FAMIXAttribute();
            isModified = false;
        } 
    }

    // TODO: Add the block of code to parse id here
    // TODO: Add id attribute to obj, set isModified to true

    // TODO: Add the block of code to parse other stuffs here
    // TODO: Add those attributes to obj, set isModified to true
}

if (isModified) {
    // TODO: Save the last obj
}
于 2013-02-18T19:08:13.227 回答