4

我有两个包含相同行数的文件。

"file1.txt" contains following lines:

 Attitude is a little thing that makes a big difference
 The only disability in life is a bad attitude
 Abundance is, in large part, an attitude
 Smile when it hurts most

"file2.txt" contains:

 Attitude is a little thing that makes a big difference
 Everyone has his burden. What counts is how you carry it
 Abundance is, in large part, an attitude
 A positive attitude may not solve all your problems  

我想逐行比较两个文件,如果两个文件之间的任何行不匹配,我想

 print "mismatch in line no: 2"
 print "mismatch in line no: 4"   #in this case lineno: 2 and lineno: 4 varies from second file

我试过了。但我只能打印 file1 中与 file2 中的行不同的行。无法打印不匹配行的行号。??

 My code:
 with open("file1.txt") as f1:
    lineset = set(f1)
 with open("file2.txt") as f2:
    lineset.difference_update(f2)
    for line in lineset:
        print line
4

4 回答 4

8

使用itertools.izipenumerate

import itertools

with open('file1.txt') as f1, open('file2.txt') as f2:
    for lineno, (line1, line2) in enumerate(itertools.izip(f1, f2), 1):
        if line1 != line2:
            print 'mismatch in line no:', lineno
于 2013-12-19T16:27:45.693 回答
2

如果:

with open("file1.txt") as f1:
    with open("file2.txt") as f2:
        for idx, (lineA, lineB) in enumerate(zip(f1, f2)):
            if lineA != lineB:
                print 'mismatch in line no: {0}'.format(idx)

或者,如果行数不同,您可以尝试izip_longest

import itertools

with open("file1.txt") as f1:
    with open("file2.txt") as f2:
        for idx, (lineA, lineB) in enumerate(itertools.izip_longest(f1, f2)):
            if lineA != lineB:
                print 'mismatch in line no: {0}'.format(idx)
于 2013-12-19T16:27:00.097 回答
1

您也许可以使用该difflib模块。difflib.Differ这是一个使用其类的简单示例:

import difflib
import sys

with open('file1.txt') as file1, open('file2.txt') as file2:
    line_formatter = '{:3d}  {}'.format
    file1_lines = [line_formatter(i, line) for i, line in enumerate(file1, 1)]
    file2_lines = [line_formatter(i, line) for i, line in enumerate(file2, 1)]
    results = difflib.Differ().compare(file1_lines, file2_lines)
    sys.stdout.writelines(results)

输出:

    1  Attitude is a little thing that makes a big difference
-   2  The only disability in life is a bad attitude
+   2  Everyone has his burden. What counts is how you carry it
    3  Abundance is, in large part, an attitude
-   4  Smile when it hurts most
+   4  A positive attitude may not solve all your problems

第一列中的减号和加号表示以典型diff实用程序样式替换的行。没有任何指示符意味着这两个文件中的行是相同的——如果您愿意,您可以禁止打印这些行,但为了保持示例简单,该compare()方法创建的所有内容都将被打印。

作为参考,以下是两个文件的内容并排显示,并显示了行号:

1  Attitude is a little thing that makes a big difference    Attitude is a little thing that makes a big difference
2  The only disability in life is a bad attitude             Everyone has his burden. What counts is how you carry it
3  Abundance is, in large part, an attitude                  Abundance is, in large part, an attitude
4  Smile when it hurts most                                  A positive attitude may not solve all your problems
于 2013-12-19T17:37:49.820 回答
0
import itertools

with open('file1.txt') as f1, open('file2.txt') as f2:
    for lineno, (line1, line2) in enumerate(zip(f1, f2), 1):
        if line1 != line2:
            print ('mismatch in line no:', lineno)
于 2020-07-05T11:55:07.050 回答