how to compare two files and print mismatched line number in python?

此生再无相见时 提交于 2021-01-22 21:21:37

问题


I have two files which contains same number of lines.

"file1.txt" contains following lines:

 Attitude is a little thing that makes a big difference
 The only disability in life is a bad attitude
 Abundance is, in large part, an attitude
 Smile when it hurts most

"file2.txt" contains:

 Attitude is a little thing that makes a big difference
 Everyone has his burden. What counts is how you carry it
 Abundance is, in large part, an attitude
 A positive attitude may not solve all your problems  

I want to compare two files line by line and if any of the line mismatches between two files i want to

 print "mismatch in line no: 2"
 print "mismatch in line no: 4"   #in this case lineno: 2 and lineno: 4 varies from second file

I tried.but i can print only the line in file1 which is differ from line in file2.can't able to print the line number of mismatched lines.??

 My code:
 with open("file1.txt") as f1:
    lineset = set(f1)
 with open("file2.txt") as f2:
    lineset.difference_update(f2)
    for line in lineset:
        print line

回答1:


Using itertools.izip and enumerate:

import itertools

with open('file1.txt') as f1, open('file2.txt') as f2:
    for lineno, (line1, line2) in enumerate(itertools.izip(f1, f2), 1):
        if line1 != line2:
            print 'mismatch in line no:', lineno



回答2:


What if:

with open("file1.txt") as f1:
    with open("file2.txt") as f2:
        for idx, (lineA, lineB) in enumerate(zip(f1, f2)):
            if lineA != lineB:
                print 'mismatch in line no: {0}'.format(idx)

Or if there are a different number of rows you can try izip_longest

import itertools

with open("file1.txt") as f1:
    with open("file2.txt") as f2:
        for idx, (lineA, lineB) in enumerate(itertools.izip_longest(f1, f2)):
            if lineA != lineB:
                print 'mismatch in line no: {0}'.format(idx)



回答3:


You might be able to use the difflib module. Here's a simple example using its difflib.Differ class:

import difflib
import sys

with open('file1.txt') as file1, open('file2.txt') as file2:
    line_formatter = '{:3d}  {}'.format
    file1_lines = [line_formatter(i, line) for i, line in enumerate(file1, 1)]
    file2_lines = [line_formatter(i, line) for i, line in enumerate(file2, 1)]
    results = difflib.Differ().compare(file1_lines, file2_lines)
    sys.stdout.writelines(results)

Output:

    1  Attitude is a little thing that makes a big difference
-   2  The only disability in life is a bad attitude
+   2  Everyone has his burden. What counts is how you carry it
    3  Abundance is, in large part, an attitude
-   4  Smile when it hurts most
+   4  A positive attitude may not solve all your problems

The minus and plus characters in the first column indicate lines that were replaced in typical diff utility program style. The absence of any indicator means the line was the same in both files -- you could suppress the printing of those if you wished, but to keep the example simple everything the compare() method creates is being printed.

For reference, here's the contents of the two files side-by-side with line numbers shown:

1  Attitude is a little thing that makes a big difference    Attitude is a little thing that makes a big difference
2  The only disability in life is a bad attitude             Everyone has his burden. What counts is how you carry it
3  Abundance is, in large part, an attitude                  Abundance is, in large part, an attitude
4  Smile when it hurts most                                  A positive attitude may not solve all your problems



回答4:


import itertools

with open('file1.txt') as f1, open('file2.txt') as f2:
    for lineno, (line1, line2) in enumerate(zip(f1, f2), 1):
        if line1 != line2:
            print ('mismatch in line no:', lineno)


来源:https://stackoverflow.com/questions/20686674/how-to-compare-two-files-and-print-mismatched-line-number-in-python

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!