Untar archive in Python with errors

匿名 (未验证) 提交于 2019-12-03 00:56:02

问题:

I download a bz2 file using Python. Then I want to unpack the archive using:

def unpack_file(dir, file):     cwd = os.getcwd()     os.chdir(dir)     print "Unpacking file %s" % file     cmd = "tar -jxf %s" % file     print cmd     os.system(cmd)     os.chdir(cwd) 

Unfortunately this ends with error:

bzip2: Compressed file ends unexpectedly;     perhaps it is corrupted?  *Possible* reason follows. bzip2: Inappropriate ioctl for device     Input file = (stdin), output file = (stdout)  It is possible that the compressed file(s) have become corrupted. You can use the -tvv option to test integrity of such files.  You can use the `bzip2recover' program to attempt to recover data from undamaged sections of corrupted files.  tar: Nieoczekiwany EOF w archiwum tar: Nieoczekiwany EOF w archiwum tar: Error is not recoverable: exiting now 

However I can unpack the archive from the shell without any problem.

Do you have any ideas what I do wrong?

回答1:

For the record, python standard library ships with the tarfile module which automatically handles tar, tar.bz2, and tar.gz formats.

Additionally, you can do nifty things like get file lists, extract subsets of files or directories or chunk the archive so that you process it in a streaming form (i.e. you don't have to decompress the whole file then untar it.. it does everything in small chunks)

import tarfile tar = tarfile.open("sample.tar.gz") tar.extractall() tar.close() 


回答2:

I would do it like this:

import tarfile target_folder = '.' with tarfile.open("sample.tar.gz") as tar:     tar.extractall(target_folder) 

That's it. tar / with takes care of the rest.

When you want to have the path to all the files:

import os filepaths = [] for (dirpath, dirnames, filenames) in walk(target_folder):     filepaths.extend([os.path.join(dirpath, f) for f in filenames]) 


标签
易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!