我想在python中从一个zip文件中提取一个特定的文件夹,然后在原始文件名之后重命名它。
例如,我有一个名为test.zip
包含多个文件夹和子文件夹的文件:
xl/media/image1.png
xl/drawings/stuff.png
stuff/otherstuff.png
我希望将媒体文件夹的内容提取到名为 test 的文件夹中:
test/image1.png
采用
zipfile
模块,特别是ZipFile.extractall()
os.path.splitext()
test1
从字符串
中获取test1.zip
tmpfile.mkdtemp()
创建一个临时目录shutil.move()
移动整个目录树。例如:
#!/usr/bin/env python
"""Usage:
./extract.py test.zip
"""
from zipfile import ZipFile
import os
import sys
import tempfile
import shutil
ROOT_PATH = 'xl/media/'
zip_name = sys.argv[1]
zip_path = os.path.abspath(zip_name)
extraction_dir = os.path.join(os.getcwd(), os.path.splitext(zip_name)[0])
temp_dir = tempfile.mkdtemp()
with ZipFile(zip_path, 'r') as zip_file:
# Build a list of only the members below ROOT_PATH
members = zip_file.namelist()
members_to_extract = [m for m in members if m.startswith(ROOT_PATH)]
# Extract only those members to the temp directory
zip_file.extractall(temp_dir, members_to_extract)
# Move the extracted ROOT_PATH directory to its final location
shutil.move(os.path.join(temp_dir, ROOT_PATH), extraction_dir)
# Uncomment if you want to delete the original zip file
# os.remove(zip_path)
print "Sucessfully extracted '%s' to '%s'" % (zip_path, extraction_dir)
使用try..except
块来处理在创建目录、删除文件和解压缩 zip 时可能发生的各种异常。