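In this post there is a Python example that converts csv to xls.
However, my file has more than 65536 rows, so xls does not work. Renaming the output file to xlsx makes no difference. Is there a Python package that converts to xlsx?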
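For context, the legacy .xls format holds at most 65,536 rows per sheet, while .xlsx allows 1,048,576. Below is a minimal sketch, using only the standard library, for checking whether a given CSV really exceeds the .xls limit. The helper names `count_csv_rows` and `needs_xlsx` are made up for illustration and do not come from any package.

```python
import csv

XLS_MAX_ROWS = 65536  # row limit of the legacy .xls format mentioned in the question


def count_csv_rows(path, encoding="utf8"):
    """Count parsed CSV records (hypothetical helper, not from any library)."""
    # newline="" lets the csv module handle line endings inside quoted fields
    with open(path, newline="", encoding=encoding) as f:
        return sum(1 for _ in csv.reader(f))


def needs_xlsx(path):
    """Return True when the file has more rows than an .xls sheet can hold."""
    return count_csv_rows(path) > XLS_MAX_ROWS
```

Counting through `csv.reader` rather than counting raw lines means a quoted field that contains a line break is still counted as one row.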
Here is an example using xlsxwriter:
import os
import glob
import csv
from xlsxwriter.workbook import Workbook
for csvfile in glob.glob(os.path.join('.', '*.csv')):
    # One workbook per CSV file, saved next to it with an .xlsx extension
    workbook = Workbook(csvfile[:-4] + '.xlsx')
    worksheet = workbook.add_worksheet()
    with open(csvfile, 'rt', encoding='utf8') as f:
        reader = csv.reader(f)
        for r, row in enumerate(reader):
            for c, col in enumerate(row):
                worksheet.write(r, c, col)
    workbook.close()
FYI, there is also a package called openpyxl that can read/write Excel 2007 xlsx/xlsm files.
Hope that helps.
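The xlsxwriter example derives the output name with `csvfile[:-4] + '.xlsx'`, which assumes the input name ends in exactly four characters of extension. A small sketch of an alternative using `pathlib` from the standard library; `xlsx_name` is a hypothetical helper name, not part of xlsxwriter:

```python
from pathlib import Path


def xlsx_name(csv_path):
    """Return the .xlsx path that sits next to a given CSV file."""
    # with_suffix replaces only the last extension, whatever its length or case
    return Path(csv_path).with_suffix(".xlsx")
```

In the loop this could be used as `Workbook(str(xlsx_name(csvfile)))`.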
With my library pyexcel,
$ pip install pyexcel pyexcel-xlsx
you can do it in a single call:
from pyexcel.cookbook import merge_all_to_a_book
# import pyexcel.ext.xlsx # no longer required if you use pyexcel >= 0.2.2
import glob
merge_all_to_a_book(glob.glob("your_csv_directory/*.csv"), "output.xlsx")
Each csv gets its own sheet, named after its file name.
First install openpyxl:
pip install openpyxl
Then:
from openpyxl import Workbook
import csv
wb = Workbook()
ws = wb.active
with open('test.csv', 'r') as f:
    for row in csv.reader(f):
        # Each CSV row becomes the next worksheet row
        ws.append(row)
wb.save('name.xlsx')
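One thing to be aware of with the loop above: `csv.reader` yields every field as a string, so numbers end up stored as text in the worksheet. If numeric cells are wanted, a conversion step along these lines might help. This is only a sketch, and `coerce_cell` is a hypothetical helper, not an openpyxl function.

```python
def coerce_cell(value):
    """Turn a CSV string into int or float when it parses cleanly, else keep it."""
    for convert in (int, float):
        try:
            return convert(value)
        except ValueError:
            continue
    return value


row = ["widget", "42", "3.5", ""]
converted = [coerce_cell(v) for v in row]
```

In the openpyxl loop this would become `ws.append([coerce_cell(v) for v in row])`. Note that identifiers with leading zeros (for example "007") would be turned into the number 7, so this is not appropriate for columns such as postal codes.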
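Adding an answer that uses only the pandas library to read a .csv file and save it as a .xlsx file. This example uses pandas.read_csv and pandas.DataFrame.to_excel.
The fully reproducible example uses numpy only to generate random numbers; that part can be removed if you want to use your own .csv file.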
import pandas as pd
import numpy as np
# Creating a dataframe and saving as test.csv in current directory
df = pd.DataFrame(np.random.randn(100000, 3), columns=list('ABC'))
df.to_csv('test.csv', index = False)
# Reading in test.csv and saving as test.xlsx
df_new = pd.read_csv('test.csv')
# ExcelWriter.save() was removed in pandas 2.0; the context manager saves on exit
with pd.ExcelWriter('test.xlsx') as writer:
    df_new.to_excel(writer, index = False)
A simple two-line solution using pandas:
import pandas as pd
read_file = pd.read_csv('File name.csv')
read_file.to_excel('File name.xlsx', index=False, header=True)
Simple 1-to-1 CSV to XLSX file conversion without enumerating/looping through the rows:
import pyexcel
sheet = pyexcel.get_sheet(file_name="myFile.csv", delimiter=",")
sheet.save_as("myFile.xlsx")
How I do it with the openpyxl lib:
import csv
from openpyxl import Workbook
def convert_csv_to_xlsx():
    wb = Workbook()
    sheet = wb.active
    CSV_SEPARATOR = "#"  # field separator used by the input file
    with open("my_file.csv", newline="") as f:
        # Let csv.reader split on the separator instead of splitting each field by hand
        reader = csv.reader(f, delimiter=CSV_SEPARATOR)
        for r, row in enumerate(reader):
            for c, val in enumerate(row):
                # openpyxl rows and columns are 1-based
                sheet.cell(row=r + 1, column=c + 1).value = val
    wb.save("my_file.xlsx")
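The answer above hard-codes "#" as the separator. When the separator is not known in advance, the standard library's `csv.Sniffer` can try to guess it from a sample of the file. A minimal sketch follows; `detect_delimiter` and the candidate list are assumptions made for illustration, and `sniff` raises `csv.Error` when it cannot decide.

```python
import csv


def detect_delimiter(sample, candidates=",;#\t|"):
    """Guess the delimiter of a CSV text sample (hypothetical helper)."""
    # Restricting the candidates keeps the sniffer from picking ordinary letters
    dialect = csv.Sniffer().sniff(sample, delimiters=candidates)
    return dialect.delimiter


sample = "id#name#price\n1#apple#0.5\n2#pear#0.75\n"
delimiter = detect_delimiter(sample)
rows = list(csv.reader(sample.splitlines(), delimiter=delimiter))
```

The detected value can then be passed as `delimiter=` to `csv.reader` when reading the real file.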
There is a simple way:
import csv
from openpyxl import Workbook

if __name__ == '__main__':
    workbook = Workbook()
    worksheet = workbook.active
    # Python 3: pass the encoding to open() instead of calling sys.setdefaultencoding
    with open('input.csv', 'r', encoding='utf8', newline='') as f:
        reader = csv.reader(f)
        for r, row in enumerate(reader):
            for c, col in enumerate(row):
                # csv.reader has already split the fields; openpyxl cells are 1-based
                worksheet.cell(row=r + 1, column=c + 1).value = col
    workbook.save('output.xlsx')
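On the encoding side, one case that may come up is a CSV that starts with a UTF-8 byte order mark (BOM), which some tools write on export. Read as plain utf8, the BOM ends up inside the first cell; the `utf-8-sig` codec strips it. The sketch below uses only the standard library, with a made-up sample file and a hypothetical `read_rows` helper:

```python
import csv
import os
import tempfile

# Write a small CSV that starts with a UTF-8 byte order mark (BOM).
path = os.path.join(tempfile.mkdtemp(), "bom.csv")
with open(path, "wb") as f:
    f.write(b"\xef\xbb\xbfname,city\nZo\xc3\xab,K\xc3\xb6ln\n")


def read_rows(path, encoding):
    """Read all CSV rows from path using the given text encoding."""
    with open(path, newline="", encoding=encoding) as f:
        return list(csv.reader(f))


plain = read_rows(path, "utf8")       # first header cell keeps the BOM character
clean = read_rows(path, "utf-8-sig")  # BOM is stripped while decoding
```

`utf-8-sig` also reads files without a BOM correctly, so it is a reasonable choice when the origin of the CSV is unknown.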