6

我有一个写入 xslx 文件的熊猫数据框,并且想在该数据上添加一个表。我还想保留我已经写过的标题,而不是再次添加它们。那可能吗?

例子:

import pandas as pd
import xlsxwriter as xw

# random dataframe
d = {'one' : pd.Series([1., 2., 3.], index=['a', 'b', 'c']), 'two' : pd.Series([5., 6., 7., 8.], index=['a', 'b', 'c', 'd'])}
df = pd.DataFrame(d)


# write data to file
writer = pd.ExcelWriter("test.xlsx", engine='xlsxwriter')
df.to_excel(writer,"sheet without table")
df.to_excel(writer,"sheet with table")
df.to_excel(writer,"sheet with table and header")

# get sheets to add the tables
workbook  = writer.book
worksheet_table = writer.sheets['sheet with table']
worksheet_table_header = writer.sheets['sheet with table and header']

# the range in which the table is
end_row = len(df.index)
end_column = len(df.columns)
cell_range = xw.utility.xl_range(0, 0, end_row, end_column)


# add the table that will delete the headers
worksheet_table.add_table(cell_range,{'header_row': True,'first_column': True})

######################################
# The hack

# Using the index in the Table
df.reset_index(inplace=True)
header = [{'header': di} for di in df.columns.tolist()]
worksheet_table_header.add_table(cell_range,{'header_row': True,'first_column': True,'columns':header})

writer.save()
4

4 回答 4

7

hack / work around 是唯一的选择(从@jmcnamara 可以看出)。简而言之就是:

import pandas as pd
import xlsxwriter as xw

# random dataframe
d = {'one' : pd.Series([1., 2., 3.], index=['a', 'b', 'c']), 'two' : pd.Series([5., 6., 7., 8.], index=['a', 'b', 'c', 'd'])}
df = pd.DataFrame(d)


# write data to file
writer = pd.ExcelWriter("test.xlsx", engine='xlsxwriter')
df.to_excel(writer,"sheet with table and header")

# get sheets to add the tables
workbook  = writer.book
worksheet_table_header = writer.sheets['sheet with table and header']

# the range in which the table is
end_row = len(df.index)
end_column = len(df.columns)
cell_range = xw.utility.xl_range(0, 0, end_row, end_column)

######################################
# The hack

# Using the index in the Table
df.reset_index(inplace=True)
header = [{'header': di} for di in df.columns.tolist()]
worksheet_table_header.add_table(cell_range,{'header_row': True,'first_column': True,'columns':header})

writer.save()
于 2016-04-26T06:19:01.113 回答
6

怎么样(请注意,仅当数据框包含 NA 时才需要“选项”):

import pandas as pd
import xlsxwriter

# random dataframe
d = {'one':pd.Series([1., 2., 3.]), 'two':pd.Series([5., 6., 7., 8.])}
df = pd.DataFrame(d)

workbook = xlsxwriter.Workbook('test.xlsx', options={'nan_inf_to_errors': True})
worksheet = workbook.add_worksheet('sheet1')
worksheet.add_table(0, 0, df.shape[0], df.shape[1]-1,
    {'data': df.values.tolist(),
    'columns': [{'header': c} for c in df.columns.tolist()],
    'style': 'Table Style Medium 9'})
workbook.close()
于 2019-03-27T16:05:06.730 回答
1

我还想保留我已经写过的标题,而不是再次添加它们。那可能吗?

不。

您的第三个解决方案worksheet_table_header可能是实现它的最佳方式。

于 2016-04-25T12:10:41.920 回答
1

在使用 xlsxwriter 版本 0.9.6 时,我不得不修改 @jmcnamara 的 hack。我必须从列数中减去一个,否则我会得到一个不在 pandas.DataFrame 中的额外列(请参阅 end_column 分配)。下面的修改版本(熊猫版本 0.19.2)。

import pandas as pd
import xlsxwriter

# random dataframe
d = {'one':pd.Series([1., 2., 3.]), 'two':pd.Series([5., 6., 7., 8.])}
df = pd.DataFrame(d)
print df

# write data to file
writer = pd.ExcelWriter("test.xlsx", engine='xlsxwriter')
df.to_excel(writer, 'sheet1', index=False)

# get sheets to add the tables
ws = writer.sheets['sheet1']

# the range in which the table is
end_row = len(df.index)
end_column = len(df.columns) - 1
cell_range = xlsxwriter.utility.xl_range(0, 0, end_row, end_column)

######################################
# The hack
header = [{'header': c} for c in df.columns.tolist()]
ws.add_table(cell_range,{'header_row': True, 'columns':header, 'style':'Table Style Medium 11'})
ws.freeze_panes(1, 1)
writer.save()
writer.close()
于 2017-04-21T14:16:49.803 回答