ruby-on-rails - xlsx issue: creating an Excel worksheet with a large number of rows

Question

I am using Axlsx to create an Excel file. It works fine for a small dataset, but once the dataset gets large, it hangs. I ran strace on the process, and it makes a lot of brk calls.

require 'axlsx'

p = Axlsx::Package.new
book = p.workbook
book.add_worksheet(:name => "test") do |sheet|
  input_array.each do |input_data|
    # ... convert input_data to row_data
    sheet.add_row(row_data)
  end
end
# Write in binary mode, since an .xlsx file is a zip archive.
File.open("testfile", 'wb') { |f| f.write(p.to_stream().read) }

My input_array has about 400,000 entries, so the worksheet has 400,000 rows, which is very large. It gets stuck at p.to_stream().read. Any help would be great. Thanks.

score 5 · Accepted Answer

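Looks like I need to start paying attention to SO! This is randym (the author of axlsx).

There are a couple of things I would like to point out that should help you get what you need to do, well... done!

If you are writing to a file, consider Package#serialize - not because it is faster, but because it is less code for you to maintain.

p.serialize 'filename.xlsx'

There have been major performance improvements over the last few weeks. Please upgrade to 1.1.1. The gem no longer depends on RMagick, and use_autowidth = false is no longer required.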

https://github.com/randym/axlsx

Benchmarks on master:

Benchmarks w/40k rows:
                            user     system      total        real
axlsx_noautowidth      68.130000   1.690000  69.820000 ( 80.257108)
axlsx                  61.520000   2.290000  63.810000 ( 78.187423)
axlsx_shared           53.280000   1.170000  54.450000 ( 62.880780)
axlsx_stream           52.110000   1.360000  53.470000 ( 61.980672)
csv                    10.670000   0.930000  11.600000 ( 14.901387)

Benchmarks w/4k rows:
                            user     system      total        real
axlsx_noautowidth       4.880000   0.120000   5.000000 (  5.314383)
axlsx                   5.470000   0.110000   5.580000 (  5.853739)
axlsx_shared            5.720000   0.080000   5.800000 (  6.135263)
axlsx_stream            4.840000   0.090000   4.930000 (  5.194801)
csv                     1.090000   0.090000   1.180000 (  1.484763)

Here is the benchmark file:

https://gist.github.com/2411144
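
The tables above are in the format printed by Ruby's standard Benchmark module (user, system, total and real time, in seconds). As a rough sketch of how the csv baseline row could be timed using only the standard library: the sample row contents, the `build_rows` and `rows_to_csv` helpers, and the row count below are illustrative assumptions and are not taken from the linked gist.

```ruby
require 'benchmark'
require 'csv'

# Hypothetical helper: builds `count` rows of made-up sample data.
def build_rows(count)
  Array.new(count) { |i| [i, "name #{i}", i * 1.5] }
end

# Hypothetical helper: serializes the rows to a CSV string, one line per row.
def rows_to_csv(rows)
  CSV.generate do |csv|
    rows.each { |row| csv << row }
  end
end

rows = build_rows(4_000)

# Benchmark.bm prints the user / system / total / real columns.
# The label width (20) is an arbitrary choice for alignment.
Benchmark.bm(20) do |x|
  x.report('csv') { rows_to_csv(rows) }
end
```

An axlsx variant could be added as another `x.report` block in the same `Benchmark.bm` call, so that both run against the same `rows` array.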

Hope this helps.

score 2 · Accepted Answer

You could try disabling RMagick, which handles the column autowidth feature, since it is a very heavy process AFAIK.

a = Axlsx::Package.new
a.use_autowidth = false
