我想加速将数据加载到 PostgreSQL。我开始使用 pgloader https://github.com/dimitri/pgloader并想利用并行加载。我正在修改不同的参数,但我无法在我的机器上激活两个以上的核心(其中有 32 个)。我找到了文档https://github.com/dimitri/pgloader/blob/master/pgloader.1.md并尝试设置那里描述的批处理选项。目前,我有这些设置:
LOAD CSV
FROM '/home/data1_1.csv'
--FROM 'data/data.csv'
INTO postgresql://:postgres@localhost:5432/test?test
WITH truncate,
skip header = 0,
fields optionally enclosed by '"',
fields escaped by double-quote,
fields terminated by ',',
batch rows = 100,
batch size = 1MB,
batch concurrency = 64
SET client_encoding to 'utf-8',
work_mem to '10000MB',
maintenance_work_mem to '20000 MB'