我正在尝试读取一个巨大的 csv 文件 CUDF,但出现内存问题。
import cudf
cudf.set_allocator("managed")
cudf.__version__
user_wine_rate_df = cudf.read_csv('myfile.csv',
sep = "\t",
parse_dates = ['created_at'])
'0.17.0a+382.gbd321d1e93'
terminate called after throwing an instance of 'thrust::system::system_error'
what(): parallel_for failed: cudaErrorIllegalAddress: an illegal memory access was encountered
Aborted (core dumped)
如果我删除cudf.set_allocator("managed")
我得到
MemoryError: std::bad_alloc: CUDA error at: /opt/conda/envs/rapids/include/rmm/mr/device/cuda_memory_resource.hpp:69: cudaErrorMemoryAllocation out of memory
我正在使用 CUDFrapidsai/rapidsai:cuda11.0-runtime-ubuntu16.04-py3.8
我想知道whar可能是命中内存的原因,而我可以用pandas读取这个大文件
**更新
我安装了dask_cudf
并使用dask_cudf.read_csv('myfile.csv')
- 但仍然得到
parallel_for failed: cudaErrorIllegalAddress: an illegal memory access was encountered