问题标签 [cudf]

问问题

For questions regarding programming in ECMAScript (JavaScript/JS) and its various dialects/implementations (excluding ActionScript). Note JavaScript is NOT the same as Java! Please include all relevant tags on your question; e.g., [node.js], [jquery], [json], [reactjs], [angular], [ember.js], [vue.js], [typescript], [svelte], etc.

95 问题

0 投票

0 回答

62 浏览

cupy - 如何在没有数据类型错误的情况下使用 cuml 进行网格搜索？

我尝试使用 cuml 进行网格搜索。(rapids 21.10) 我得到一个cupy转换错误。如果我在没有网格搜索的情况下使用相同的数据集构建模型，则不会发生这种情况。它也适用于不位于 Videomemory 中的数据，但它显然比 cpu 慢。X 的数据是 float32，y 的数据是 int32：

错误：

2021-11-28T00:37:16.040

0 投票

1 回答

46 浏览

python - 日期时间与 cudf 和 pandas 的兼容性，用于在 python 中过滤日期时间

我想测试cudf但坚持按日期时间过滤的第一个简单任务。代码适用于 pandas，但不适用于cudf.

testdata.csv 看起来像

随着 cudf 抛出“KeyError：'Datetime'”

python pandas datetime cudf

2021-12-28T19:17:26.650

0 投票

1 回答

26 浏览

python - 如何将 cuDF 数据框传递给 cuML.ensemble.RandomForestClassifier？

我正在尝试将数据拟合到，但cuml.ensemble.RandomForestClassifier我不断收到错误消息：“标签需要是从 0 到唯一标签值数量的连续值”

我将 cudf.DataFrame 对象传递给具有相同行数但列数不同的函数。列标签从 0 开始，逐步增加 1 直到最后一列（在下面的示例中为 108）。我究竟做错了什么？我附上了我在下面传递的数据帧的打印输出和一些上下文代码：

clf1 = modelClass(max_depth=D1, random_state=random.randrange(0, 1024, 1), n_bins=15, n_streams=4, split_criterion=criterion, bootstrap=bootstrap, n_estimators=trs1)

clf1.fit(X1, Y1)

X1 的数据框如下所示：

	0	1	2
0	1.000000e-11	1.000000e-11	1.647421e-01
1	1.000000e-11	1.000000e-11	1.760000e-02
2	1.000000e-11	1.000000e-11	-1.772000e-01
3	1.000000e-11	1.000000e-11	8.254000e-01
4	1.000000e-11	1.000000e-11	2.587000e-01
...	...	...	...
5402	1.000000e-11	1.000000e-11	1.704444e-01
5403	1.000000e-11	1.000000e-11	-1.860000e-01
5404	0.000000e+00	1.000000e-11	1.229714e-01
5405	1.000000e-11	1.959500e-01	1.984667e-01
5406	1.000000e-11	1.000000e-11	1.000000e-11

[5407 行 x 3 列]；dtype=('0', dtype('float64')); <cudf.core.dataframe._DataFrameLocIndexer 对象位于 0x7f9c3d0f3070>

Y1 的 Dataframe 如下所示：

	0
0	-2.0
1	4.0
2	-3.0
3	1.0
4	0.0
...	...
5402	0.0
5403	-2.0
5404	0.0
5405	0.0
5406	0.0

[5407 行 x 1 列]；dtype=('0', dtype('float64')); <cudf.core.dataframe._DataFrameLocIndexer 对象位于 0x7f9c1b847b50>

系统信息：Ubuntu 20.04、Titan RTX、CUDA 11.5、Rapids 21.12 内置Conda、Python 3.8

python random-forest cudf

2022-01-12T16:38:15.523

0 投票

0 回答

14 浏览

dataframe - 如何使用 GPU Tesla K80 在 google colab 上安装 cuDF？

我试图在 Google Colab 上安装 cuDF 几个小时。我应该使用GPU Tesla T4安装 cuDF 的要求之一。虽然 google colab 每次都会给我GPU Tesla K80，但我无法安装 cuDF。我尝试了这段代码来检查我每次使用的 GPU 类型：

通过 google colab 获得特定类型的 GPU 太令人沮丧了，因为这是一种运气。我在这里问是否有人遇到过同样的问题，以及如何解决的？

dataframe installation gpu google-colaboratory cudf

2022-02-28T12:18:22.137

0 投票

0 回答

8 浏览

rapids - TypeError：字段元组的第一个元素既不是元组也不是str

我是 RAPIDS 的初学者。我正在尝试在 Colab 上运行以下代码。这会导致错误。TypeError：字段元组的第一个元素既不是元组也不是str

Similari 代码在使用 pandas 时运行良好。但是在使用 cudf 数据框时失败了。

代码片段：

dataSet_formatted_gdf = cudf.DataFrame.from_pandas(pd.DataFrame(dataset_formatted_npa)) conAttrs_gdf = cudf.DataFrame.from_pandas(pd.DataFrame(dataset_formatted_npa[:, 0]))

equivalenceClasses = conAttrs_gdf.groupby(0).groups

decAttr_gdf = cudf.DataFrame.from_pandas(pd.DataFrame(dataset_formatted_npa[:, 1]))

决策类 = decAttr_gdf.groupby(0).groups

rough_membership_values = dataSet_formatted_gdf.apply(lambda x: calculateRoughMembership(x, equivalenceClasses, decisionClasses), axis=1)

rapids cudf

2022-03-04T11:07:58.663

1 2 3 4 5 6 7 8 9 10

问题标签 [cudf]

cupy - 如何在没有数据类型错误的情况下使用 cuml 进行网格搜索？

python - 日期时间与 cudf 和 pandas 的兼容性，用于在 python 中过滤日期时间

python - 如何将 cuDF 数据框传递给 cuML.ensemble.RandomForestClassifier？

dataframe - 如何使用 GPU Tesla K80 在 google colab 上安装 cuDF？

rapids - TypeError：字段元组的第一个元素既不是元组也不是str

Reference