python - 纸浆：每组仅使用一项

Question

我有一个具有以下值的 Pandas 数据框：

     Name    Age    City    Points
1    John    24     CHI     35
2    Mary    18     NY      25
.
.
80   Steve   30     CHI     32

我正在尝试组建一个 5 人小组，以最大限度地提高积分总和。我想有两个限制：年龄和城市。最大年龄必须在 110 岁以下，并且不能有两个人来自同一个城市。

目前我有一个脚本可以最大化积分并考虑年龄限制：

x = pulp.LpVariable.dicts("x", df.index, cat='Integer', lowBound=0)
mod = pulp.LpProblem("prog", pulp.LpMaximize)

objvals_p = {idx: (df['Points'][idx]) for idx in df.index}
mod += sum([x[idx]*objvals_p[idx] for idx in df.index])

objvals_a = {idx: (df['Age'][idx]) for idx in df.index}
mod += pulp.lpSum([x[idx]*objvals_a[idx] for idx in df.index]) < 110

但是我不知道如何将城市约束添加到我的脚本中。

对我有什么建议吗？

谢谢！

score 3 · Accepted Answer

你可以这样做：

for city in df['City'].unique():
    sub_idx = df[df['City']==city].index
    mod += pulp.lpSum([x[idx] for idx in sub_idx]) <= 1

对于 DataFrame 中的每个城市，这个总和超过了 DataFrame 的一个子集（由 sub_idx 索引），这个总和应该小于或等于 1，因为来自同一城市的 2 个人不能在团队中。

为了使这个（和您的其他约束）起作用，您需要更改决策变量的定义。它应该是二进制的；完整性是不够的。

x = pulp.LpVariable.dicts("x", df.index, 0, 1, pulp.LpInteger)

python - 纸浆：每组仅使用一项

1 回答 1

Related

Reference