c++ - 了解 Negamax 的约束

Question

这段代码是为计算 tictactoe 游戏中某个位置的 bestMove 而构建的。我几乎得到了代码的每一部分，除了 for 循环中的条件，它表示 minRating != LOSING_POSITION。此代码来自给定伪代码的实现。

moveT FindBestMove(stateT state, int depth, int & rating) {
for (*each possible move or until you find a forced win*) {
 *Make the move.
 Evaluate the resulting position, adding one to the depth indicator.
 Keep track of the minimum rating so far, along with the corresponding move.
 Retract the move to restore the original state.*
 }
*Store the move rating into the reference parameter.
Return the best move.*
}

我无法将 for 循环的第二个条件与给定的代码匹配，直到你找到一个强制获胜。我找不到这个事实和那个 minRating != LOSING_POSITION 之间的相似之处

moveT FindBestMove(stateT state, int depth, int & rating) {
Vector<moveT> moveList;
GenerateMoveList(state, moveList);
int nMoves = moveList.size();
if (nMoves == 0) Error("No moves available");
moveT bestMove;

int minRating = WINNING_POSITION + 1;

for (int i = 0; i < nMoves && minRating != LOSING_POSITION; i++) {

 moveT move = moveList[i];
 MakeMove(state, move);
 int curRating = EvaluatePosition(state, depth + 1);

 if (curRating < minRating) {
  bestMove = move;
  minRating = curRating;
  }

 RetractMove(state, move);
 }
rating = -minRating;
return bestMove;

}


int EvaluatePosition(stateT state, int depth) {
int rating;

if (GameIsOver(state) || depth >= MAX_DEPTH) {
 return EvaluateStaticPosition(state);
}

FindBestMove(state, depth, rating);
return rating;
}

score 1 · Accepted Answer

您的程序从分配WINNING_POSITION（我想为您的对手获胜）开始minRating，然后循环移动，试图找到具有最大伤害的移动，最小化minRating.

当EvaluatePosition返回LOSING_POSITION比意味着这个动作在任何情况下都会导致你的对手失败，因此，可以终止搜索并且这个动作被认为是最好的动作。

如果没有明显LOSING_POSITIONS的，那么您的算法会根据静态评估选择“最佳”移动。

c++ - 了解 Negamax 的约束

1 回答 1

Related

Reference