Q\* is the optimal (ie, correct for the decision problem) function computing the...

NhanH · on Nov 23, 2023

That is just Q. The asterisk is new

sudosysgen · on Nov 23, 2023

Depends on the notation. Sometimes Q* is used to denote optimality, for example here : https://www.cs.toronto.edu/~jlucas/teaching/csc411/lectures/... , page 31

debugnik · on Nov 23, 2023

The "star" in A* means optimal (as in actually proven to be optimal, so they could stop publishing A^[whatever] algorithms). I assume either Q* is considered optimal in a way regular Q-learning isn't, or they're mixing Q-learning with some A*-like search algorithm. (Or someone picked an undeserving name.)