Bundle Classic AI algorithms: graph search, adversarial game search, optimization and tabular reinforcement learning (-lib ai)
StepResult
One environment transition: the next state, the reward received, and whether the episode ended.
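As a rough illustration of the shape such a record might take, the sketch below models a transition as an immutable Python dataclass. The field names `next_state`, `reward`, and `done`, and the helpers `corridor_step` and `run_episode`, are hypothetical and chosen only for this example; the library's actual definition may differ.

```python
from dataclasses import dataclass
from typing import Generic, TypeVar

S = TypeVar("S")  # type of an environment state


@dataclass(frozen=True)
class StepResult(Generic[S]):
    """One environment transition (field names are assumptions)."""
    next_state: S   # state reached after taking the action
    reward: float   # reward received for this transition
    done: bool      # True if the episode ended on this step


def corridor_step(state: int, action: int, goal: int = 3) -> StepResult[int]:
    """Hypothetical 1-D corridor: positions 0..goal, reward 1.0 on reaching goal."""
    next_state = max(0, min(goal, state + action))  # clamp to the corridor
    done = next_state == goal
    return StepResult(next_state, 1.0 if done else 0.0, done)


def run_episode(start: int = 0):
    """Always move right until the episode ends; return (steps, total reward)."""
    state, total, steps = start, 0.0, 0
    while True:
        result = corridor_step(state, +1)
        total += result.reward
        steps += 1
        if result.done:
            return steps, total
        state = result.next_state
```

Bundling the three values in one record lets an agent loop read `result.done` and `result.reward` by name rather than unpacking a positional tuple.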