The reasoning: In the current frame, you are facing a birch log, which is part of a tree. The task is to chop a tree. Since the birch log is directly in front of you, the next action to perform is to attack (or chop) the block. You have not yet obtained the target material, so continuing to chop this visible log makes sense to fulfill the task. There is no need for additional camera movement at this moment, as the target is already in view, next action: attack, and next frame: 