The reasoning: In the current frame, you are facing a birch log which is part of a tree. Your task is to chop a tree, and since you are already looking directly at the target block (birch log), the next logical action is to continue attacking the block to obtain the wood.

No camera movement is needed as you have the target directly in view. Continue the action until the block is collected, next action: attack, and next frame: 