Abstract: Highlights•We utilize grammar and commonsense knowledge for compositional action recognition.•Our method builds unseen “action-objects” compositions to improve generalization.•We inject structured commonsense into label to verify the rationality.•Our method achieves excellent results in both compositional and few-shot settings.
Loading