Abstract: Highlights
• A general framework for multistage context learning and utilization in computer vision.
• Applicable to various visual detection tasks.
• An automation mechanism that minimizes the modifications needed to use different deep learning models.
• Capability to use different types of context information (local, semantic, spatial) at various stages of the detection process.