Abstract: Highlights•A novel MFIF method using Vision Transformers to predict focus map.•A new transformer block to model global connectivity across input images.•The proposed model does not require post-processing.•SOTA results on standard benchmark datasets.
Loading