Abstract: Highlights
• A novel self-supervised text-to-image synthesis approach with label augmentation.
• A label augmented discriminator is introduced for goal consistency among tasks.
• A perceptual loss is introduced to capture accurate features in rotated samples.
• A cycleGAN architecture is introduced to improve the image and text alignment.
External IDs:dblp:journals/eswa/TanLLL26