FaceChain-MMID: Generating highly identity-consistent realistic portraits via dividing & merging multi-modal representations
Abstract: Highlights•We utilize facial image, masks, prompts for identity, and design a dividing strategy to express them.•We propose merging strategy including designed training pairs, networks, and loss functions to achieve multi-modal fusion.•Experiments show superior resutls than SOTA.
External IDs:dblp:journals/pr/XuWYSZ25
Loading