RD-FGM: A novel model for high-quality and diverse food image generation and ingredient classification

Published: 01 Jan 2024, Last Modified: 04 Mar 2025. Venue: Expert Syst. Appl. 2024. License: CC BY-SA 4.0
Abstract: Highlights
• We develop the RD-FGM method, which optimizes food image generation and the multi-modal alignment of recipes and images.
• We introduce RecipeCLIP, which aligns features from images and recipes for comprehensive ingredient embedding.
• We devise a guided attention mechanism for multi-modal diffusion that controls generation with U-Net transformers.
• We validate RD-FGM's efficiency and downstream-task scalability across multiple datasets, achieving optimal performance.
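The highlights do not say how RecipeCLIP aligns image and recipe features. As a rough illustration only, the sketch below assumes a generic CLIP-style symmetric contrastive objective over paired embeddings; this is an assumption about the general technique, not the authors' stated method. The function names, the `temperature` default, and the toy vectors are all hypothetical.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def log_softmax_row(row):
    """Numerically stable log-softmax over one row of logits."""
    m = max(row)
    lse = m + math.log(sum(math.exp(x - m) for x in row))
    return [x - lse for x in row]


def symmetric_contrastive_loss(image_embs, recipe_embs, temperature=0.07):
    """Generic CLIP-style loss (assumed, not taken from the paper).

    image_embs[k] and recipe_embs[k] are treated as a matching pair;
    every other combination in the batch is treated as a negative.
    """
    imgs = [l2_normalize(v) for v in image_embs]
    recs = [l2_normalize(v) for v in recipe_embs]
    n = len(imgs)

    # Pairwise cosine similarities, sharpened by the temperature.
    logits = [
        [sum(a * b for a, b in zip(i, r)) / temperature for r in recs]
        for i in imgs
    ]

    # Image-to-recipe direction: each row should peak on its diagonal entry.
    loss_i2r = -sum(log_softmax_row(logits[k])[k] for k in range(n)) / n

    # Recipe-to-image direction: same computation on the transposed matrix.
    logits_t = [list(col) for col in zip(*logits)]
    loss_r2i = -sum(log_softmax_row(logits_t[k])[k] for k in range(n)) / n

    return 0.5 * (loss_i2r + loss_r2i)
```

Under this assumed objective, a batch whose image and recipe embeddings match pair by pair yields a loss near zero, while shuffling the recipes against the images raises it sharply.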
