Visual Computing

University of Konstanz
ACM Transactions on Graphics

ProSpect: Prompt Spectrum for Attribute-Aware Personalization of Diffusion Models

Y. Zhang, W. Dong, F. Tang, N. Huang, H. Huang, C. Ma, T. Lee, O. Deussen, C. Xu
Teaser of ProSpect: Prompt Spectrum for Attribute-Aware Personalization of Diffusion Models

Material

Paper (.pdf, 16.0MB)

Abstract

Personalizing generative models offers a way to guide image generation with user-provided references. Current personalization methods can invert an object or concept into the textual conditioning space and compose new natural sentences for text-to-image diffusion models. However, representing and editing specific visual attributes such as material, style, and layout remains a challenge, leading to a lack of disentanglement and editability. To address this problem, we propose a novel approach that leverages the step-by-step generation process of diffusion models, which generate images from low to high frequency information, providing a new perspective on representing, generating, and editing images. We develop the Prompt Spectrum Space P*, an expanded textual conditioning space, and a new image representation method called ProSpect. ProSpect represents an image as a collection of inverted textual token embeddings encoded from per-stage prompts, where each prompt corresponds to a specific generation stage (i.e., a group of consecutive steps) of the diffusion model. Experimental results demonstrate that P* and ProSpect offer better disentanglement and controllability compared to existing methods. We apply ProSpect in various personalized attribute-aware image generation applications, such as image-guided or text-driven manipulations of materials, style, and layout, achieving previously unattainable results from a single image input without fine-tuning the diffusion models.

BibTeX

@article{zhang2023prospect,
  author    = {Y. Zhang, W. Dong, F. Tang, N. Huang, H. Huang, C. Ma, T. Lee, O. Deussen, C. Xu},
  doi       = {10.1145/3618342},
  issn      = {1557-7368},
  journal   = {ACM Transactions on Graphics},
  month     = {December},
  number    = {6},
  pages     = {1--14},
  publisher = {Association for Computing Machinery (ACM)},
  title     = {ProSpect: Prompt Spectrum for Attribute-Aware Personalization of Diffusion Models},
  volume    = {42},
  year      = {2023}
}