Skip to content

Latest commit

 

History

History
51 lines (37 loc) · 2.05 KB

README.md

File metadata and controls

51 lines (37 loc) · 2.05 KB

Latent-DIffusion-Transformer

This is an implementation of Latent Diffusion Model from scratch.

Usage Guide

1. Download the Repository

2. Install all the dependencies using the following line of code:

pip install -r requirements.txt

3. Download pretrained weights and tokenizer files

2. Open the Notebook

  • Go to the sd folder and open the demo.ipynb notebook.

3. Text-to-Image Generation

  • Modify the prompt variable to specify the image you want to generate.
  • (Optional) Use uncond_prompt to exclude certain elements from the generated image.

4. Image-to-Image Generation

  • Create an images folder in the root directory.
  • Add your desired image to this folder.
  • Set the image_path variable to the relative path of the image.
  • Uncomment the following line:
    input_image = Image.open(image_path)
  • Optimize Performance If your system supports CUDA (for NVIDIA GPUs) or MPS (for Apple Silicon), enable faster processing by setting the corresponding constant to True: ALLOW_CUDA = True # Enable CUDA for NVIDIA GPUs OR ALLOW_MPS = True # Enable MPS for Apple Silicon (M1/M2)

Example

Prompt: A very high quality image of a scenic view containing mountains and a river flowing between them, 8k resolution.

output

Huge Thanks to the following repositories

  1. https://github.com/CompVis/stable-diffusion/
  2. https://github.com/divamgupta/stable-diffusion-tensorflow
  3. https://github.com/kjsman/stable-diffusion-pytorch
  4. https://github.com/huggingface/diffusers/
  5. https://github.com/hkproj/pytorch-stable-diffusion/tree/main
  6. https://github.com/haideraqeeb/latent-diffusion/