This version of Stable Diffusion features a slick WebGUI, an interactive command-line script that combines text2img and img2img functionality in a "dream bot" style interface, and multiple features and other enhancements. For more info, see the website link below.
Go to file
2022-11-02 18:29:34 -04:00
.dev_scripts Replace --full_precision with --precision that works even if not specified 2022-09-20 17:08:00 -04:00
.github add release-candidate-branch to mkdocs action 2022-11-02 18:17:16 -04:00
assets add missing image needed by nsfw filter 2022-10-25 00:39:00 -04:00
backend webgui working again 2022-11-02 18:07:18 -04:00
configs fix models example weights for sd-v1.4 2022-10-31 21:35:09 -04:00
data
docker-build update entrypoint 2022-10-27 17:06:50 -04:00
docs updated documentation 2022-11-02 17:28:50 -04:00
frontend copy dev frontend code over again 2022-11-02 17:56:30 -04:00
installer more bug fixes to install scripts 2022-11-02 15:26:02 -04:00
ldm use refined model by default 2022-11-02 18:29:34 -04:00
models/ldm/stable-diffusion-v1 documentation and usability fixes 2022-10-29 10:37:38 -04:00
notebooks add a strength value to inpaint_replace 2022-10-16 10:06:47 -04:00
scripts more bug fixes to install scripts 2022-11-02 15:26:02 -04:00
server add option to show intermediate latent space 2022-11-02 17:53:11 -04:00
static Generalize facetool strength argument 2022-10-14 00:03:06 -04:00
tests add tests/validate_pr_prompt.txt 2022-10-28 13:47:45 -04:00
.dockerignore add .dockerignore to repo-root 2022-10-27 17:06:50 -04:00
.gitattributes
.gitignore preload_models.py script downloads the weight files 2022-10-29 01:02:45 -04:00
.gitmodules
.prettierrc.yaml change printWidth for markdown files to 80 2022-09-17 02:23:00 +02:00
environment-linux-aarch64.yml more bug fixes to install scripts 2022-11-02 15:26:02 -04:00
environment-mac.yml more bug fixes to install scripts 2022-11-02 15:26:02 -04:00
environment.yml remove antlr4 from requirements 2022-11-02 16:35:14 -04:00
invoke.bat Open the developer console on windows, and print some debugging info 2022-10-29 23:26:21 +05:30
invoke.sh Script to create the installer zips 2022-10-29 23:40:03 +05:30
LICENSE Update license again 2022-10-14 16:40:35 -04:00
LICENSE-ModelWeights.txt
main.py fix a number of bugs in textual inversion 2022-10-21 16:35:35 +02:00
mkdocs.yml more updates to many docs, including: 2022-10-11 21:41:52 -04:00
pyproject.toml.hide Fix Mac Issue #723 2022-09-21 13:42:47 -04:00
README.md Fix typo in docs: s/Formally/Formerly 2022-10-20 02:44:16 -04:00
requirements-lin-AMD.txt
requirements-lin-win-colab-CUDA.txt
requirements-linux-arm64.txt add support for safety checker (NSFW filter) 2022-10-23 22:26:18 -04:00
requirements-mac-MPS-CPU.txt Install older version of torch and matching torchvision, fix pytorch-lightning=1.7.7 2022-11-02 14:49:36 -04:00
requirements-mkdocs.txt update requirements-mkdocs.txt 2022-09-19 08:38:46 +02:00
requirements.txt more bug fixes to install scripts 2022-11-02 15:26:02 -04:00
setup.py update requirements to address #1149 2022-10-18 16:28:58 -04:00
shell.nix nix: add shell.nix file 2022-10-25 07:08:31 -04:00
Stable_Diffusion_v1_Model_Card.md
update.bat more bug fixes to install scripts 2022-11-02 15:26:02 -04:00
update.sh more bug fixes to install scripts 2022-11-02 15:26:02 -04:00

InvokeAI: A Stable Diffusion Toolkit

Formerly known as lstein/stable-diffusion

project logo

discord badge

latest release badge github stars badge github forks badge

CI checks on main badge CI checks on dev badge latest commit to dev badge

github open issues badge github open prs badge

This is a fork of CompVis/stable-diffusion, the open source text-to-image generator. It provides a streamlined process with various new features and options to aid the image generation process. It runs on Windows, Mac and Linux machines, with GPU cards with as little as 4 GB of RAM. It provides both a polished Web interface (see below), and an easy-to-use command-line interface.

Quick links: [Discord Server] [Documentation and Tutorials] [Code and Downloads] [Bug Reports] [Discussion, Ideas & Q&A]

Note: This fork is rapidly evolving. Please use the Issues tab to report bugs and make feature requests. Be sure to use the provided templates. They will help aid diagnose issues faster.

Table of Contents

  1. Installation
  2. Hardware Requirements
  3. Features
  4. Latest Changes
  5. Troubleshooting
  6. Contributing
  7. Contributors
  8. Support
  9. Further Reading

Installation

This fork is supported across multiple platforms. You can find individual installation instructions below.

Hardware Requirements

System

You wil need one of the following:

  • An NVIDIA-based graphics card with 4 GB or more VRAM memory.
  • An Apple computer with an M1 chip.

Memory

  • At least 12 GB Main Memory RAM.

Disk

  • At least 12 GB of free disk space for the machine learning model, Python, and all its dependencies.

Note

If you have a Nvidia 10xx series card (e.g. the 1080ti), please run the dream script in full-precision mode as shown below.

Similarly, specify full-precision mode on Apple M1 hardware.

Precision is auto configured based on the device. If however you encounter errors like 'expected type Float but found Half' or 'not implemented for Half' you can try starting invoke.py with the --precision=float32 flag:

(ldm) ~/stable-diffusion$ python scripts/invoke.py --precision=float32

Features

Major Features

Other Features

Latest Changes

  • v2.0.1 (13 October 2022)

    • fix noisy images at high step count when using k* samplers
    • dream.py script now calls invoke.py module directly rather than via a new python process (which could break the environment)
  • v2.0.0 (9 October 2022)

    • dream.py script renamed invoke.py. A dream.py script wrapper remains for backward compatibility.
    • Completely new WebGUI - launch with python3 scripts/invoke.py --web
    • Support for inpainting and outpainting
    • img2img runs on all k* samplers
    • Support for negative prompts
    • Support for CodeFormer face reconstruction
    • Support for Textual Inversion on Macintoshes
    • Support in both WebGUI and CLI for post-processing of previously-generated images using facial reconstruction, ESRGAN upscaling, outcropping (similar to DALL-E infinite canvas), and "embiggen" upscaling. See the !fix command.
    • New --hires option on invoke> line allows larger images to be created without duplicating elements, at the cost of some performance.
    • New --perlin and --threshold options allow you to add and control variation during image generation (see Thresholding and Perlin Noise Initialization
    • Extensive metadata now written into PNG files, allowing reliable regeneration of images and tweaking of previous settings.
    • Command-line completion in invoke.py now works on Windows, Linux and Mac platforms.
    • Improved command-line completion behavior. New commands added:
      • List command-line history with !history
      • Search command-line history with !search
      • Clear history with !clear
    • Deprecated --full_precision / -F. Simply omit it and invoke.py will auto configure. To switch away from auto use the new flag like --precision=float32.

For older changelogs, please visit the CHANGELOG.

Troubleshooting

Please check out our Q&A to get solutions for common installation problems and other issues.

Contributing

Anyone who wishes to contribute to this project, whether documentation, features, bug fixes, code cleanup, testing, or code reviews, is very much encouraged to do so. If you are unfamiliar with how to contribute to GitHub projects, here is a Getting Started Guide.

A full set of contribution guidelines, along with templates, are in progress, but for now the most important thing is to make your pull request against the "development" branch, and not against "main". This will help keep public breakage to a minimum and will allow you to propose more radical changes.

Contributors

This fork is a combined effort of various people from across the world. Check out the list of all these amazing people. We thank them for their time, hard work and effort.

Support

For support, please use this repository's GitHub Issues tracking service. Feel free to send me an email if you use and like the script.

Original portions of the software are Copyright (c) 2020 Lincoln D. Stein

Further Reading

Please see the original README for more information on this software and underlying algorithm, located in the file README-CompViz.md.