## Summary

This PR is the third in a sequence of PRs working towards support for partial loading of models onto the compute device (for low-VRAM operation). This PR updates the LoRA patching code so that the following features can cooperate fully:

- Partial loading of weights onto the GPU
- Quantized layers / weights
- Model patches (e.g. LoRA)

Note that this PR does not yet enable partial loading. It adds support in the model patching code so that partial loading can be enabled in a future PR.

## Technical Design Decisions

The layer patching logic has been integrated into the custom layers (via `CustomModuleMixin`) rather than being kept in a separate set of wrapper layers, as before. This has the following advantages:

- It makes it easier to calculate the modified weights on the fly and then reuse the normal `forward()` logic.
- In the future, it makes it possible to pass original parameters that have already been cast to the device down to the LoRA calculation without having to re-cast them (though the current implementation does not yet take full advantage of this).

A rough sketch of this approach is included below, after the merge plan.

## Known Limitations

1. Device management is not fully solved for patch types that require the original layer value to calculate the patch. These are not very common, and are not compatible with some quantized layers, so this is left for the future if there is demand.
2. There is a small speed regression for models that have CPU bottlenecks. This appears to be caused by slightly slower method resolution on the custom layer subclasses. The regression does not show up on larger models, like FLUX, that are almost entirely GPU-limited. I think this small regression is tolerable, but if we decide that it's not, the slowdown can easily be reclaimed by optimizing other CPU operations (e.g. if we only sent every 2nd progress image, we'd see a much more significant speedup).

## Related Issues / Discussions

- https://github.com/invoke-ai/InvokeAI/pull/7492
- https://github.com/invoke-ai/InvokeAI/pull/7494

## QA Instructions

Speed tests:

- Vanilla SD1 speed regression
  - Before: 3.156s (8.78 it/s)
  - After: 3.54s (8.35 it/s)
- Vanilla SDXL speed regression
  - Before: 6.23s (4.46 it/s)
  - After: 6.45s (4.31 it/s)
- Vanilla FLUX speed regression
  - Before: 12.02s (2.27 it/s)
  - After: 11.91s (2.29 it/s)

LoRA tests with default configuration:

- [x] SD1: a handful of LoRA variants
- [x] SDXL: a handful of LoRA variants
- [x] FLUX non-quantized: multiple LoRA variants
- [x] FLUX bnb-quantized: multiple LoRA variants
- [x] FLUX ggml-quantized: multiple LoRA variants
- [x] FLUX non-quantized: FLUX control LoRA
- [x] FLUX bnb-quantized: FLUX control LoRA
- [x] FLUX ggml-quantized: FLUX control LoRA

LoRA tests with sidecar patching forced:

- [x] SD1: a handful of LoRA variants
- [x] SDXL: a handful of LoRA variants
- [x] FLUX non-quantized: multiple LoRA variants
- [x] FLUX bnb-quantized: multiple LoRA variants
- [x] FLUX ggml-quantized: multiple LoRA variants
- [x] FLUX non-quantized: FLUX control LoRA
- [x] FLUX bnb-quantized: FLUX control LoRA
- [x] FLUX ggml-quantized: FLUX control LoRA

Other:

- [x] Smoke testing of IP-Adapter and ControlNet

All tests repeated on:

- [x] cuda
- [x] cpu (SD1 only, because larger models are prohibitively slow)
- [x] mps (FLUX tests skipped, because my Mac doesn't have enough memory to run them in a reasonable amount of time)

## Merge Plan

No special instructions.
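Below is a minimal, illustrative sketch of the patching approach described in the Technical Design Decisions section. Only the `CustomModuleMixin` name comes from this PR; the helper names (`add_patch`, `get_weight_delta`) are hypothetical and are not the actual InvokeAI API. The sketch only shows the general idea: the layer itself carries its patches, computes the patched weight on the fly, and then reuses the standard `forward()` logic.

```python
import torch


class CustomModuleMixin:
    """Mixin that lets a layer carry weight patches (e.g. LoRA deltas).

    Hypothetical sketch: patch objects are assumed to expose a
    get_weight_delta() method returning a tensor shaped like the base weight.
    """

    def __init__(self, *args, **kwargs):
        super().__init__(*args, **kwargs)
        self._patches_and_weights = []  # list of (patch, patch_weight) pairs

    def add_patch(self, patch, patch_weight: float) -> None:
        self._patches_and_weights.append((patch, patch_weight))

    def clear_patches(self) -> None:
        self._patches_and_weights = []


class CustomLinear(CustomModuleMixin, torch.nn.Linear):
    def forward(self, x: torch.Tensor) -> torch.Tensor:
        weight = self.weight
        # Compute the patched weight on the fly, casting each delta to the
        # device/dtype of the (possibly partially loaded) base weight.
        for patch, patch_weight in self._patches_and_weights:
            delta = patch.get_weight_delta()  # hypothetical patch API
            weight = weight + patch_weight * delta.to(weight.device, dtype=weight.dtype)
        # Reuse the standard linear forward logic with the patched weight.
        return torch.nn.functional.linear(x, weight, self.bias)
```

Because the patch application lives inside the layer itself rather than in a wrapper module, the same mechanism applies regardless of where the base weight currently resides, which is what allows partial loading, quantized layers, and model patches to cooperate.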
## Checklist

- [x] _The PR has a short but descriptive title, suitable for a changelog_
- [x] _Tests added / updated (if applicable)_
- [x] _Documentation added / updated (if applicable)_
- [ ] _Updated `What's New` copy (if doing a release after this PR)_
# Invoke - Professional Creative AI Tools for Visual Media
To learn more about Invoke, or implement our Business solutions, visit invoke.com
Invoke is a leading creative engine built to empower professionals and enthusiasts alike. Generate and create stunning visual media using the latest AI-driven technologies. Invoke offers an industry-leading web-based UI, and serves as the foundation for multiple commercial products.
Invoke is available in two editions:
| Community Edition | Professional Edition |
|---|---|
| For users looking for a locally installed, self-hosted and self-managed service | For users or teams looking for a cloud-hosted, fully managed service |
| Free to use under a commercially-friendly license | Monthly subscription fee with three different plan levels |
| Download and install on compatible hardware | Offers additional benefits, including multi-user support, improved model training, and more |
| Includes all core studio features: generate, refine, iterate on images, and build workflows | Hosted in the cloud for easy, secure model access and scalability |
| Quick Start -> Installation and Updates | More Information -> www.invoke.com/pricing |
## Documentation

| Quick Links |
|---|
| Installation and Updates - Documentation and Tutorials - Bug Reports - Contributing |
## Installation

To get started with Invoke, download the Installer.

For detailed step-by-step instructions, or for instructions on manual/Docker installations, visit our documentation on Installation and Updates.
## Troubleshooting, FAQ and Support
Please review our FAQ for solutions to common installation problems and other issues.
For more help, please join our Discord.
## Features
Full details on features can be found in our documentation.
### Web Server & UI
Invoke runs a locally hosted web server & React UI with an industry-leading user experience.
### Unified Canvas
The Unified Canvas is a fully integrated canvas implementation with support for all core generation capabilities, in/out-painting, brush tools, and more. This creative tool unlocks the capability for artists to create with AI as a creative collaborator, and can be used to augment AI-generated imagery, sketches, photography, renders, and more.
### Workflows & Nodes

Invoke offers a fully featured workflow management solution, enabling users to combine the power of node-based workflows with the ease of a UI. This allows customizable generation pipelines to be developed and shared by users looking to create specific workflows to support their production use cases.
### Board & Gallery Management

Invoke features an organized gallery system for easily storing, accessing, and remixing your content in the Invoke workspace. Images can be dragged/dropped onto any Image-based UI element in the application, and rich metadata within the Image allows for easy recall of key prompts or settings used in your workflow.
### Other features
- Support for both ckpt and diffusers models
- SD1.5, SD2.0, SDXL, and FLUX support
- Upscaling Tools
- Embedding Manager & Support
- Model Manager & Support
- Workflow creation & management
- Node-Based Architecture
## Contributing
Anyone who wishes to contribute to this project - whether documentation, features, bug fixes, code cleanup, testing, or code reviews - is very much encouraged to do so.
Get started with contributing by reading our contribution documentation, or by joining the #dev-chat or the GitHub discussion board.
We hope you enjoy using Invoke as much as we enjoy creating it, and we hope you will elect to become part of our community.
## Thanks
Invoke is a combined effort of passionate and talented people from across the world. We thank them for their time, hard work and effort.
Original portions of the software are Copyright © 2024 by respective contributors.