InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI.git synced 2025-01-07 03:17:05 +08:00

Author	SHA1	Message	Date
Ryan Dick	7214d4969b	Workaround a weird quirk of QuantState.to() and add a unit test to exercise it.	2024-12-24 14:32:11 +00:00
Ryan Dick	a83a999b79	Reduce peak memory used for unit tests.	2024-12-24 14:32:11 +00:00
Ryan Dick	f8a6accf8a	Fix bitsandbytes imports to avoid ImportErrors on MacOS.	2024-12-24 14:32:11 +00:00
Ryan Dick	f8ab414f99	Add CachedModelOnlyFullLoad to mirror the CachedModelWithPartialLoad for models that cannot or should not be partially loaded.	2024-12-24 14:32:11 +00:00
Ryan Dick	c6795a1b47	Make CachedModelWithPartialLoad work with models that have non-persistent buffers.	2024-12-24 14:32:11 +00:00
Ryan Dick	0a8fc74ae9	Add CachedModelWithPartialLoad to manage partially-loaded models using the new autocast modules.	2024-12-24 14:32:11 +00:00
Ryan Dick	dc54e8763b	Add CustomInvokeLinearNF4 to enable CPU -> GPU streaming for InvokeLinearNF4 layers.	2024-12-24 14:32:11 +00:00
Ryan Dick	1b56020876	Add CustomInvokeLinear8bitLt layer for device streaming with InvokeLinear8bitLt layers.	2024-12-24 14:32:11 +00:00
Ryan Dick	3f990393a1	Simplify the state management in InvokeLinear8bitLt and add unit tests. This is in preparation for wrapping it to support streaming of weights from cpu to gpu.	2024-12-24 14:32:11 +00:00
Ryan Dick	97d56f7dc9	Add torch module autocast unit test for GGUF-quantized models.	2024-12-24 14:32:11 +00:00
Ryan Dick	fe0ef2c27c	Add torch module autocast utilities.	2024-12-24 14:32:11 +00:00
Ryan Dick	65fcbf5f60	Bump bitsandbytes. The new version contains improvements to state_dict loading/saving for LLM.int8 and promises improved speed on some HW.	2024-12-24 14:32:11 +00:00
Ryan Dick	d3916dbdb6	Partial Loading PR1: Tidy ModelCache (#7492) ## Summary This PR tidies up the model cache code in preparation for further refactoring to support partial loading of models onto the GPU. These code changes should not change the functional behavior in any way. Changes: - Remove the `ModelCacheBase` class. `ModelCache` is the only implementation, so there is no benefit to the separate abstract class. - Split `CacheRecord` and `CacheStats` out into their own files. - Remove the `ModelLocker` class. This extra layer of indirection was not providing any benefit. Locking is now done directly with the `ModelCache`. - Tidy up relative imports that were contributing to circular import issues. - Pull the 'submodel' concern out of the `ModelCache`. The `ModelCache` should not need to be aware of the model manager submodel system. - Delete unused properties from the `ModelCache` (e.g. `.lazy_offloading`, `.storage_device`, etc.) ## QA Instructions I ran smoke tests with a variety of SD1, SDXL and FLUX models. No change to behavior is expected. ## Checklist - [x] _The PR has a short but descriptive title, suitable for a changelog_ - [x] _Tests added / updated (if applicable)_ - [x] _Documentation added / updated (if applicable)_ - [ ] _Updated `What's New` copy (if doing a release after this PR)_	2024-12-24 09:30:44 -05:00
Ryan Dick	55b13c1da3	(minor) Add TODO comment regarding the location of get_model_cache_key().	2024-12-24 14:23:19 +00:00
Ryan Dick	7dc3e0fdbe	Get rid of ModelLocker. It was an unnecessary layer of indirection.	2024-12-24 14:23:18 +00:00
Ryan Dick	a39bcf7e85	Move lock(...) and unlock(...) logic from ModelLocker to the ModelCache and make a bunch of ModelCache properties/methods private.	2024-12-24 14:23:18 +00:00
Ryan Dick	a7c72992a6	Pull get_model_cache_key(...) out of ModelCache. The ModelCache should not be concerned with implementation details like the submodel_type.	2024-12-24 14:23:18 +00:00
Ryan Dick	d30a9ced38	Rename model_cache_default.py -> model_cache.py.	2024-12-24 14:23:18 +00:00
Ryan Dick	e0bfa6157b	Remove ModelCacheBase.	2024-12-24 14:23:18 +00:00
Ryan Dick	83ea6420e2	Move CacheStats to its own file.	2024-12-24 14:23:18 +00:00
Ryan Dick	ce11a1952e	Move CacheRecord out to its own file.	2024-12-24 14:23:18 +00:00
Ryan Dick	e48dee4c4a	Rip out ModelLockerBase.	2024-12-24 14:23:18 +00:00
Simon Fuhrmann	712674b6dd	Add Stereogram Nodes to communityNodes.md	2024-12-23 13:51:53 -05:00
psychedelicious	de0043f443	docs: update download links for launcher	2024-12-23 13:23:14 +11:00
Riku	d21506da6f	feat(ci): add typegen check workflow	2024-12-22 06:05:17 +11:00
psychedelicious	a49894901a	docs: fix installation docs home again	2024-12-20 17:35:50 +11:00
psychedelicious	e7e26c8a93	docs: fix installation docs home	2024-12-20 17:12:44 +11:00
psychedelicious	9adcd2cc31	docs: update install-related docs	2024-12-20 17:01:34 +11:00
Kent Keirsey	f9edd009f5	Update README.md	2024-12-20 17:01:34 +11:00
Kent Keirsey	91a4160e36	Update Installation Docs	2024-12-20 17:01:34 +11:00
Kent Keirsey	9c9cec1b43	Update README.md	2024-12-20 17:01:34 +11:00
psychedelicious	948ecf9333	chore: bump version to v5.5.0	2024-12-20 16:17:23 +11:00
psychedelicious	1038f7bcab	Update invokeai_version.py	2024-12-20 10:17:09 +11:00
Riccardo Giovanetti	c7d9e2d62a	translationBot(ui): update translation (Italian) Currently translated at 99.3% (1635 of 1645 strings) translationBot(ui): update translation (Italian) Currently translated at 99.3% (1634 of 1645 strings) Co-authored-by: Riccardo Giovanetti <riccardo.giovanetti@gmail.com> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/it/ Translation: InvokeAI/Web UI	2024-12-20 10:07:15 +11:00
Riku	11c3a2e15d	translationBot(ui): update translation (German) Currently translated at 70.8% (1165 of 1645 strings) Co-authored-by: Riku <riku.block@gmail.com> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/de/ Translation: InvokeAI/Web UI	2024-12-20 10:07:15 +11:00
psychedelicious	9e3ca383ec	fix(ui): add missing model config to AnyModelConfig union type	2024-12-20 09:45:04 +11:00
Riku	bda83c2634	chore(ui): update typegen schema	2024-12-20 09:45:04 +11:00
Riku	525cb38c71	fix(app): fixed InputField default values	2024-12-20 09:30:56 +11:00
psychedelicious	a9a6720bad	feat(app): change queue item execution log from debug to info This provides useful context for subsequent logs during queue item execution.	2024-12-20 09:19:04 +11:00
psychedelicious	858bf9cf8c	feat(api): less verbose uvicorn logs Uvicorn's logging is rather verbose. This change adds a `log_level_network` config setting to independently control uvicorn's log outputs. The setting defaults to warning. The change hides the helpful startup message that says the host and port we are running on. For example: `Uvicorn running on http://0.0.0.0:9090 (Press CTRL+C to quit)` The ASGI lifespan handler is updated to log an equivalent message on startup, regardless of log level settings. Besides being helpful, the launcher relies on a message like this to launch the app. So, previously, if the user set their log level to anything above info (e.g. warning or error), the launcher would fail to open the app. This change prevents that edge case.	2024-12-20 09:19:04 +11:00
David Hauptman	74a29c3735	re-format to fix ruff error	2024-12-19 22:33:17 +11:00
David Hauptman	6fc6be3aa0	Fix error message when adding a local path with quotes around the string	2024-12-19 22:33:17 +11:00
Mary Hipp	174ea021a6	lint	2024-12-18 12:48:15 -05:00
Mary Hipp	50b804e087	remove space	2024-12-18 12:48:15 -05:00
Mary Hipp	23270d7dfe	update copy again	2024-12-18 12:48:15 -05:00
Mary Hipp	39e6f6d53f	update what's new copy for control LoRAs	2024-12-18 12:48:15 -05:00
Mary Hipp	c154d833b9	raise error if control lora used with schnell	2024-12-18 10:19:28 -05:00
Mary Hipp	899a00af62	fix double filter on slow networks	2024-12-18 08:40:50 -05:00
Hosted Weblate	7c9ecdb362	translationBot(ui): update translation files Updated by "Cleanup translation files" hook in Weblate. Co-authored-by: Hosted Weblate <hosted@weblate.org> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/ Translation: InvokeAI/Web UI	2024-12-18 18:05:42 +11:00
Riccardo Giovanetti	4a5255611b	translationBot(ui): update translation (Italian) Currently translated at 99.3% (1634 of 1644 strings) Co-authored-by: Riccardo Giovanetti <riccardo.giovanetti@gmail.com> Translate-URL: https://hosted.weblate.org/projects/invokeai/web-ui/it/ Translation: InvokeAI/Web UI	2024-12-18 18:05:42 +11:00

15216 Commits