InvokeAI

mirror of https://github.com/invoke-ai/InvokeAI.git synced 2025-01-07 03:17:05 +08:00

Author	SHA1	Message	Date
psychedelicious	868e06eb8b	tests: fix test_model_install.py	2025-01-03 11:21:23 -05:00
Ryan Dick	9a0a226ce1	Fix bitsandbytes imports in unit tests on MacOS.	2024-12-30 10:41:48 -05:00
Ryan Dick	6fd9b0a274	Delete old sidecar wrapper implementation. This functionality has moved into the custom layers.	2024-12-29 17:33:08 +00:00
Ryan Dick	52fc5a64d4	Add a unit test for a LoRA patch applied to a quantized linear layer with weights streamed from CPU to GPU.	2024-12-29 17:14:55 +00:00
Ryan Dick	a8bef59699	First pass at making custom layer patches work with weights streamed from the CPU to the GPU.	2024-12-29 17:01:37 +00:00
Ryan Dick	6d49ee839c	Switch the LayerPatcher to use 'custom modules' to manage layer patching.	2024-12-29 01:18:30 +00:00
Ryan Dick	2855bb6b41	Update BaseLayerPatch.get_parameters(...) to accept a dict of orig_parameters rather than orig_module. This will enable compatibility between patching and cpu->gpu streaming.	2024-12-28 21:12:53 +00:00
Ryan Dick	918f541af8	Add unit test for a SetParameterLayer patch applied to a CustomFluxRMSNorm layer.	2024-12-28 20:44:48 +00:00
Ryan Dick	93e76b61d6	Add CustomFluxRMSNorm layer.	2024-12-28 20:33:38 +00:00
Ryan Dick	f2981979f9	Get custom layer patches working with all quantized linear layer types.	2024-12-27 22:00:22 +00:00
Ryan Dick	ef970a1cdc	Add support for FluxControlLoRALayer in CustomLinear layers and add a unit test for it.	2024-12-27 21:00:47 +00:00
Ryan Dick	5ee7405f97	Add more unit tests for custom module LoRA patching: multiple LoRAs and ConcatenatedLoRALayers.	2024-12-27 19:47:21 +00:00
Ryan Dick	e24e386a27	Add support for patches to CustomModuleMixin and add a single unit test (more to come).	2024-12-27 18:57:13 +00:00
Ryan Dick	b06d61e3c0	Improve custom layer wrap/unwrap logic.	2024-12-27 16:29:48 +00:00
Ryan Dick	7d6ab0ceb2	Add a CustomModuleMixin class with a flag for enabling/disabling autocasting (since it incurs some runtime speed overhead.)	2024-12-26 20:08:30 +00:00
Ryan Dick	9692a36dd6	Use a fixture to parameterize tests in test_all_custom_modules.py so that a fresh instance of the layer under test is initialized for each test.	2024-12-26 19:41:25 +00:00
Ryan Dick	b0b699a01f	Add unit test to test that isinstance(...) behaves as expected with custom module types.	2024-12-26 18:45:56 +00:00
Ryan Dick	a8b2c4c3d2	Add inference tests for all custom module types (i.e. to test autocasting from cpu to device).	2024-12-26 18:33:46 +00:00
Ryan Dick	03944191db	Split test_autocast_modules.py into separate test files to mirror the source file structure.	2024-12-24 22:29:11 +00:00
Ryan Dick	987c9ae076	Move custom autocast modules to separate files in a custom_modules/ directory.	2024-12-24 22:21:31 +00:00
Ryan Dick	6d7314ac0a	Consolidate the LayerPatching patching modes into a single implementation.	2024-12-24 15:57:54 +00:00
Ryan Dick	80db9537ff	Rename model_patcher.py -> layer_patcher.py.	2024-12-24 15:57:54 +00:00
Ryan Dick	0148512038	(minor) Rename num_layers -> num_loras in unit tests.	2024-12-24 15:57:54 +00:00
Ryan Dick	d0f35fceed	Add test_apply_smart_lora_patches_to_partially_loaded_model(...).	2024-12-24 15:57:54 +00:00
Ryan Dick	cefcb340d9	Add LoRAPatcher.smart_apply_lora_patches()	2024-12-24 15:57:54 +00:00
Ryan Dick	0fc538734b	Skip flaky test when running on Github Actions, and further reduce peak unit test memory.	2024-12-24 14:32:11 +00:00
Ryan Dick	7214d4969b	Workaround a weird quirk of QuantState.to() and add a unit test to exercise it.	2024-12-24 14:32:11 +00:00
Ryan Dick	a83a999b79	Reduce peak memory used for unit tests.	2024-12-24 14:32:11 +00:00
Ryan Dick	f8a6accf8a	Fix bitsandbytes imports to avoid ImportErrors on MacOS.	2024-12-24 14:32:11 +00:00
Ryan Dick	f8ab414f99	Add CachedModelOnlyFullLoad to mirror the CachedModelWithPartialLoad for models that cannot or should not be partially loaded.	2024-12-24 14:32:11 +00:00
Ryan Dick	c6795a1b47	Make CachedModelWithPartialLoad work with models that have non-persistent buffers.	2024-12-24 14:32:11 +00:00
Ryan Dick	0a8fc74ae9	Add CachedModelWithPartialLoad to manage partially-loaded models using the new autocast modules.	2024-12-24 14:32:11 +00:00
Ryan Dick	dc54e8763b	Add CustomInvokeLinearNF4 to enable CPU -> GPU streaming for InvokeLinearNF4 layers.	2024-12-24 14:32:11 +00:00
Ryan Dick	1b56020876	Add CustomInvokeLinear8bitLt layer for device streaming with InvokeLinear8bitLt layers.	2024-12-24 14:32:11 +00:00
Ryan Dick	3f990393a1	Simplify the state management in InvokeLinear8bitLt and add unit tests. This is in preparation for wrapping it to support streaming of weights from cpu to gpu.	2024-12-24 14:32:11 +00:00
Ryan Dick	97d56f7dc9	Add torch module autocast unit test for GGUF-quantized models.	2024-12-24 14:32:11 +00:00
Ryan Dick	fe0ef2c27c	Add torch module autocast utilities.	2024-12-24 14:32:11 +00:00
Ryan Dick	d30a9ced38	Rename model_cache_default.py -> model_cache.py.	2024-12-24 14:23:18 +00:00
Ryan Dick	e0bfa6157b	Remove ModelCacheBase.	2024-12-24 14:23:18 +00:00
Ryan Dick	dd09509dbd	Rename ModelPatcher -> LayerPatcher to avoid conflicts with another ModelPatcher definition.	2024-12-17 13:20:19 +00:00
Ryan Dick	7fad4c9491	Rename LoRAModelRaw to ModelPatchRaw.	2024-12-17 13:20:19 +00:00
Ryan Dick	b820862eab	Rename ModelPatcher methods to reflect that they are general model patching methods and are not LoRA-specific.	2024-12-17 13:20:19 +00:00
Ryan Dick	c604a0956e	Rename LoRAPatcher -> ModelPatcher.	2024-12-17 13:20:19 +00:00
Ryan Dick	37e3089457	Push LoRA layer reshaping down into the patch layers and add a new FluxControlLoRALayer type.	2024-12-17 13:20:19 +00:00
Ryan Dick	fe09f2d27a	Move handling of LoRA scale and patch weight down into the layer patch classes.	2024-12-17 13:20:19 +00:00
Ryan Dick	606d58d7db	Add sidecar wrapper for FLUX RMSNorm layers to support SetParameterLayers used by FLUX structural control LoRAs.	2024-12-17 13:20:19 +00:00
Ryan Dick	c76a448846	Delete old sidecar_layers/ dir.	2024-12-17 13:20:19 +00:00
Ryan Dick	1e0552c813	Add optimized implementations for the LinearSidecarWrapper when using LoRALayer or ConcatenatedLoRALayer patch types (since these are the most common).	2024-12-17 13:20:19 +00:00
Ryan Dick	e2451ef5ca	A unit tests for LinearSidecarWrapper (and fix a bug).	2024-12-17 13:20:19 +00:00
Ryan Dick	3a8a5442ea	Add basic unit tests for SetParameterLayer.	2024-12-17 13:20:19 +00:00

1 2 3 4 5 ...

523 Commits