Cache up to X models with automatic LRU unloading #11930

@monstari

Description

Feature Idea

Would it be possible to add a setting like "Cache up to X models" in ComfyUI? Only a limited number of models would be kept in memory at once; when the limit is reached, the system would automatically unload the least recently used model and replace it with the newly requested one.

I'm aware that `--cache-lru` exists, but it applies at the node level rather than specifically to model handling, which makes it less effective for controlling VRAM/RAM usage when frequently switching between large models.

A model-focused LRU cache would give users more predictable memory control, reduce OOM errors, and improve workflow efficiency when working with multiple checkpoints, LoRAs, or diffusion models in a single session.
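To illustrate the idea, here is a minimal sketch of a model-level LRU cache. The names (`ModelLRUCache`, `loader`, `unloader`) are hypothetical and not part of ComfyUI's actual API; a real implementation would hook into ComfyUI's model management to free VRAM on eviction.

```python
from collections import OrderedDict

class ModelLRUCache:
    """Keep at most `max_models` loaded; evict the least recently used.

    `loader` and `unloader` are hypothetical callbacks standing in for
    whatever ComfyUI uses to load a checkpoint and free its VRAM/RAM.
    """

    def __init__(self, max_models, loader, unloader=None):
        self.max_models = max_models
        self.loader = loader        # callable: name -> model object
        self.unloader = unloader    # optional callable: model -> None
        self._cache = OrderedDict()  # insertion order tracks recency

    def get(self, name):
        if name in self._cache:
            # Cache hit: mark as most recently used and return it.
            self._cache.move_to_end(name)
            return self._cache[name]
        # Cache miss: load the model, then evict the LRU entry if over limit.
        model = self.loader(name)
        self._cache[name] = model
        if len(self._cache) > self.max_models:
            _, evicted = self._cache.popitem(last=False)  # LRU end
            if self.unloader:
                self.unloader(evicted)  # free VRAM/RAM here
        return model
```

For example, with `max_models=2`, requesting models `a`, `b`, `a`, `c` in order would evict only `b`, since `a` was touched more recently. This is the predictability the feature request is after: memory use is bounded by the limit rather than by the number of distinct models touched in a session.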

Existing Solutions

No response

Other

No response

Labels: Feature (a new feature to add to ComfyUI)