Does it unload the current model if VRAM is full, to allow swapping to a new model?
Erik
eribob
Recent Activity
commented on an article about 1 month ago: "New in llama.cpp: Model Management"