Nextcloud Assistant and LocalAI: How We Optimised Response Speed

We run Nextcloud with a self-hosted LLM via LocalAI and vLLM. Response times were unpredictable — here is what we found and how we fixed it.

Read more →