LLM Serving Pack

Status: Official (OpenTeams)
Maintainer: nebari-dev
Source: <a href="https://github.com/nebari-dev/nebari-llm-serving-pack" target="_blank" rel="noopener noreferrer">nebari-dev/nebari-llm-serving-pack

Add LLM serving to your Nebari cluster so your team can run large language models behind a managed API. The pack handles model downloading, serving, routing, and per-model access control, with rate limiting and token counting included so usage stays accountable.