About
This skill enables Claude to set up and manage Mozilla llamafile, a cross-platform distribution format for running large language models locally without cloud dependencies. It facilitates the installation of llamafile binaries, downloading various GGUF models, and configuring high-performance local servers with GPU acceleration. It is particularly useful for developers building air-gapped tools, troubleshooting local inference connections, or integrating local models with existing OpenAI-compatible SDKs and LiteLLM workflows for cost-effective or private AI development.