llama-cpp-python in Docker
A Dockerfile and docker-compose setup for running llama.cpp with its Python bindings in a container, because finding a working one shouldn't be this hard.
read more →A Dockerfile and docker-compose setup for running llama.cpp with its Python bindings in a container, because finding a working one shouldn't be this hard.
read more →An all-in-one Docker container that bundles LLMs, embeddings, vision, and TTS into a single unified inference server.
read more →