Sleeping Robots

Sleeping RobotsProject write-ups, experiments, and things I wanted to exist.https://sleepingrobots.com/Pi Web UI: A Browser Interface for the Pi Coding Agenthttps://sleepingrobots.com/dreams/pi-web-ui/https://sleepingrobots.com/dreams/pi-web-ui/A full-stack web interface that puts the Pi coding agent in the browser — with system-level access, session history, and model switching through a local LiteLLM proxy.Sat, 11 Apr 2026 00:00:00 GMTagentsllmlocalaiinfralinuxZetaphorLocal LLM Infrastructure on Strix Halohttps://sleepingrobots.com/dreams/local-llm-infrastructure-strix-halo/https://sleepingrobots.com/dreams/local-llm-infrastructure-strix-halo/How LiteLLM, llama-swap, and Lemonade Server compose into a unified local inference platform — routing dozens of models across GPU and NPU through a single API endpoint, accessible anywhere via Tailscale and a local reverse proxy.Fri, 10 Apr 2026 00:00:00 GMTstrix-halollmamdlocalaihardwarelinuxinfradockerZetaphorRunning LLMs on the AMD NPU with Lemonade Serverhttps://sleepingrobots.com/dreams/lemonade-server-npu-strix-halo/https://sleepingrobots.com/dreams/lemonade-server-npu-strix-halo/Setting up AMD's Lemonade Server on Strix Halo to run LLM and Whisper inference on the XDNA 2 NPU — driver builds, architecture decisions, and benchmarks against the integrated GPU.Thu, 09 Apr 2026 00:00:00 GMTllmamdlocalaihardwarelinuxspeech-to-textinfrastrix-haloZetaphorBenchmarking OmniVoice on Strix Halohttps://sleepingrobots.com/dreams/omnivoice-strix-halo/https://sleepingrobots.com/dreams/omnivoice-strix-halo/Running a 600+ language zero-shot TTS model on an AMD integrated GPU — voice cloning benchmarks, ROCm compatibility adventures, and the container workaround that actually worked.Thu, 09 Apr 2026 00:00:00 GMTttsrocmamdlocalaivoicestrix-haloZetaphorBenchmarking VoxCPM2 on Strix Halohttps://sleepingrobots.com/dreams/voxcpm-strix-halo/https://sleepingrobots.com/dreams/voxcpm-strix-halo/Running a 2B parameter tokenizer-free TTS model in both Python and C++ on AMD's integrated GPU — near-real-time speech synthesis on CPU, and the Vulkan crash that stopped GPU acceleration in its tracks.Thu, 09 Apr 2026 00:00:00 GMTttsamdlocalaivoicelinuxstrix-haloZetaphorSelf-Hosting Fish Audio on Strix Halohttps://sleepingrobots.com/dreams/fish-audio-strix-halo/https://sleepingrobots.com/dreams/fish-audio-strix-halo/Running Fish Audio's 4B parameter S2-Pro text-to-speech model locally on an AMD Strix Halo integrated GPU via ROCm and Podman.Sun, 15 Mar 2026 00:00:00 GMTttsrocmamdlocalaistrix-haloClaude Opus 4.6Medium-Claw: A Persistent AI Companion on Telegramhttps://sleepingrobots.com/dreams/medium-claw/https://sleepingrobots.com/dreams/medium-claw/A Telegram bot backed by the Pi coding agent with autonomous scheduling, persistent memory, cross-session search, and a web dashboard.Sun, 15 Mar 2026 00:00:00 GMTagentstelegramlocalaiZetaphorLoopMaker Webhttps://sleepingrobots.com/dreams/loopmaker-web/https://sleepingrobots.com/dreams/loopmaker-web/A browser-based AI music generation tool powered by ACE-Step, ported to Linux for local generation on AMD Strix Halo hardware.Thu, 05 Mar 2026 00:00:00 GMTmusicailocallinuxstrix-haloZetaphorQuizForge: Self-Learning Quiz Makerhttps://sleepingrobots.com/dreams/self-learning-quiz-maker/https://sleepingrobots.com/dreams/self-learning-quiz-maker/A full-stack quiz platform that turns markdown files and YouTube transcripts into mixed-format quizzes with AI grading, contextual chat, and performance analytics.Fri, 20 Feb 2026 00:00:00 GMTlearningailocaltoolsZetaphorOneiros: A Personal AI Agent Platformhttps://sleepingrobots.com/dreams/oneiros/https://sleepingrobots.com/dreams/oneiros/A modular collection of services for building a personal AI agent — tool use, memory, browser automation, TTS, and multi-platform chat interfaces.Fri, 30 Jan 2026 00:00:00 GMTagentslocalinfraaiZetaphorOCR List Makerhttps://sleepingrobots.com/dreams/ocr-list-maker/https://sleepingrobots.com/dreams/ocr-list-maker/Snap a photo of a handwritten list, OCR it with a local vision model, and print a formatted checklist on a thermal receipt printer.Sun, 28 Dec 2025 00:00:00 GMTocrlocalhardwareaiZetaphorllama-cpp-python in Dockerhttps://sleepingrobots.com/dreams/llama-cpp-python-docker/https://sleepingrobots.com/dreams/llama-cpp-python-docker/A Dockerfile and docker-compose setup for running llama.cpp with its Python bindings in a container, because finding a working one shouldn't be this hard.Mon, 03 Nov 2025 00:00:00 GMTllmdockerlocalinfraZetaphorLavabo: The Kitchen Sink of Local AIhttps://sleepingrobots.com/dreams/lavabo/https://sleepingrobots.com/dreams/lavabo/An all-in-one Docker container that bundles LLMs, embeddings, vision, and TTS into a single unified inference server.Sun, 10 Aug 2025 00:00:00 GMTllmdockerlocalinfraZetaphorWeb Browser Wrappedhttps://sleepingrobots.com/dreams/web-browser-wrapped/https://sleepingrobots.com/dreams/web-browser-wrapped/Generating weekly Spotify Wrapped-style reports from browser history using local models and Browser Recall data.Mon, 14 Apr 2025 00:00:00 GMTbrowserrecalllocalaiZetaphorSpeech To Text Typing for Wayland Usershttps://sleepingrobots.com/dreams/speech-to-text-typing-wayland/https://sleepingrobots.com/dreams/speech-to-text-typing-wayland/Building a custom speech-to-text solution for Linux Wayland users using NVIDIA's Canary model, Silero VAD, and ydotool.Sun, 13 Apr 2025 00:00:00 GMTspeech-to-textlinuxpythonlocalZetaphorTotal (Browser) Recallhttps://sleepingrobots.com/dreams/total-browser-recall/https://sleepingrobots.com/dreams/total-browser-recall/Building a personal browser history search engine with full-text recall, inspired by Microsoft's Recall and rewind.ai.Sun, 13 Apr 2025 00:00:00 GMTbrowserrecallproductivitylocalZetaphorA Practical, Fully Local Desktop Voice Agenthttps://sleepingrobots.com/dreams/desktop-voice-agent/https://sleepingrobots.com/dreams/desktop-voice-agent/Building a natural language voice controller for the Linux desktop using Qt6, a tiny 1.7B LLM, and a clever vector embedding trick for tool calling.Wed, 05 Feb 2025 00:00:00 GMTvoiceagentlocallinuxllmZetaphorA Fully Local, In-Browser Voice Assistanthttps://sleepingrobots.com/dreams/browser-based-voice-assistant/https://sleepingrobots.com/dreams/browser-based-voice-assistant/Building a private, browser-based voice assistant using WebAssembly, Moonshine STT, Piper TTS, and local LLMs.Thu, 16 Jan 2025 00:00:00 GMTvoicelocalaiwebassemblyllmZetaphorOffline Voice Chatbot in the Browserhttps://sleepingrobots.com/dreams/wasm-voice-chatbot/https://sleepingrobots.com/dreams/wasm-voice-chatbot/Building a fully offline voice interface for LLMs using WebAssembly — VAD, speech-to-text, and text-to-speech all running client-side.Sat, 04 Jan 2025 00:00:00 GMTvoicewasmbrowserlocalZetaphor & Claude Opus 4.6You Are Johnhttps://sleepingrobots.com/dreams/you-are-john/https://sleepingrobots.com/dreams/you-are-john/A text-driven simulation where you interact with a guy named John through natural language, and an LLM determines how his world responds.Sun, 13 Oct 2024 00:00:00 GMTllmgamessimulationaiZetaphor