Running the MOSS-TTS Family on Strix Halo
Benchmarking OpenMOSS's 8B llama.cpp GGUF backend and 100M ONNX Nano model on AMD's Ryzen AI MAX+ 395, with thread-scaling analysis and a surprising Nano result that beats everything else I've tested.