Homebrew AI

Model serving, local inference, GPU optimization... all homebrewed!