I've been running some of the biggest open-weight LLMs for free on Nvidia's cloud
…Setup is basically an OpenAI-compatible endpoint If your tool speaks OpenAI, it speaks Nvidia The whole thing is built around an OpenAI-compatible API, which makes it trivially easy to drop…