I've been running some of the biggest open-weight LLMs for free on Nvidia's cloud
… You sign up at build.nvidia.com with the free Nvidia Developer account, you generate an API key prefixed with "nvapi-", and that's about it. The base URL is "https://integrate.api.nvidia.com/v1" and the rest of the calls look exactly like what you'd send to any OpenAI-compatible service. …