The biggest local LLM on your machine is useless if it can't call a single tool, no matter how many parameters it has
…The test harness was made open source, and the methodology was as straightforward as it sounds. It literally just involved giving a model a set of tools like a shopping cart API…
