Claude for Financial Services
…Claude 4 models outperform other frontier models as research agents across financial tasks in Vals AI's Finance Agent benchmark. When deployed by FundamentalLabs to build an Excel agent, Claude Opus 4…
…As models have become significantly better at long-horizon tasks over the last year or so, a new way of working emerged: rather than getting involved with every detail, we can specify…
…Together we launched Project Rainier, one of the largest compute clusters in the world, and we currently use over one million Trainium2 chips to train and serve Claude. Today’s agreement expands…
…a more traditional threat model. We're not protecting user machines from agents; we're protecting our own infrastructure and each tenant from one another. Our pre-launch work for claude.ai…
…At the core of this work are probes, which measure activations within the model as it generates a response and allow us to detect specific harms at scale. With this launch, we…
…We plan to launch new safeguards with an upcoming Claude Opus model, allowing us to improve and refine them with a model that does not pose the same level of risk as…
…We hope that this post helps to update defenders' mental model of the risks to match reality—now is the time to adopt AI for defense. If you want to contribute to…
…First is that models tend to lose coherence on lengthy tasks as the context window fills (see our post on context engineering). Some models also exhibit "context anxiety," in which they begin…