Report
EQS AI Benchmark Report Vol. 2
AI Performance in Compliance & Ethics
The results are in. Download the report that gives Compliance teams an evidence-based view of what today’s AI can and can’t do.
Vol. 2 updates the industry’s first comprehensive AI benchmark for Compliance & Ethics work, now covering the latest frontier models from OpenAI, Google, Anthropic and Mistral — tested against the same 120 real-world tasks used in Vol. 1.
A few things stood out in this year’s results. The new generation made its biggest gains in open-ended Compliance work (policy drafts, investigation plans, board-level reports), improving by up to 18 percentage points over last year’s models. Five models from three vendors are now separated by less than one percentage point at the top of the leaderboard. And for the first time, a single frontier model carries a full conflict-of-interest workflow end-to-end, scoring above 90%.
The report also covers where human oversight remains essential, what the convergence at the top of the leaderboard means for tool selection, and what practical steps Compliance teams can take now.