• Menu
  • Skip to right header navigation
  • Skip to main content
  • Skip to primary sidebar

DigiBanker

Bringing you cutting-edge new technologies and disruptive financial innovations.

  • Home
  • Pricing
  • Features
    • Overview Of Features
    • Search
    • Favorites
  • Share!
  • Log In
  • Home
  • Pricing
  • Features
    • Overview Of Features
    • Search
    • Favorites
  • Share!
  • Log In

Anthropic’s new Claude Opus 4.1 model scores 74.5% on SWE-bench Verified, surpassing OpenAI’s o3 model at 69.1% and Google’s Gemini 2.5 Pro at 67.2%, indicating its dominance in AI-powered coding assistance

August 7, 2025 //  by Finnovate

Anthropic unveiled the latest version of its flagship artificial intelligence model, the same day that OpenAI released its first two open reasoning models since 2019. Claude Opus 4.1 is better at agentic tasks, coding and reasoning, according to a company blog post. Leaks of Claude Opus 4.1 began appearing the day before on social platform X and TestingCatalog. Anthropic Chief Product Officer Mike Krieger said this release is different from previous model unveilings. Claude Opus 4.1 is a successor to Claude Opus 4, which launched May 22. Opus 4.1 shows gains on benchmarks such as SWE-Bench Verified, a coding evaluation test, where it scores two percentage points higher than the previous model. The 4.1 model is also strong in agentic terminal coding, with a score of 43.3% on the Terminal-Bench benchmark compared with 39.2% for Opus 4, 30.2% for OpenAI’s o3, and 25.3% for Google’s Gemini 2.5 Pro. Customers such as Windsurf, a coding app being acquired by Cognition, and Japan’s Rakuten Group have reported quicker and more accurate completion of coding tasks using Claude Opus 4.1. The Claude Opus 4.1 release came amid signs that rival OpenAI is nearing the debut of GPT-5

Read Article

Category: Essential Guidance

Previous Post: « US-based crypto and fintech companies are expanding their platforms to build ‘super apps’ to offer a broad range of services that include tokenized RWAs, stocks, derivatives, ETF tokens and prediction markets, indicating unification of crypto UX
Next Post: NACHA says AI is enhancing payment operations through real-time decision-making powered by intelligent exception handling to automated return code analysis; also end-user UX is improving through real-time status updates and self-service tools; AI is also identifying potential fraud »

Copyright © 2025 Finnovate Research · All Rights Reserved · Privacy Policy
Finnovate Research · Knyvett House · Watermans Business Park · The Causeway Staines · TW18 3BA · United Kingdom · About · Contact Us · Tel: +44-20-3070-0188

We use cookies to provide the best website experience for you. If you continue to use this site we will assume that you are happy with it.