shadow of a person walking past an Anthropic ad on a wall
Feature

Anthropic's Claude 3.7 Sonnet Lets You Look Inside Its 'Thought' Process

5 minute read
David Barry avatar
By
SAVED
Anthropic released Claude 3.7 Sonnet in late February, the first "hybrid reasoning" model on the market. Here's what's inside for business users.

Anthropic's Claude 3.7 Sonnet landed in late February as the first "hybrid reasoning" model on the market. Hybrid reasoning means users can choose between "normal" reasoning — the rapid responses we've grown accustomed to — "extended," more involved responses whose step-by-step "thinking" process is visible to the end user. 

reasoning options in Claude 3.7 Sonnet

Claude 3.7 Sonnet is available to everyone, however only paid users have access to the reasoning option. The price carries over from previous models, with one million input tokens costing $3 and one million output tokens costing $15. Claude 3.7 Sonnet can also be accessed across platforms like Amazon Bedrock, Google Cloud's Vertex AI and Anthropic's API. The release comes only four months after the release of Claude 3.5 Sonnet.

What's Inside the Claude 3.7 Sonnet Release

Claude 3.7 Sonnet builds upon its predecessor, Claude 3.5 Sonnet, offering substantial improvements in reasoning, coding and task execution. What really makes it stand out from the others though, is:

1. Hybrid Reasoning Modes

As mentioned before, the model operates in two modes — standard and extended thinking. In standard mode, it provides quick responses optimized for tasks like knowledge retrieval and automation. In extended thinking mode, it breaks down problems into manageable steps, visually displays its reasoning process, and delivers highly accurate solutions for complex challenges such as coding, physics problems and strategic analysis. 

While competitors offer the same functionality, it is split across different models (think the difference between OpenAI's ChatGPT 4o model and its ChatGPT o1 model), rather than combining it in a single model. OpenAI also has a hybrid reasoning model in the works, according to Sam Altman.

2. Enhanced Coding Capabilities

Claude 3.7 excels in software engineering tasks, scoring an industry-leading accuracy of 70.3% on SWE-bench Verified when using structured prompts. It supports end-to-end software development processes, including debugging, refactoring and creating production-ready code across multiple languages. 

3. Expanded Output Capacity

The model can generate outputs up to 128K tokens long (64K tokens generally available), which allows for detailed analyses, comprehensive outlines or extensive content generation.

4. Agentic Tool Use

Its improved performance in tasks requiring interaction with digital interfaces — such as clicking buttons or typing text — makes it suitable for automating workflows and testing applications. According to Anthropic, Claude 3.7 Sonnet is optimized for diverse use cases:

  • Software development: From planning to debugging and maintenance, the model supports complex coding tasks with reduced error rates and improved precision.
  • Customer-facing AI agents: Its advanced reasoning capabilities enhance customer support workflows by improving instruction-following and tool selection.
  • Strategic analysis: Extended thinking mode allows businesses to tackle intricate problems requiring deep reflection and multi-step planning.

Anthropic has positioned Claude 3.7 as its most intelligent model to date, which promises significant improvements in coding, front-end development and enterprise workflow automation.

Adding to its appeal, Anthropic introduced Claude Code in limited research preview, a command-line tool that allows developers to offload substantial engineering tasks to AI agents directly from their terminal.

Transparency and Collaborative Development Stand Out

The hybrid reasoning capabilities are one of the most noteworthy elements in the new release, Andrew Reece, chief AI scientist at BetterUp, told Reworked.

“The hybrid reasoning approach might best be understood as giving you options in how your AI partner thinks — sometimes you need quick, efficient responses, and other times you want more deliberate analysis,” he said.

The visibility into the thinking process, which creates a level of transparency that helps bridge the gap between human and machine cognition, is potentially transformative, Reece added.  

The value proposition for enterprise users is the flexibility to match the AI's thinking mode to the task complexity, potentially allowing people to redirect their cognitive bandwidth toward refining strategic vision rather than managing execution details. 

Claude Sonnet 3.7's improvements in coding and front-end development offer what is effectively an evolution from prediction-based code assistance to something more closely resembling collaborative development, Reece continued.

“The interesting shift isn't necessarily in code quality — although Sonnet 3.7’s coding chops are head-and-shoulders above the competition — but in how this new class of models reduces the cognitive load of translating high-level ideas into implementation details,” he said.

This change moves users towards a best-case scenario, where developers are able to spend more time on things like visioning, planning and design — areas where human judgment adds unique value. People call it “vibe coding,” but Reece predicts in time we won’t even think of it as “coding” at all, it will just be creating.

Technical Barriers Might Prevent Knowledge Workers From Fully Benefitting (for Now)

Coders and software engineers aren't the only people likely to benefit. Knowledge workers also can benefit from Claude 3.7 Sonnet's capabilities, Pmfm.ai founder Aditya Saxena told Reworked.

He cites the example of a writer who trains a custom Claude 3.7 Sonnet bot on their style to use as an assistant, a lawyer using it to draft briefs or a small business tapping it for market research, report generation or document analysis. 

“However, it’s fair to say that its impact would be felt the most in the realm of software development,” he said, due to the complexities of using APIs and setting up AI agents.  

So while Saxena sees the potential productivity gains for knowledge workers, he expects adoption will remain low due to the high technical barrier.

Learning Opportunities

He also reminds businesses that no matter how sophisticated a model is — including Claude 3.7 Sonnet — humans still need to be in the loop, particularly in critical processes like code deployment or review.  

“AI models will incrementally get better at agentic reasoning and automating workflows. That is inevitable. But that shouldn’t translate into humans playing a less critical role by any means," Saxena said. “Humans have to ultimately take ownership of their work — whether it’s done by AI or not. All organizations using AI agents should know that LLMs aren’t infallible and all AI work should be scrutinized by a human."

Claude Code Is the Blueprint for What's to Come

TetraNoodle Technologies founder and CIO Manuj Aggarwal echoes Saxena's reminder, stating AI is just a tool, not a decision-maker. “I advise business leaders to let AI handle execution, but keep humans in charge of strategy and critical decisions. Set clear guardrails, monitor and audit the decisions made by AI, and use AI as a co-worker for your workforce. AI should assist, not replace humans. The companies that get this right will scale faster, work smarter and avoid costly mistakes."

Reasoning models, in their ability to peel back the curtain on their thought processes, will help enterprises get familiar with how generative AI “thinks,” which can help in building trust in results, said Reece.

While Sonnet 3.7 is an impressive model, it is just one in a cohort of models with these advanced capabilities, he continued. “I’m not sure it’s going to be a game-changer for knowledge work in the enterprise on its own. Claude Code is the blueprint, though, for where things are headed, in terms of a human-agent hybrid workforce as the dominant labor model for most enterprises,” he said.

Editor's Note: Read more about other recent developments in AI:

About the Author
David Barry

David is a European-based journalist of 35 years who has spent the last 15 following the development of workplace technologies, from the early days of document management, enterprise content management and content services. Now, with the development of new remote and hybrid work models, he covers the evolution of technologies that enable collaboration, communications and work and has recently spent a great deal of time exploring the far reaches of AI, generative AI and General AI.

Main image: adobe stock
Featured Research