Phuriwaj

Claude Code — 2026 Capabilities and AI Dev Stack

Current state of Claude Code and the broader AI developer tooling ecosystem as of May 2026.

Why / When to Use

Reference when choosing AI coding tools, designing agentic workflows, or evaluating which tool to use for which task.

Claude Code — Key Features (May 2026)

Agent View

  • One CLI screen to manage all background sessions simultaneously
  • Shows elapsed duration, last response, status per session
  • /resume support to jump back into a background session
  • Start with claude --bg for background sessions

New Commands & Flags

  • /goal command — set a success criterion; Claude iterates until met
  • --add-dir, --settings, --mcp-config — granular session control
  • Fast mode upgraded to Opus 4.7 by default

Ultraplan

  • Draft implementation plans from the CLI
  • Review and comment in a web editor
  • Execute remotely or pull back locally
  • Bridges planning and execution for team coding workflows

Rate Limits

  • Doubled limits on Pro, Max, and Enterprise plans (as of May 2026)
  • Longer coding sessions without interruption

AI Coding Stack — Which Tool for What (2026 Verdict)

ToolBest For
Claude CodeSenior devs on hard refactor / architectural work
Codex (GPT-5.5)Async task delegation (hand off and review cold)
CursorDaily-driver IDE (fast completions, file edits)
GitHub CopilotSafest enterprise default
Devin / OpenHands / JulesBounded, well-described tasks (bug sweeps, not exploration)

Note: Devin averages ~800 LLM turns per task vs Claude Code’s ~30. Use cloud-async agents to parallelize, not for exploratory or architectural work.

MCP Ecosystem (May 2026)

  • 6,400+ servers in the official registry
  • LangChain, CrewAI, LangGraph, LlamaIndex have all moved MCP to default
  • OpenAI deprecated Assistants API in favour of MCP (sunset mid-2026)
  • Think of MCP as “USB-C for AI tools” — standardised tool-calling interface

MCP Best Practice

  • Give the LLM only the tools it needs — too many tools degrade reasoning quality
  • Invest heavily in tool and argument descriptions — they directly affect decision quality
  • FastMCP (Python) is the gold standard for rapid MCP server development

Agent Frameworks — Quick Pick Guide

FrameworkUse When
CrewAIWorkflow maps cleanly to distinct roles/agents
LangGraphComplex routing, state management, built-in checkpointing
Claude SDKTool-use-first agents with extended thinking and MCP

Most production systems combine all three.

Context Engineering (vs Prompt Engineering)

The real production failure is not a bad prompt — it’s bad context assembly:

  • Wrong documents retrieved
  • Too much history stuffed into the context window
  • Missing tool definitions

LangChain’s four context strategies: write, select, compress, isolate.

Key Anthropic insight: give the model explicit permission to express uncertainty (“if data is insufficient, say so”) — reduces hallucinations significantly in agentic workflows.

Agentic Cost Control

Critical at scale:

  • A single Devin bug-fix task can cost $180 and return a non-compiling PR
  • Need: per-task spend caps, trajectory scoring, webhook stop signals in your AI gateway
  • Token tracking alone is insufficient

Gotchas

  • claude --bg sessions accumulate; use Agent View to manage them actively
  • OpenAI Assistants API sunset is mid-2026 — migrate anything using it to MCP-based tooling now
  • FastMCP abstracts protocol complexity; don’t build raw MCP unless you have a specific reason

Source

Conversation: “LLM-powered news search and summarization sites” — 2026-05-23 AI Dev Brief section, Releasebot, TECHSY, Medium, gurusup.com, Simon Willison