Weekly Report
May 23, 2026 – May 29, 2026
A curated summary of the most important updates in AI from the last 7 days.
New Products
Pi Coding Agent
A minimal, hackable terminal coding harness that lets developers customize their AI coding agent workflow through extensions, skills, prompt templates, and themes distributed as npm packages.
Edgee Fallback Models
An agent gateway that provides automatic fallback routing for Claude Code when Anthropic is down, rate-limited, or plan caps are hit. Routes to alternative models with zero code changes.
MashuPack
A developer tool that turns codebases into clean files optimized for Claude and ChatGPT consumption, helping developers prepare repository context for AI coding agents.
Stitch 3.0
Google's AI-powered tool for generating and iterating UI screens with AI on a live canvas, specifically designed for developer workflows.
ModelHub
A menu bar app for Mac that provides management and access to local LLMs, enabling developers to run and manage AI models locally for coding workflows.
AgentRail
An open-source local control plane and task lifecycle layer for AI coding agents that connects agents to GitHub, Linear, and CI with structured typed events.
BobCA
A sovereign agent that learns to code with your preferences. Focuses on personalized AI coding agents that adapt to individual developer workflows and coding styles.
Harbor CLI
CLI tool with companion app to spin up complete local LLM stacks. Provides a containerized toolkit for developers to manage local Large Language Model environments.
MashuPack
A tool that turns codebases into clean text files for ChatGPT, Claude, and Gemini. It selects exact parts of a repository and compiles them into one file.
CodeGraph
Pre-indexed code knowledge graph for AI coding agents. Provides semantic code intelligence with 35% cost reduction, 70% fewer tool calls, and 100% local operation.
New Features
Claude Code v2.1.150
Internal infrastructure improvements with no user-facing changes. Released May 23, 2026.
Claude Code v2.1.149
Enhanced Usage Insights with per-category breakdown, UI improvements, security fixes including PowerShell permission bypass and sandbox issues.
Gemini CLI v0.43.0
Surgical Code Edits & Session Portability. Gemini models now prefer edit tool for modifications. Introduced session export/import features and adaptive token calculator.
GitHub Copilot Eclipse Open Source
GitHub Copilot for Eclipse is now open source, making the codebase available for community contribution and transparency.
Semantic Issue Search in Copilot Chat
New semantic search capability in Copilot Chat helps find and understand issues more effectively by understanding meaning rather than just keywords.
Background Agents in Slack Integration
Cursor now supports launching Background Agents directly from Slack by mentioning @Cursor. Agents can read threads and create PRs without leaving Slack.
Composer 2.5 Release
Composer 2.5 is now available with substantial improvements in intelligence and behavior. Better at sustained work on long-running tasks.
Claude Opus 4.7 Fast Mode
Claude Opus 4.7 (fast mode) is now available in Windsurf, offering the full intelligence of Opus 4.7 but with approximately 2.5x higher output speeds.
Team-level Copilot Usage Metrics API
New API endpoints provide team-level usage metrics for Copilot, helping organizations track adoption and usage patterns across teams.
BugBot Automatic Code Review
BugBot for automatic code review that catches potential bugs and issues in PRs, leaving comments directly on GitHub with 'Fix in Cursor' integration.
New Technologies
MOSS: Self-Evolution through Source-Level Rewriting
ArXiv paper introducing MOSS, a system for autonomous agent self-evolution through source-level rewriting. Agents can modify their own code to improve capabilities.
Compiling Agentic Workflows into LLM Weights
ArXiv paper proposing a method to compile agentic workflows directly into LLM weights, achieving near-frontier quality at two orders of magnitude less cost.
TerminalWorld: Benchmarking Agents on Terminal Tasks
ArXiv paper introducing TerminalWorld, a benchmark for evaluating AI agents on real-world terminal tasks with standardized measurement of agent performance.
Cursor Composer 2.5 Model
Cursor's third-generation proprietary coding model trained on 25x more synthetic tasks. Built on Kimi K2.5 base, priced at $0.50/$2.50 per million tokens.
Qwen 3.7 Max API
Alibaba's closed-weight coding model API scoring 56.6 on Artificial Analysis Index and 72.5 on SWE-Bench Verified. Natively supports Anthropic Messages protocol.
SpaceX-Anthropic $1.25B/Month Compute Deal
SpaceX S-1 filing revealed Cloud Services Agreements with Anthropic for compute capacity. Anthropic agreed to pay $1.25B/month through May 2029.
Others
FTC Settlement Over 'Active Listening' AI Marketing
FTC required Cox Media Group to pay nearly $1M for deceiving customers about 'Active Listening' AI. Service didn't actually listen, just resold email lists.
Karpathy: Programming Has Fundamentally Changed
Karpathy tweeted it's 'hard to communicate how much programming has changed due to AI.' AI agents now write 80% of his code and he hasn't written code in months.
Datasette Agent: Extensible AI Assistant
Simon Willison announced Datasette Agent, integrating his LLM Python library (3+ years work) with Datasette. Includes per-user spending limits and sandbox execution.
Pope Leo XIV's Encyclical on AI Ethics
Simon Willison analyzes Magnifica Humanitas, Pope Leo XIV's encyclical on safeguarding the human person in the time of artificial intelligence.
Microsoft Copilot Cowork Exfiltration Vulnerability
Security vulnerability where agents could send emails with external images triggering network requests, enabling data exfiltration through OneDrive links.
Building AI Agents the Safe Way
Practical guidance on defending against prompt injection and other AI attacks in agent systems, emphasizing basic engineering practices and thorough testing.
LLM Coding Workflow Challenges in 2026
Active HN discussion about LLM coding workflows. Key themes: balance between human specification vs agent autonomy, multi-step pipelines, need for scaffolding.
Microsoft Cancels Claude Code Licenses
Microsoft began canceling Claude Code licenses for thousands of developers. Cancellation deadline is June 30, 2026. Decision driven by financial motives and spiraling AI costs.