Weekly Report
Dec 25, 2025 – Dec 31, 2025
A curated summary of the most important updates in AI from the last 7 days.
New Products
Toad
A unified terminal front-end for AI coding tools including OpenHands, Claude Code, and Gemini CLI. Supports 12 agent CLIs via the ACP protocol with enhanced UI features like fuzzy file search, markdown streaming, integrated shell, and tab completion.
Wafer
GPU development stack that lives inside your IDE, bringing together profiling, compiler explorer, and GPU documentation in one place. Eliminates context switching when writing GPU kernels by integrating fragmented tools directly into the editor workflow.
cto bench
A ground truth code agent benchmark that measures AI coding agent performance. The benchmark tracks merged code as a percentage of completed tasks and reports a 4-day rolling success rate with a 1-day lag for task resolution.
Netlify AI Gateway
A developer tool that simplifies using AI inference in code by eliminating the need to manage API keys or create accounts with AI providers. Provides access to 30+ AI models with unified monitoring of usage and costs.
GitHub Copilot Agent Skills
A new feature for GitHub Copilot that allows developers to create custom agent capabilities and skills. Agent Skills enables more customized and powerful agent-based workflows within the Copilot ecosystem, integrating with VS Code and supporting multi-agent orchestration.
New Features
Claude Code Browser Control
Claude Code launched full browser automation through Chrome integration. The AI can navigate pages, click buttons, fill forms, read console logs, monitor network requests, and record GIFs of browser interactions - all while keeping the browser visible.
Cursor 2.3 & 2.2 Updates
Cursor released version 2.3 with layout customization and stability improvements, following the major 2.2 release that introduced Debug Mode with runtime thinking capabilities and a Visual Editor for natural language-based web design with point-and-click editing.
GitHub Copilot Agent Skills & Memory
GitHub Copilot now supports Agent Skills for creating specialized, repeatable tasks, and introduced Memory feature in public preview that allows Copilot to learn from your codebase and remember context across sessions.
GPT-5.2-Codex
OpenAI introduced GPT-5.2-Codex, optimized for agentic coding with improvements for long-horizon work, enhanced long-context understanding, improved tool calling reliability, better factuality, and the ability to parse UI screenshots and technical diagrams.
Windsurf Wave 13
Windsurf released Wave 13 featuring first-class support for parallel, multi-agent sessions, GPT-5.2 integration, and various bug fixes. Latest version 1.13.104 includes performance improvements and stability enhancements.
GitLab 18.7
GitLab release with enhanced AI capabilities including improved Duo analytics dashboard, AI-powered model chat selection, Duo Data Analyst beta, AI-powered SAST false positive detection, and new building blocks for the upcoming Duo Agent Platform GA.
Claude Code Async Background Agents
Claude Code now supports asynchronous background agents that can run tasks in parallel while you continue working. Agents can be sent to background with Ctrl+B and automatically hook back in when complete, enabling parallel AI development workflows.
Claude Code LSP Support
Claude Code released native LSP (Language Server Protocol) support for better code intelligence and language understanding, along with improved VS Code extension integration including multi-terminal setup support and syntax highlighting toggle.
New Technologies
Agent Skills (Open Standard)
Anthropic's open standard for AI agent capabilities that enables cross-platform portability of agent skills. Skills are special sets of instructions that teach AI agents how to handle specific work tasks, allowing them to be portable across different AI platforms.
Google Agent Development Kit for TypeScript
An open-source, modular framework that brings a code-first approach to building AI agents with TypeScript and JavaScript. Enables developers to build autonomous multi-agent AI systems using familiar software engineering practices.
GLM-4.7
Zhipu AI's latest flagship model specifically designed for 'Agentic Coding' scenarios, achieving 73.8% on SWE-bench benchmark with enhanced programming abilities, better long-term task planning, improved tool collaboration, and more stable multi-step reasoning.
Others
Cursor acquires Graphite
Cursor, an AI-powered code editor, has acquired Graphite, a code review platform. Graphite specializes in AI-powered code review workflow that helps teams review changes more efficiently by breaking large changes into smaller, connected pieces.
Cursor CEO warns about 'vibe coding'
Cursor CEO Michael Truell warned that over-reliance on AI-generated 'vibe coding' creates unstable foundations for software projects, noting that while it works for simple tasks, 'eventually things start to crumble' as complexity increases.
Anthropic acquires Bun
Anthropic announced its first acquisition - Bun, a high-performance JavaScript runtime - as Claude Code reaches $1 billion in run-rate revenue in just 6 months since launch.