Weekly Report
Feb 24, 2026 – Mar 2, 2026
A curated summary of the most important updates in AI from the last 7 days.
New Products
GLM-5 with OpenClaw
Open-source AI coding agent framework with 161K+ GitHub stars. Features built-in LanceDB memory plugin for persistent workspace memory, autonomous coding capabilities, and Terminal Bench 2.0 score of 56.2 (open-source SOTA).
OpenCode AI
Terminal-based AI coding agent built specifically for the CLI with TUI interface. Supports installation via npm, brew, and curl. Works with multiple LLMs including Claude, OpenAI, Google Gemini, and local models.
Figr
AI product assistant for product managers. A Chrome extension that parses applications in real time, imports Figma designs, adds documentation and analytics, and creates user flows, UX evaluations, and A/B test variants based on 200,000+ UX patterns.
Base44
Complete backend service for building AI agent applications. Optimized for Claude Code and Cursor. Deploy full-stack applications with a single command. Tested across millions of real-world applications.
Qwen Coder
Autonomous coding agent from Alibaba's Qwen team. Part of Qwen Agent framework, optimized for coding tasks including writing, debugging, and fixing code. Released as part of Qwen 3.5 framework.
Mastra Code
AI programming agent featuring observational memory and support for long-term coding sessions. Provides precise context management, rapid application building, and data compression for extended development work with workspace memory capabilities.
Superset
Full AI IDE that runs multiple coding assistants (Claude Code, Codex, etc.) simultaneously with independent sandboxing, unified monitoring, built-in diff viewer, and real-time notifications. Designed to reduce context switching and improve development efficiency by up to 10x.
Happycapy
Browser-based cloud computer pre-loaded with Claude Code and other AI models. Provides 2 vCPU / 4GB RAM / 10GB SSD sandbox environment with no local installation required. Works on both desktop and mobile devices as an 'Agent-Native Computer for ordinary people'.
Trae
China's first AI-native IDE by ByteDance, launched February 24, 2026. Offers a full-scenario AI-native development environment, seamless migration from VS Code, and full-process automated development. Optimized for Chinese-language contexts, with flexible enterprise-level deployment.
Straion
Enterprise-focused AI coding agent solving the 'babysitting problem' by providing organizational context and rules automatically, ensuring agents work within enterprise constraints. Founded by ex-Elastic, Dynatrace, Netlify team. Reached #2 on Product Hunt with 349 upvotes.
New Features
Claude Opus 4.6 with 1M Token Context
Anthropic released Claude Opus 4.6 on February 5, 2026, featuring a context window of up to 1 million tokens (beta), an 80.8% SWE-bench Verified score, Agent Teams for multi-agent collaboration, and a Fast Mode with a 2.5x speed improvement. Pricing: $30/$150 per million tokens.
GPT-5.3-Codex Model Release
OpenAI released GPT-5.3-Codex on February 5, 2026, featuring 25% faster performance with half the token usage of GPT-5.2-Codex. Combines coding capabilities with reasoning and professional knowledge. SWE-Bench Pro score of 56.8%, Terminal-Bench 2.0 score of 77.3% (+13% improvement).
Apple Xcode 26.3
Major update to Apple's IDE introducing native agentic coding capabilities. Integrates Anthropic's Claude Agent and OpenAI's Codex directly into Xcode with MCP (Model Context Protocol) support. Agents can access project context, modify files, and verify visual output in SwiftUI previews.
GitHub Agentic Workflows
GitHub introduced Agentic Workflows, a feature for automating repository tasks with AI agents that integrate with CI/CD pipelines. The system generates automated status reports and lets agents drive continuous integration and deployment tasks.
GitHub Agentic Workflows (gh-aw)
Official GitHub tool that allows AI agents to execute automated tasks within GitHub Actions using natural language/Markdown instead of YAML. Features include issue triage, PR code review, CI/CD failure analysis, and documentation maintenance with sandboxed execution.
JetBrains Junie AI Coding Agent
AI coding agent from JetBrains that executes complex tasks including writing code, debugging, running applications, writing tests, and verifying results. Now publicly available to all JetBrains customers with support for IntelliJ IDEA Ultimate, PyCharm Pro, WebStorm, and GoLand.
JetBrains AI Free Tier
New free tier from JetBrains offering unlimited code completion, access to local AI models, and credit-based cloud AI assistance with 30 days of AI Pro access. All JetBrains AI tools are now available through a single subscription starting from the 2025.1 version.
Agent Technology Integration & Visual Capabilities
GitHub Copilot integrated full agent technology with autonomous code iteration capabilities, added Copilot Edits for multi-file editing, image upload support for code generation from visuals, and Next Edit Suggestions for predictive editing.
Anthropic Claude Sonnet 4.6 Integration
GitHub Copilot integrated Anthropic's Claude Sonnet 4.6 model, which excels at complex agent-based coding tasks and search operations. Available across web, iOS/Android mobile apps, CLI, VS Code, and Visual Studio.
OpenAI Codex macOS Application and GPT-5.3-Codex Model
OpenAI released a macOS desktop version of Codex in February 2026, featuring the GPT-5.2-Codex model with 40% faster reasoning, 400k-token input support, and support for 50+ programming languages. It integrates MCP and a 'work tree' mechanism for enhanced workflow management.
Composer 1.5 Model Release
Cursor released Composer 1.5 on February 10, 2026, featuring 20x increased reinforcement learning training scale, thinking tokens, and self-summary mechanisms for deeper reasoning in complex programming tasks. Continues evolution of proprietary AI coding model with enhanced multi-file editing capabilities.
Cursor 2.5 with Plugin Marketplace
Cursor released version 2.5 in February 2026, introducing a new Plugin Marketplace allowing users to package multiple capabilities into one-click installations including Rules, tools, and extensions. Also added asynchronous sub-agents for parallel agent operations and enhanced sandbox network control.
Windsurf Wave 13 with SWE-1.5 Model
Windsurf released Wave 13 in February 2026, introducing the SWE-1.5 'Penguin Alpha' model as default for all users with free 3-month access. Features include context window indicator, multi-session parallel view with split panes/tabs, Cascade Hooks for lifecycle actions, and Git Worktrees support.
New Technologies
Qwen 3.5 and Qwen3-Coder-Next
397B Mixture-of-Experts model with 17B active parameters and 262K native context (extensible to 1M). Released February 16, 2026. Qwen3-Coder-Next is optimized for programming agents and local development with enhanced code execution at low inference costs.
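The parameter counts above imply a simple ratio worth making explicit. The following is a rough arithmetic sketch only: the function names are hypothetical, and the 2 bytes per parameter (16-bit weights) used for the memory estimate is an assumption, not a figure from the release.

```python
TOTAL_PARAMS_B = 397   # total parameters, in billions (from the summary above)
ACTIVE_PARAMS_B = 17   # parameters active per token, in billions (from the summary above)


def active_fraction(active_b: float, total_b: float) -> float:
    """Fraction of the model's parameters used to process any single token."""
    return active_b / total_b


def weight_memory_gb(params_b: float, bytes_per_param: float) -> float:
    """Approximate weight storage in GB (1 GB = 1e9 bytes).

    Ignores activations and KV cache. Billions of parameters times bytes
    per parameter gives gigabytes directly.
    """
    return params_b * bytes_per_param


if __name__ == "__main__":
    frac = active_fraction(ACTIVE_PARAMS_B, TOTAL_PARAMS_B)
    print(f"active per token: {frac:.1%}")
    # Assumption: 16-bit weights (2 bytes each); real deployments may quantize.
    print(f"weights at 16-bit: {weight_memory_gb(TOTAL_PARAMS_B, 2):.0f} GB")
```

Per-token compute scales with the 17B active parameters, while all 397B still have to be stored, which is the usual trade-off of a Mixture-of-Experts design.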
Gemini 3.1 Pro Release with Extended Context
Google released Gemini 3.1 Pro on February 19-20, 2026, featuring stronger code generation, more stable agent performance, and an extended context length of 1M-2M+ tokens. Reported SWE-bench Verified scores are 77.4% and 80.6%. Free for all users.
Claude Sonnet 4.6 Release with Enhanced Code Processing
Anthropic released Claude Sonnet 4.6 on February 17, 2026, with enhanced code processing and faster long-text reasoning; it also becomes the new default model. It achieved 79.6% on SWE-bench Verified and 72.5% on OSWorld-Verified (near human-level computer operation).
GLM-5 Release: China's Open-Source Agentic Engineering Model
Zhipu AI released GLM-5 as an open-source model in February 2026. The model demonstrated capability to build a complete Game Boy Advance emulator from scratch, making 700+ tool calls and 800+ context switches without human intervention. Signals evolution from 'Vibe Coding' to AI that can autonomously plan, execute, test, and debug.
GPT-5.3-Codex and macOS App Release
OpenAI released the GPT-5.3-Codex programming model in February 2026. It is 25% faster than GPT-5.2-Codex, is designed specifically for coding agents, and scores 69-72% on SWE-bench Verified. OpenAI additionally released a macOS version of the Codex application and the ultra-fast, real-time GPT-5.3-Spark version on February 12, 2026.
git-lrc: Free AI Code Review Tool
Free, unlimited AI code review tool that automatically runs on every commit. Hooks into Git to perform AI review on diffs before changes go live. Acts as a safety layer for AI-generated code, catching logic removals, relaxed constraints, expensive cloud calls, and credential leaks.
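To make the idea of reviewing a diff before it lands more concrete, here is a minimal sketch of one such check: flagging added lines that look like credentials. It is not git-lrc's implementation; the `find_leaks` function, the patterns, and the sample diff are all hypothetical and purely illustrative.

```python
import re

# Illustrative patterns only (assumptions); a real reviewer would use a much larger rule set.
SECRET_PATTERNS = [
    re.compile(r"AKIA[0-9A-Z]{16}"),  # general shape of an AWS access key ID
    re.compile(r"(?i)(password|secret|api_key)\s*=\s*['\"][^'\"]+['\"]"),
]


def find_leaks(diff_text: str) -> list[str]:
    """Return added lines in a unified diff that match a secret pattern."""
    leaks = []
    for line in diff_text.splitlines():
        # Added lines start with '+'; '+++' is the file header, not content.
        if not line.startswith("+") or line.startswith("+++"):
            continue
        added = line[1:]
        if any(p.search(added) for p in SECRET_PATTERNS):
            leaks.append(added.strip())
    return leaks


# Hypothetical diff used only to exercise the function.
SAMPLE_DIFF = """\
--- a/config.py
+++ b/config.py
@@ -1,2 +1,3 @@
 DEBUG = False
+API_KEY = "sk-test-123"
-TIMEOUT = 30
+TIMEOUT = 60
"""

if __name__ == "__main__":
    for leak in find_leaks(SAMPLE_DIFF):
        print("possible credential:", leak)
```

In a real hook, the diff text would typically come from `git diff --cached`, and a non-empty result would block the commit. An AI reviewer goes further than pattern matching, but the control flow is similar.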
Qwen Code: Open-Source Terminal AI Coding Agent
Open-source terminal-based AI coding agent optimized for Qwen3-Coder models. Forked from Gemini CLI, it provides repository-level code understanding, editing, generation, and automated Git operations. Command-line interface for developers who prefer terminal workflows.
Alibaba Cloud Coding Plan adds 4 new models
Alibaba Cloud's Coding Plan updates added support for 4 new programming models: Qwen 3.5-Plus, Qwen 3-Coder-Next, GLM-4.7, and Kimi-K2.5. Users can freely switch between models without changing subscriptions. Reflects trend of multiple coding models becoming available.
ggml.ai joins Hugging Face to ensure long-term progress of Local AI
Georgi Gerganov's ggml.ai, the organization behind llama.cpp, joined Hugging Face. Simon Willison notes that Georgi's March 2023 release of llama.cpp made it possible to run local LLMs on consumer hardware, kicking off the local model movement. The joint effort aims at seamless 'single-click' integration with the transformers library.
Taalas serves Llama 3.1 8B at 17,000 tokens/second
Canadian hardware startup Taalas announced their first product - a custom hardware implementation of Llama 3.1 8B running at 17,000 tokens/second. The 'Silicon Llama' is described as 'aggressively quantized, combining 3-bit and 6-bit parameters.' Demo available at chatjimmy.ai.
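For readers unfamiliar with what 3-bit versus 6-bit parameters means in practice, the sketch below shows generic symmetric uniform quantization in plain Python. It is an assumption-laden illustration, not Taalas's scheme: the function names and the sample weights are hypothetical, and the announcement does not describe the actual method.

```python
def quantize(values, bits):
    """Symmetric uniform quantization to signed integer codes of the given bit width.

    Assumes at least one non-zero value, otherwise the scale would be zero.
    """
    qmax = 2 ** (bits - 1) - 1  # largest code: 3 for 3-bit, 31 for 6-bit
    scale = max(abs(v) for v in values) / qmax
    codes = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return codes, scale


def dequantize(codes, scale):
    """Map integer codes back to approximate float values."""
    return [c * scale for c in codes]


def max_error(values, bits):
    """Largest absolute round-trip error for the given bit width."""
    codes, scale = quantize(values, bits)
    restored = dequantize(codes, scale)
    return max(abs(v - r) for v, r in zip(values, restored))


if __name__ == "__main__":
    weights = [0.9, -0.42, 0.13, 0.05, -0.77]  # hypothetical weights
    for bits in (3, 6):
        print(f"{bits}-bit max error: {max_error(weights, bits):.4f}")
```

Fewer bits means a coarser grid and larger rounding error, which is why mixing bit widths is a common way to spend precision only where it matters.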
Others
Anthropic's 8 Agentic Coding Trends for 2026
Anthropic published its 2026 Agentic Coding Trend Report. Highlights include a shortened software development lifecycle, the evolution from single agents to multi-agent collaboration, agents capable of long-term system building, agents that require human supervision, AI coding becoming accessible to everyone, and the transformation of software development economics.
Peking University Report: 2026 as the 'Agentic Coding Era'
Peking University's AI Xiao Rui Team published a 105-page report identifying 2026 as the first year (元年) of the 'Agentic Coding Era'. The report traces the evolution from Vibe Coding to SPEC Coding to Agentic Coding. According to the report, the shift from code completion to AI autonomously handling complex tasks took less than 100 days.
Fireship: 2026 Tech Trends Predictions
Fireship released its annual '2026 Tech Trends Predictions' video, covering developer job market analysis for 2026, H-1B visa policy changes affecting tech hiring, AI replacement concerns, and the emergence of 'code cleaner' job positions. The video also covers AI coding tools, including an analysis of Cursor 2.0 and MCP servers.
Agentic Engineering Patterns - New documentation project
Simon Willison started a new project to document 'Agentic Engineering Patterns' - coding practices and patterns for getting the best results out of coding agent development. He notes that the biggest challenge in adopting agentic engineering is getting comfortable with the fact that 'writing code is cheap now.'
Red/green TDD for better coding agent results
Simon Willison advocates using Test-Driven Development (TDD) with coding agents. The red/green TDD approach (writing tests first, confirming they fail, then iterating on the implementation until the tests pass) is described as 'a pleasingly succinct way to get better results out of a coding agent.'
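The red/green loop can be sketched in a few lines. This is a minimal illustration using Python's standard `unittest` module; the `slugify` feature, its stub, and the helper names are hypothetical and not taken from the original post.

```python
import io
import unittest


def slugify_stub(text: str) -> str:
    """Red phase: a placeholder so the tests can run and be seen to fail."""
    raise NotImplementedError


def slugify(text: str) -> str:
    """Green phase: the simplest implementation that makes the tests pass."""
    return "-".join(text.lower().split())


def make_suite(func):
    """Build the same test suite against whichever implementation is passed in."""

    class SlugifyTests(unittest.TestCase):
        def test_lowercases_and_joins_words(self):
            self.assertEqual(func("Hello World"), "hello-world")

        def test_collapses_extra_whitespace(self):
            self.assertEqual(func("  a   b "), "a-b")

    return unittest.defaultTestLoader.loadTestsFromTestCase(SlugifyTests)


def run(func) -> bool:
    """Run the suite quietly and report whether every test passed."""
    runner = unittest.TextTestRunner(stream=io.StringIO())
    return runner.run(make_suite(func)).wasSuccessful()


if __name__ == "__main__":
    print("red (stub) passes:", run(slugify_stub))
    print("green (implementation) passes:", run(slugify))
```

Confirming the failure first matters: a test that passes before any implementation exists is not actually checking the new behavior, and an agent iterating against it would get no useful signal.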
Cognition Reaches $10.2B Valuation
Cognition.AI reached a $10.2 billion valuation, making it the highest-valued company in the AI programming sector after raising over $400 million in funding. The company's ARR grew from $1M (September 2024) to $73M (June 2025).
Karpathy Introduces 'Claw' - New AI Architecture Layer Above LLM Agents
Andrej Karpathy coined a new term 'Claw' as a conceptual layer above LLM Agents, forming a framework: Chat → Code → Claw. He explained that Claws take orchestration, scheduling, context, tool calls and persistence to the next level.
MIT Technology Review 2026: 10 Breakthrough Technologies - AI Dominates
MIT Technology Review released their 2026 '10 Breakthrough Technologies' (TR10) list in January 2026. AI-related technologies occupied 4 out of 10 entries: Hyperscale AI Data Centers, AI Mechanistic Interpretability, AI Companionship, and Generative Programming.
The Claude C Compiler: What It Reveals About the Future of Software
Chris Lattner (creator of Swift, LLVM, Clang, Mojo) reviewed Anthropic's Claude C Compiler project. Key insights: 'Good software depends on judgment, communication, and clear abstraction. AI has amplified this.' Lattner notes that 'AI coding is automation of implementation, so design and stewardship become more important.'
Simon Willison's 2026 AI Predictions: LLM-Generated Code No Longer 'Garbage'
Simon Willison, co-creator of Django, published predictions for AI technology trends. He predicts that by the end of 2026, claiming LLM-generated code is 'garbage' will no longer be defensible, and that the proportion of human-written code will drop to single digits.