Agent SDKs are evolving


Is a token crunch coming? This week GitHub paused sign-us for GitHub Copilot Pro, Pro+ and Student plans, tightened up their usage limits, and removed Opus from their Pro plans. And today, Anthropic seemed to remove Claude Code from new Pro plans, though that has been reversed quickly. In general, while this is only seeming to affect individual plans related to coding agents, it could point to an inflection point where AI companies start considering how their pricing matches up to their costs.

In other, more amusing, news the shoe manufacturer Allbirds is pivoting to AI compute infrastructure. Selling inference is hot, I just never expected a move from shoes to GPUs.

​Phil Nash​
Developer relations engineer for
IBM​

πŸ› οΈ Building with AI, Agents & MCP

AI engineer Europe videos

​AI Engineer Europe was a couple of weeks ago and the videos are coming out in this YouTube playlist. There's plenty of content on Coding Agents, OpenClaw, MCP and more. One of my takeaways was on how split the community was on speeding up vs slowing down when writing code with Ryan Lopopolo of OpenAI who banned his team from touching their editors going up against Mario Zechner, creator of the coding agent pi, saying "Slow the **** down". You can also catch my talk on an open-source stack for RAG.

Langflow 1.9 released

The latest version of Langflow is now live, with version 1.9 bringing features like an assistant to help you build custom components, the Flow DevOps Toolkit to help you version, test and deploy flows, and support for connecting Langflow itself to your coding agents via MCP.

What's the future of MCP?

In more video news, the keynote from the MCP Dev Summit has been published. It covers where MCP is going in 2026 including features like the stateless redesign, long running tasks, and skills over MCP.

Cloudflare shared their reference architecture for MCP deployments, covering a lot of detail particularly in the security aspects.

Evolving agent SDKs

Both OpenAI and Cloudflare announced evolutions of their agent SDKs both highlighting how they will help build harnesses for longer running agents. Sandboxes and durable execution both appear as features for each SDK and show where these companies think agent style execution is heading.

On the server side, Google introduced Deep Research Max, an API based agent with comprehensive research capabilities.

🧠 New models

  • Today OpenAI launched GPT-Image-2 with a very cool blog post made up of generated images, including all the text of the post
  • ​Claude Opus 4.7 is available over the API
  • The Qwen team have been rolling out Qwen 3.6 including the 3.6-max-preview as well as smaller open versions
  • ​Kimi K2.6 is claiming to be the best open-source coding model
  • After Llama 4 fell flat, Meta is back with a new model family, starting with Muse Spark. There is a private API preview, but expect to see more on this over the next few weeks
  • Last newsletter we shared some new speech models, and yet more have dropped, this time Grok has released speech-to-text and text-to-speech APIs and Google released Gemini 3.1 Flash TTS with audio tags that can transform the tone of the voice throughout the text

πŸ—žοΈ Other news

  • ​ALTK-evolve is a memory system for agents that helps them learn and use guidelines for decision making from previous executions
  • IBM research released VAKRA, a new benchmark for evaluating how well AI agents reason and act in enterprise-like environments
  • ​KellyBench is another benchmark, this one focused on how agents perform in sports betting markets. I'm not sure we should be optimising for gambling, but at the moment they all lose money.
  • ​I still prefer MCP over Skills​
  • The Slack Engineering team share how to manage context in long-run agentic applications​
  • Using too many tokens? Try caveman mode!

πŸ§‘β€πŸ’» Code & Libraries

  • Check out all the AI examples in this awesome-ai-apps repo​
  • ​Manifest is an open-source smart model router
  • ​Chunky allows you to convert and validate document parsing and chunking
  • OpenWebUI released a desktop app​

πŸ”¦ Langflow Spotlight

One of the biggest additions to Langflow 1.9 is the Langflow Assistant. It generates and updates custom components for you directly within the Playground.

This makes it even easier to build out your own components to complete your flows.

πŸ—“οΈ Events

This week you can find Phil at NDC Sydney talking about how to build an agent harness.

Enjoy this newsletter? Forward it to a friend.

2755 Augustine Dr, 8th Floor, Santa Clara, CA 95054
​Unsubscribe Β· Preferences​

AI++ newsletter

Subscribe for all the latest news for developers on AI, Agents and MCP curated by the Langflow team.

Read more from AI++ newsletter

The big news last week was that Anthropic mistakenly leaked the source code of Claude Code by leaving source maps in the package. Part of the source code referenced Claude Mythos, which has been properly announced this week as a model that’s incredibly good at finding software bugs and creating security exploits. It’s so good that it’s only being shared with 40 partners as part of Project Glasswing. Here’s a quick heads up, this newsletter is going to be changing format soon. As Langflow...

Jensen Huang has declared on a podcast that we have reached AGI. For a very specific definition of AGI that probably doesn't agree with what you might think AGI is. One would have thought that the afterglow of NVIDIA GTC would have provided enough hype for at least the rest of the month. Meanwhile, for those building agents, there has been a lot of talk about CLIs and Agent Skills, and this week we focus on evaluating skills to make sure they do what they are supposed to. WebMCP has been an...

Agents are starting to take on a life of their own, and as agent builders we need to consider the potential outcomes. The story of the autonomous agent that was denied when it opened a pull request to contribute to matplotlib and consequently wrote a hit piece on the maintainer had opinions on the internet veering between a fabrication or the beginning of Skynet. I think the lesson we should be taking away from this is that the the end user of an agent is not the only human that may come into...