AI++ // new releases: agent frameworks, coding sandboxes, and agent builders


If you like building agents that get work done, you're in for a treat in this newsletter. CUGA is a new agent framework that is topping benchmarks and using all sorts of cunning under the hood to help you build better agents that can execute complex tasks.

There is also news on model releases, code execution sandboxes, and the latest podcast episode from The Flow, all on OAuth and MCP.

Phil Nash
Developer relations engineer for Langflow

🛠️ Building with AI, Agents & MCP

IBM Research releases CUGA

CUGA is IBM's ConfigUrable Generalist Agent, an open-source agent framework that you can use to build agents that are good at complex task execution, can use MCP servers, OpenAPI specced API, and custom tools, and has configurable reasoning modes. It topped the AppWorld benchmark and is in second place on the WebArena benchmark, and it has been integrated in Langflow (though that's not been released publicly yet, you can sign up to hear more or build the latest from the main branch). Check out the source code and how to run CUGA directly on GitHub.

Agent builders

There have been a lot of releases of various agent builders, so before you dig into the following news, check out this guide on choosing an AI agent framework in 2025.

Now for the releases: ElevenLabs released Agent Workflows, and Microsoft combined Autogen and Semantic Kernel into the Microsoft Agent Framework. We also compared building a simple agent with Langflow and OpenAI's AgentKit.

New model alerts 🚨

Anthropic released Claude Haiku 4.5 which appears to be faster, cheaper and, on several benchmarks, better than Sonnet 4. Meanwhile, Microsoft AI released their first image model, and Google Deepmind trained a specialized version of Gemini 2.5 Pro on computer use and added grounding via Google Maps.

Claude gets some new skills

Anthropic announced that Claude now supports Skills, where Skills are a collection of markdown content and scripts that the agent can run. It seems to me surprisingly close to MCP, but it is intended to mitigate MCP tools filling the context (something Pulse MCP tried to fix recently with agentic MCP configuration). I enjoyed Simon Willison's take on Skills as well as the Anthropic engineering team's deep dive into creating the feature.

Building with in-browser AI

This in-depth blog post goes into how to build the classic game Guess Who? in the browser using Chrome's built-in Prompt API.

Cloudflare releases Sandboxes

Getting LLMs to generate code is second nature to developers at this point, but if it's part of an application then having a safe sandbox in which to run the untrusted code is important. Sandboxes is Cloudflare's isolated environment to help you build this safely.

How to Use OAuth in MCP with Max Gerber from Stytch

In the latest episode of The Flow we bring on Stytch's Max Gerber for a hands-on tour of how OAuth powers secure AI agents and MCP servers, complete with live demos and practical tips on making your app an OAuth provider so users can safely bring their own agent.

🗞️ Other news

🧑‍💻 Code & Libraries

🔦 Langflow Spotlight

You know that you can build MCP servers in Langflow, but with the release of Langflow 1.6 using MCP servers got an upgrade. The sidebar now stores all the MCP servers you have configured, so you can grab them and use them in any of your flows. There is also a dedicated area to manage your MCP servers. Giving your agents access to great tools has never been easier!

🗓️ Events

The Langflow team are out east in Australia and Malaysia in the next couple of weeks.

October 25th, Brisbane - Catch me at the GDG Devfest in Brisbane talking about AI on the web.

October 29th, Melbourne - Me again, this time talking web AI at the Melbourne GDG meetup.

November 4th, Kuala Lumpur - You can find Tejas speaking about thriving as a professional, particularly leveraging AI, at Infobip Shift Kuala Lumpur.

Enjoy this newsletter? Forward it to a friend.

2755 Augustine Dr, 8th Floor, Santa Clara, CA 95054
Unsubscribe · Preferences

AI++ newsletter

Subscribe for all the latest news for developers on AI, Agents and MCP curated by the Langflow team.

Read more from AI++ newsletter

I normally like to open the newsletter with general news in the world of AI, but I've been heads-down with the rest of the Langflow team working on the newly released Langflow version 1.6. With features like OAuth for MCP, a Docling powered file component, and compatibility with the OpenAI API, it's been worth it. What else has been going on then? OpenAI just hosted their DevDay event with a pile of new releases, while Anthropic had people queue for 2 hours for free hats. Everyone else seems...

Over the last weekend the Langflow team was out at the CascadiaJS conference and Cascadia AI Hackathon. It was inspiring to meet so many developers, work with them on their AI hacks, and see fewer demo hiccups than a Meta product launch. The winning team built a full music sequencer, MIDI keyboard and visualizer that generated beats that could then be edited by hand or further with AI. Congratulations to the winning teams and to everyone who built something and learned something new over the...

Who is naming these model releases? Gemini’s new image editing model is called Nano Banana (and they dropped a great tutorial on how to use it as a developer). Meanwhile, Microsoft AI launched two models in the new MAI series, presumably pronounced “my” because they’re not OpenAI models. At least OpenAI, the creators of gpt-4o-mini and o4-mini, were a bit more sensible with their recent release of gpt-realtime along with the generally available Realtime API. And that’s only some of the newly...