AI++ // Did we reach AGI? Depends on how you define AGI...


Jensen Huang has declared on a podcast that we have reached AGI. For a very specific definition of AGI that probably doesn't agree with what you might think AGI is. One would have thought that the afterglow of NVIDIA GTC would have provided enough hype for at least the rest of the month.

Meanwhile, for those building agents, there has been a lot of talk about CLIs and Agent Skills, and this week we focus on evaluating skills to make sure they do what they are supposed to. WebMCP has been an exciting experiment in the browser, so we have more on what it is and how it differs from MCP. Oh, and if you're a LiteLLM user, sorry if your builds broke today.

Phil Nash
Developer relations engineer for
Langflow

🛠️ Building with AI, Agents & MCP

Supply chain attack

If you're building agents with Python, you might be a user of LiteLLM to give you a unified interface to access multiple LLMs. You might have also discovered that the latest versions of LiteLLM have fallen victim to a supply chain attack. The compromised versions look to have been pulled from PyPI now, but it's a good reminder to stay vigilant with your dependencies.

CLIs and Skills

Agents are good at using CLIs, but can they be better? Justin Poehnelt argues that you should rewrite your CLIs for agents. Don't worry about big rewrites though, many of these patterns can be added incrementally.

Agent Skills are the hot way to expose CLIs to agents, and Angie Jones wrote 3 principals for designing skills. You might want to test or evaluate those skills too, Phil Schmid shared a practical guide to testing skills and Robert Xu at LangChain also wrote up how they evaluate skills. For an automated test, there is also a skill validator to check your skills against the spec.

MCP and WebMCP

Before we get too carried away with skills, we can't forget about MCP. This article makes good arguments that MCP provides much more than token bloat, so please read MCP is dead; long live MCP.

WebMCP is still a new experiment, but there has already been enough confusion that the Chrome team had to write up when to use WebMCP and MCP. And if you want a good primer on what WebMCP can do, check out WebMCP for beginners.

Sandboxes

If an agent can get work done by calling a CLI via a skill or using an MCP tool, then they are increasingly writing their own code. But we should never let them execute that untrusted code in our environments. Sandboxes isolate the untrusted code and protect your data, so we're seeing more options. First up is NVIDIA's OpenShell, released as part of their NemoClaw additions to OpenClaw. LangChain also released LangSmith Sandboxes as a hosted sandbox.

🧠 New models

🗞️ Other news

🧑‍💻 Code & Libraries

Enjoy this newsletter? Forward it to a friend.

2755 Augustine Dr, 8th Floor, Santa Clara, CA 95054
Unsubscribe · Preferences

AI++ newsletter

Subscribe for all the latest news for developers on AI, Agents and MCP curated by the Langflow team.

Read more from AI++ newsletter

When we last published AI++ Anthropic had just launched Fable 5 and everyone was very excited about another step-change in the quality of models. That excitement didn’t last long as the US government issued an export directive that effectively meant Anthropic had to withdraw access. If you missed out, Ethan Mollick wrote about what it was like to work with Fable 5. I found two interesting looks at the world of AI this week. First, Anthropic published the results of 81,000 interviews with...

Claude Mythos is here, except it’s called Fable 5 and comes with a few restrictions. It appears to be the largest model released and, according to the benchmarks, the most accomplished. Even more so than Opus 4.8 that was only released 2 weeks ago. It’s also the most expensive model, so you might want to think twice before swapping it into your RAG support chat bot. While it’s impressive to see the frontier march forward, this week in AI++ we’ll take a look at some of the techniques people...

The last couple of weeks has seen students booing commencement speakers at graduation ceremonies in Florida and Arizona when they mentioned AI. This is a visceral reaction to what they see as a threat to their careers. Meanwhile, developers working with AI are burning out and getting “Brain Fry” from doing more work at higher intensities without the same fulfillment. I wrote a bit about this myself, sharing that I found it hard to be proud of a useful little app that I built. AI is changing...