AI++ // Did we reach AGI? Depends on how you define AGI...


Jensen Huang has declared on a podcast that we have reached AGI. For a very specific definition of AGI that probably doesn't agree with what you might think AGI is. One would have thought that the afterglow of NVIDIA GTC would have provided enough hype for at least the rest of the month.

Meanwhile, for those building agents, there has been a lot of talk about CLIs and Agent Skills, and this week we focus on evaluating skills to make sure they do what they are supposed to. WebMCP has been an exciting experiment in the browser, so we have more on what it is and how it differs from MCP. Oh, and if you're a LiteLLM user, sorry if your builds broke today.

​Phil Nash​
Developer relations engineer for
Langflow​

πŸ› οΈ Building with AI, Agents & MCP

Supply chain attack

If you're building agents with Python, you might be a user of LiteLLM to give you a unified interface to access multiple LLMs. You might have also discovered that the latest versions of LiteLLM have fallen victim to a supply chain attack. The compromised versions look to have been pulled from PyPI now, but it's a good reminder to stay vigilant with your dependencies.

CLIs and Skills

Agents are good at using CLIs, but can they be better? Justin Poehnelt argues that you should rewrite your CLIs for agents. Don't worry about big rewrites though, many of these patterns can be added incrementally.

Agent Skills are the hot way to expose CLIs to agents, and Angie Jones wrote 3 principals for designing skills. You might want to test or evaluate those skills too, Phil Schmid shared a practical guide to testing skills and Robert Xu at LangChain also wrote up how they evaluate skills. For an automated test, there is also a skill validator to check your skills against the spec.

MCP and WebMCP

Before we get too carried away with skills, we can't forget about MCP. This article makes good arguments that MCP provides much more than token bloat, so please read MCP is dead; long live MCP.

WebMCP is still a new experiment, but there has already been enough confusion that the Chrome team had to write up when to use WebMCP and MCP. And if you want a good primer on what WebMCP can do, check out WebMCP for beginners.

Sandboxes

If an agent can get work done by calling a CLI via a skill or using an MCP tool, then they are increasingly writing their own code. But we should never let them execute that untrusted code in our environments. Sandboxes isolate the untrusted code and protect your data, so we're seeing more options. First up is NVIDIA's OpenShell, released as part of their NemoClaw additions to OpenClaw. LangChain also released LangSmith Sandboxes as a hosted sandbox.

🧠 New models

πŸ—žοΈ Other news

πŸ§‘β€πŸ’» Code & Libraries

Enjoy this newsletter? Forward it to a friend.

2755 Augustine Dr, 8th Floor, Santa Clara, CA 95054
​Unsubscribe Β· Preferences​

AI++ newsletter

Subscribe for all the latest news for developers on AI, Agents and MCP curated by the Langflow team.

Read more from AI++ newsletter

Agents are starting to take on a life of their own, and as agent builders we need to consider the potential outcomes. The story of the autonomous agent that was denied when it opened a pull request to contribute to matplotlib and consequently wrote a hit piece on the maintainer had opinions on the internet veering between a fabrication or the beginning of Skynet. I think the lesson we should be taking away from this is that the the end user of an agent is not the only human that may come into...

The last couple of weeks has seen the explosion of OpenClaw (nΓ©e ClawdBot), developers around the world have finally found the agent that acts like the AI they've been promised. I'm personally a little concerned over the security and privacy aspects of letting a powerful agent run wild with an all access pass to your computer and all your data, but I am excited to see the experimentation. It's fun, and maybe a little silly, to see the growth of social media for agents, with Moltbook providing...

It seems I can't look at the internet without seeing talk of Ralph Wiggum or Gas Town. Developers are either running their coding agent in loops or strapping together many parallel coding agents. Coding agents are far and away the most successful agents out there right now, so I always think it's worth keeping an eye on. It's fun to see multiple sub-agents, roles, memory and coordination on one side and a loop on the other. This week in AI++ we have articles on building agent memory,...