AI++ // Did we reach AGI? Depends on how you define AGI...


Jensen Huang has declared on a podcast that we have reached AGI. For a very specific definition of AGI that probably doesn't agree with what you might think AGI is. One would have thought that the afterglow of NVIDIA GTC would have provided enough hype for at least the rest of the month.

Meanwhile, for those building agents, there has been a lot of talk about CLIs and Agent Skills, and this week we focus on evaluating skills to make sure they do what they are supposed to. WebMCP has been an exciting experiment in the browser, so we have more on what it is and how it differs from MCP. Oh, and if you're a LiteLLM user, sorry if your builds broke today.

Phil Nash
Developer relations engineer for
Langflow

🛠️ Building with AI, Agents & MCP

Supply chain attack

If you're building agents with Python, you might be a user of LiteLLM to give you a unified interface to access multiple LLMs. You might have also discovered that the latest versions of LiteLLM have fallen victim to a supply chain attack. The compromised versions look to have been pulled from PyPI now, but it's a good reminder to stay vigilant with your dependencies.

CLIs and Skills

Agents are good at using CLIs, but can they be better? Justin Poehnelt argues that you should rewrite your CLIs for agents. Don't worry about big rewrites though, many of these patterns can be added incrementally.

Agent Skills are the hot way to expose CLIs to agents, and Angie Jones wrote 3 principals for designing skills. You might want to test or evaluate those skills too, Phil Schmid shared a practical guide to testing skills and Robert Xu at LangChain also wrote up how they evaluate skills. For an automated test, there is also a skill validator to check your skills against the spec.

MCP and WebMCP

Before we get too carried away with skills, we can't forget about MCP. This article makes good arguments that MCP provides much more than token bloat, so please read MCP is dead; long live MCP.

WebMCP is still a new experiment, but there has already been enough confusion that the Chrome team had to write up when to use WebMCP and MCP. And if you want a good primer on what WebMCP can do, check out WebMCP for beginners.

Sandboxes

If an agent can get work done by calling a CLI via a skill or using an MCP tool, then they are increasingly writing their own code. But we should never let them execute that untrusted code in our environments. Sandboxes isolate the untrusted code and protect your data, so we're seeing more options. First up is NVIDIA's OpenShell, released as part of their NemoClaw additions to OpenClaw. LangChain also released LangSmith Sandboxes as a hosted sandbox.

🧠 New models

🗞️ Other news

🧑‍💻 Code & Libraries

Enjoy this newsletter? Forward it to a friend.

2755 Augustine Dr, 8th Floor, Santa Clara, CA 95054
Unsubscribe · Preferences

AI++ newsletter

Subscribe for all the latest news for developers on AI, Agents and MCP curated by the Langflow team.

Read more from AI++ newsletter

Working with LLMs is weird, but I never thought it would be as weird as OpenAI having to specifically tell their models not to talk about goblins, gremlins, raccoons, trolls, ogres, or pigeons. It raises so many questions. Thankfully after someone spotted the instructions in the Codex base instructions, OpenAI did give an explanation as to where the goblins came from. They never mentioned why raccoons and pigeons got caught up in the fantasy creature fascination though. In this edition of...

Is a token crunch coming? This week GitHub paused sign-us for GitHub Copilot Pro, Pro+ and Student plans, tightened up their usage limits, and removed Opus from their Pro plans. And today, Anthropic seemed to remove Claude Code from new Pro plans, though that has been reversed quickly. In general, while this is only seeming to affect individual plans related to coding agents, it could point to an inflection point where AI companies start considering how their pricing matches up to their...

The big news last week was that Anthropic mistakenly leaked the source code of Claude Code by leaving source maps in the package. Part of the source code referenced Claude Mythos, which has been properly announced this week as a model that’s incredibly good at finding software bugs and creating security exploits. It’s so good that it’s only being shared with 40 partners as part of Project Glasswing. Here’s a quick heads up, this newsletter is going to be changing format soon. As Langflow...