Ai-Agents

Cloud data center with server racks in colored clusters, a central registry terminal, engineers reviewing approval workflows at workstations

Pinterest's MCP Deployment: 66,000 Monthly Invocations and 7,000 Engineering Hours Saved

Pinterest’s Model Context Protocol rollout hits 66,000 calls per month across 844 active users. It’s the most detailed public case study of MCP at scale. A central registry, two-layer auth, safety reviews, and human checkpoints set this apart from a prototype. The payoff: about 7,000 engineering hours saved each month.

The story comes from Pinterest’s engineering blog post in March 2026 and later coverage by InfoQ . For any team weighing MCP for live use, this rollout is a solid guide.

Claude Code Remote Agents: Dispatch, Scheduled Tasks, and /loop Explained

Claude Code now ships four ways to run agents remotely: Dispatch, Remote Control, Scheduled Tasks, and /loop. Pick the wrong one and you either over-build a simple polling job or under-build something that needs real persistence. Each works at a different layer of the stack. Each has its own lifecycle, infrastructure needs, and rules for what survives a closed terminal or a sleeping laptop.

Dispatch: Send Tasks from Your Phone to Your Desktop

Dispatch launched on March 17, 2026 as a research preview inside Claude Cowork. Open the Claude mobile app, describe a task, and Dispatch routes it to your Claude Desktop instance on your dev machine. Claude Code runs the task locally with your file system, MCP servers, skills, connectors, and any other tools you’ve set up. The result comes back to your phone.

Four distinct robots in a sealed glass workshop, each cabled to one central llama-stamped engine, with an eight-link reliability gauge fading at the end.

Self-Hosted AI Agent Frameworks in 2026: Local-First Compared

A self-hosted AI agent needs to run entirely on your own Ollama or vLLM with no OpenAI key. All four major frameworks claim that support, but only LangGraph and CrewAI wire to a local model with zero workarounds. AutoGen needs a client swap, and Flowise needs one base-URL field. The model, not the framework, is the real reliability ceiling.

Key Takeaways

All four run on Ollama, but only LangGraph and CrewAI need zero workarounds.
The small local model, not the framework, is what breaks tool calling.
Flowise is the only true no-code pick; LangGraph is the most code-heavy.
Most framework docs still assume an OpenAI key, so budget setup time.
Use Qwen3 or larger for agents; smaller models drop tool calls under load.

Why Local-First Fitness Is the Axis That Counts

Most “best agent framework” roundups assume you have an OpenAI key and a credit card. The first code sample spins up a hosted client, and the “swap to local” path is a footnote if it shows up at all. Self-hosters ask a sharper question about whether any of these run on their own box with no cloud call.

Dark server room at night with racks of glowing servers and a terminal showing red terraform destroy text

When Claude Code Ran terraform destroy on Production - The DataTalks.Club Incident

On February 26, 2026, Claude Code ran terraform destroy against a stale state file. It wiped 2.5 years of DataTalks.Club production data: the RDS database, VPC, ECS cluster, load balancers, and every automated snapshot. Four cascading failures, each one preventable, took down a platform serving 100,000 learners.

Alexey Grigorev runs DataTalks.Club , a data engineering school with over 100,000 learners. He lost 1,943,200 rows of homework, project entries, and leaderboard scores when Claude Code ran the command against his whole production stack. The database, the VPC, the ECS cluster, load balancers, bastion host, and every automated snapshot were gone in seconds.

A lightning-bolt-shaped racing vehicle speeds across a landscape of terminal windows while small subagents fan out and a rocket waits on a launchpad.

Gemini 3.5 Flash: 76% on Terminal-Bench, 4x Faster Output

Google released Gemini 3.5 Flash on May 19, 2026. The fast, lower-cost tier scored 76.2% on Terminal-Bench 2.1 and, by Google’s own measure, generates output about 4 times faster than other frontier models. Flash is available today across the Gemini app, Search, and the API. Gemini 3.5 Pro is confirmed for next month.

Key Takeaways

Gemini 3.5 Flash launched on May 19, 2026 and is free to use in the Gemini app and Google Search.
It scored 76.2% on Terminal-Bench 2.1, a test of finishing real terminal tasks end to end.
Google says Flash produces output about 4 times faster than rival frontier models.
The model is built for agents that run long, multi-step jobs and call tools.
Gemini 3.5 Pro, the larger sibling, is confirmed for next month.

What is Gemini 3.5 Flash?

Gemini 3.5 Flash is Google’s new fast, lower-cost tier of the Gemini 3.5 family. It was announced and made generally available on May 19, 2026, according to the Google announcement post . The “Flash” name has always meant a model tuned for speed and price.

Claude Agent SDK: Build Custom AI Agents Without Reinventing the Orchestration Layer

The Claude Agent SDK is the Claude Code engine stripped down to a library. Same agent loop, same built-in tools, same context handling, but you call it from your own Python or TypeScript code instead of the CLI. If you’ve used Claude Code to read files, run shell commands, search codebases, and edit code, the SDK points that same machinery at any problem you want. No human needs to sit in the loop.