FOCUS.AI

EST. 24

Report from Microsoft Build 2025

Microsoft is betting big on an open, agent-powered web—where protocols like MCP, A2A, and NLWeb redefine how AI and services interact. The real opportunity in AI isn’t just smarter models, but the “capability overhang” waiting to be unlocked by better reasoning and open standards.

Report from Microsoft Build 2025

Look at what the web has become. Can you imagine if a company had tried to own parts of it? Locking it in with browser features that only they controlled? We would never have the breadth and width and amazing things that we have in the web now. — Not actually a quote but they kept repeating that sentiment

Microsoft is hammering home how much they want things to be open. They are not like the borg-ro-soft that they used to be. This new, open agentic web is the future, they are using open protocols, putting out open protocols, and hoping that you’ll deploy everything on Azure.

azure datacenters have enough fiber each to go around the world 4.5 times

And at the end of the keynote, I’m keyed up to deploy everything on AI Foundry. (Haven’t even signed up yet, but it answers a lot of questions about security and evaluations that are hard to answer otherwise.)

Reasoning and the Capability Overhang

2025 may be the year of AI Agents, but its reasoning that really has unlocked all of this functionality. Reasoning, which is like telling the models to “think real hard about this” or — if you prefer — to move some of the work from the pre-training period to increasing inference time activity. This is so powerful that now they’ve branded a term to explain how much more awesome the models are compared to the production build on it.

i say the productivity gains of deploying ai haven’t been recognized, you say that the models are just so amazingly awesome that products haven’t had time to catch up. tomato, tomahto

So that area between the solid line and the dotted line is our opportunity.

  • models are more powerful than we are collectively using them for
  • set the ambition level to 11, build for the future
  • put more reasoning into everything, assume that it will be there

Reasoning is something that we tend to use sparingly. It costs so much more, where the non-reasoning models are if not “free”, significantly cheaper. I could blow $50 on one deep coding session for example, in a major hack session. Things that are just wrappers around some simple tools it would take months and months to go through $10. So there’s a cost trade-off now, and the compute providers are advocating not worrying about that cost, since it will soon come down..

Agents and MCP everywhere

What is an agent? In client conversations, we use the word agent to mean, roughly, “a model and prompt and a bunch of ways that we throw stuff together”. (Similar to the Pydantic, Agent is a Container of these things definition.) Or “An agent is something that acts in an environment; it does something. Agents include worms, dogs, thermostats, airplanes, robots, humans, companies, and countries.”

In the Microsoft parlance an agent is “something that you assign tasks to”, shifting the idea from “what is it” to “why would I care about it”. I like this reframing. There’s 3 levels that they are rocking:

  • Level 1: Ask a question, Model Answers
  • Level 2: Assign a task, AI executes
  • Level 3: Assign a task, AI coordinations with other AIs to complete the task

And the the open agentic web is the vision of how Microsoft thinks we’ll get there.

How it works

MCP (Anthropic) is how make services available to agents. A2A (Google) is how Agents interact with each other. NLWeb (Microsoft) is how agents discover each other.

So if you want to participate into this new open agentic web, you basically

  • Implement a MCP server with specific tools that understand your business services
  • Use NLWeb to add authorization and also security and (sort of) discoverability
  • Expose more complex services by calling other agents over A2A.

NLWeb

Natural Language Web is a fancy way of saying:

  • Websites should have an /ask endpoint that you can hit that lets you run a query
  • Websites should have an /mcp endpoint that you can hit that lets you plug them into a model

So what does this mean? The ask endpoint means that you need to run the model, so it’ll cost you money, but on the other hand you can control the agent’s prompt. For mcp you don’t need to bear the cost of the model, so that’s nice.

We’ll see how they all get adopted.

Finally, fashion

These Indian-American’s are crushing it. Satya Nadella, Jay Parikh. Looking sleek, fit and just a whole lot of warrior energy. There’s no receding hairline, its a immaculately shaved bald head. There were a series of people coming up during the keynote to do the demos, again mainly Indian — with clear yet pronounced accents of being raised abroad — everyone fit, trim and completely put together. You got the sense that they had a very structured day, very deliberately and organized which was reflected in their appearance.

In the beginning there was some ruckus with shouting and yelling in the audience. Satya didn’t hesitate or get distracted. In the second keynote there was some more brief yelling, and Jay calmly paused for a half a minute and then continued unwavered.

Fashion wise, excepting the women who were colorful, the men tended to be in simple cut clothes, natural fabrics, dark jeans. Back sneakers with white soles. The uniform of the new era.

Repos

Release notes

Will Schenk May 21, 2025

Related Posts

Thoughts on gemini

Thoughts on gemini

Despite popular narratives about Google lagging in AI, their Gemini models reveal engineering excellence that's hard to ignore when you strip away the conservative product decisions and UI polish. From the lightweight yet powerful Gemma 3 to the multimodal capabilities of Gemini 2.5, Google's models demonstrate a level of speed, precision, and fundamental understanding that suggests they're not playing catch-up—they're just being cautious.

VIEW POST

Subscribe to our newsletter

Archived issues

What we offer

2024

01.
AI DIGITAL TRANSFORMATION USING BEST IN CLASS AI
02.
PRODUCT DESIGN AND ENGINEERING
03.
MODELING AND DATA INTERPRETATION FOR AI TUNING
04.
UPDATING OLD MOBILE AND WEB APPS FOR AI

CONTACT US TO GET STARTED.