MCP Protocol Explained: Ecosystem, Servers & Production Guide (2026)

Model Context Protocol (MCP) is an open standard developed by Anthropic that provides a universal interface for AI models to connect to external tools, data sources, and services. If you have been following the agentic AI space, you have heard MCP called "the USB standard for AI" — and the analogy is accurate. Before USB, every device had its own proprietary connector. MCP does the same thing for AI integrations: one standard protocol, any AI model, any tool.

MCP grew rapidly after its November 2024 release and was donated by Anthropic in December 2025 to the Agentic AI Foundation, a Linux Foundation directed fund co-founded with Block and OpenAI. OpenAI, Google, Microsoft Azure, and dozens of enterprise platforms have adopted it. This is not a niche tool — it is becoming the default way AI agents interact with the outside world.

What Problem Does MCP Solve?

Before MCP, connecting an AI agent to external tools required custom integration code for every model-tool combination.

If you wanted your GPT-4o agent to access a CRM, you wrote custom function definitions, request handling, and response parsing. If you then wanted Claude to access the same CRM, you wrote a different set of integration code. Switching models or adding tools meant rewriting integrations.

MCP eliminates this by providing a standardized contract between AI models (clients) and external tools (servers). Build an MCP server once for your CRM, and any MCP-compatible AI model can use it. Swap GPT-4o for Claude — no integration changes needed. Add a new AI model next year — it works automatically if it supports MCP.

Before MCP

GPT-4o → Custom integration → CRM
GPT-4o → Custom integration → Database
GPT-4o → Custom integration → Email
Claude → Different custom integration → CRM
Claude → Different custom integration → Database
Claude → Different custom integration → Email

6 custom integrations for 2 models and 3 tools. The number of integrations grows as models × tools.

With MCP

GPT-4o → MCP Client → MCP Server → CRM
 → Database
 → Email
Claude → MCP Client → MCP Server → CRM
 → Database
 → Email

3 MCP servers, and any MCP-compatible model can use all of them. Adding a new model requires zero additional integration work.

How MCP Works

MCP uses a client-server architecture inspired by the Language Server Protocol (LSP) that powers code editors like VS Code.

Core components

MCP Host — The application that runs the AI model and needs to access external tools. This could be a chat interface, an agentic workflow, or an IDE.

MCP Client — The protocol layer inside the host that communicates with MCP servers. It discovers available tools, sends requests, and processes responses.

MCP Server — A lightweight service that exposes specific tools or data sources through the MCP standard. Each server provides a defined set of capabilities.

Communication flow

The MCP client connects to one or more MCP servers
The client asks each server to list its available tools (capabilities discovery)
The AI model receives the tool descriptions and decides which tools to call
The client sends tool execution requests to the appropriate server
The server executes the action and returns structured results
The AI model uses the results to continue reasoning

What MCP servers expose

MCP servers can provide three types of capabilities:

Tools — Functions the AI can call to perform actions (search, create records, send emails, run queries). These are the most common capability.

Resources — Data the AI can read but not modify (documents, database records, configuration files). Resources provide context without allowing side effects.

Prompts — Pre-built prompt templates that the server provides to guide the AI's behavior for specific tasks. These help ensure consistent interactions with complex tools.

MCP vs Function Calling

If you have built AI agents before, you are already familiar with function calling (tool use) in GPT-4o, Claude, or Gemini. MCP builds on top of function calling, not replaces it.

Dimension	Native Function Calling	MCP
Scope	Model-specific tool definitions	Universal standard across models
Discovery	You define tools in the prompt	Client auto-discovers tools from servers
Portability	Tied to one model's API format	Works across any MCP-compatible model
Authentication	You manage per-tool auth	MCP server handles auth internally
Transport	HTTP/API	JSON-RPC over stdio, HTTP/SSE, or WebSocket
Ecosystem	Custom per project	Growing ecosystem of pre-built MCP servers

Think of function calling as the low-level mechanism and MCP as the high-level standard that makes function calling portable and discoverable.

MCP vs A2A vs ACP

MCP is not the only protocol in the agentic AI space. Understanding how the three main protocols relate helps you choose the right one.

Model Context Protocol (MCP)

What it does: Agent-to-tool communication. Connects AI agents to external tools and data.

Created by: Anthropic (donated to the Linux Foundation's Agentic AI Foundation in December 2025)

Use when: You need your agent to use external tools, APIs, databases, or data sources.

Agent-to-Agent Protocol (A2A)

What it does: Agent-to-agent communication. Enables different AI agents to discover each other's capabilities and coordinate work.

Created by: Google

Use when: You have multiple agents (potentially from different vendors) that need to collaborate on a task. For example, a sales agent from one system needs to coordinate with an inventory agent from another system.

Agent Communication Protocol (ACP)

What it does: Standardized message exchange between agents. Focuses on how agents negotiate task ownership and share context.

Created by: Community-driven, gaining traction in enterprise environments

Use when: You need detailed control over how agents hand off work, share state, and negotiate responsibilities.

How they work together

These protocols are complementary, not competing:

MCP handles the vertical integration — connecting agents to tools and data
A2A handles the horizontal integration — connecting agents to each other
ACP handles the messaging layer — standardizing how agents communicate

A production multi-agent system might use MCP for tool access, A2A for agent discovery, and internal orchestration (via LangGraph or CrewAI) for coordination logic.

Building an MCP Server

An MCP server is surprisingly simple to build. Here is a minimal example in Python that exposes a weather lookup tool.

from mcp.server import Server
from mcp.types import Tool, TextContent

server = Server("weather-server")

@server.list_tools()
async def list_tools():
 return [
 Tool(
 name="get_weather",
 description="Get the current weather for a city",
 inputSchema={
 "type": "object",
 "properties": {
 "city": {
 "type": "string",
 "description": "City name (e.g., Houston, TX)"
 }
 },
 "required": ["city"]
 }
 )
 ]

@server.call_tool()
async def call_tool(name: str, arguments: dict):
 if name == "get_weather":
 city = arguments["city"]
 weather_data = await fetch_weather_api(city)
 return [TextContent(
 type="text",
 text=f"Weather in {city}: {weather_data['temp']}°F, {weather_data['condition']}"
 )]

if __name__ == "__main__":
 server.run()

This server:

Registers a get_weather tool with a description and input schema
When called, fetches weather data and returns it as structured text
Any MCP-compatible AI model can discover and use this tool automatically

MCP server in TypeScript

import { Server } from "@modelcontextprotocol/sdk/server";
import { StdioServerTransport } from "@modelcontextprotocol/sdk/server/stdio";

const server = new Server({
 name: "weather-server",
 version: "1.0.0"
});

server.setRequestHandler("tools/list", async () => ({
 tools: [{
 name: "get_weather",
 description: "Get the current weather for a city",
 inputSchema: {
 type: "object",
 properties: {
 city: { type: "string", description: "City name" }
 },
 required: ["city"]
 }
 }]
}));

server.setRequestHandler("tools/call", async (request) => {
 if (request.params.name === "get_weather") {
 const { city } = request.params.arguments;
 const data = await fetchWeatherAPI(city);
 return {
 content: [{ type: "text", text: `Weather in ${city}: ${data.temp}°F` }]
 };
 }
});

const transport = new StdioServerTransport();
await server.connect(transport);

MCP in Production: Key Considerations

Security

MCP servers handle real tools with real consequences — database writes, email sends, financial transactions. Production MCP implementations need:

Authentication — Verify the identity of the MCP client before granting access
Authorization — Define which tools each client can access and what parameters are allowed
Input validation — Sanitize all inputs before executing tool logic
Audit logging — Log every tool invocation with timestamps, inputs, outputs, and the requesting agent
Rate limiting — Prevent runaway agents from overwhelming your systems

Transport options

MCP supports multiple transport mechanisms:

Transport	Best For	Latency
stdio	Local development, CLI tools	Lowest
HTTP/SSE	Web applications, remote servers	Low-moderate
WebSocket	Real-time bidirectional communication	Low

For most production deployments, HTTP/SSE provides the best balance of performance and compatibility.

Error handling

Agents need to handle tool failures gracefully. A well-designed MCP server returns structured errors that help the agent decide whether to retry, try an alternative approach, or escalate to a human.

@server.call_tool()
async def call_tool(name: str, arguments: dict):
 try:
 result = await execute_tool(name, arguments)
 return [TextContent(type="text", text=result)]
 except RateLimitError:
 return [TextContent(type="text", text="Rate limit exceeded. Try again in 60 seconds.")]
 except AuthorizationError:
 return [TextContent(type="text", text="Not authorized to perform this action.")]
 except Exception as e:
 return [TextContent(type="text", text=f"Tool execution failed: {str(e)}")]

The MCP Server Ecosystem in 2026

When we first wrote this post in February, "the MCP ecosystem" meant a few dozen servers in the official modelcontextprotocol/servers GitHub repo and a handful of vendor-built ones. By May 2026 that has changed completely. There are now hundreds of MCP servers in active use across the official repo, vendor-maintained registries, and community marketplaces. The interesting question is no longer "does an MCP server exist for X?" — it almost always does — but "which of the three competing servers for X should I run, and how do I keep it pinned without breaking my agents?"

Here is the operator's tour of the May 2026 ecosystem, organized by category. We have run most of these in production for clients or internally; the gotchas are real.

Databases and data warehouses

— the canonical Postgres server (the original lived in modelcontextprotocol/servers and was moved to modelcontextprotocol/servers-archived in May 2025 alongside other early reference servers; community-maintained forks and several vendor-built alternatives are now the active path), read-only by default with a connection string. Great for letting agents run analytical queries against a replica. Gotcha: it does not enforce row-level security on its own; point it at a read replica with a least-privilege user.
— official MongoDB-maintained server that supports both Atlas and self-hosted clusters. Exposes find, aggregate, listCollections, and (optionally) write ops behind a flag.
— warehouse-side servers for analytics agents. Watch the cost: an agent that runs SELECT * against a 2 TB table will happily burn your Snowflake credits.

Developer tools

— file read/write/list scoped to an allowlisted root. This is the single most used MCP server in our practice; almost every coding agent gets it.
— git log, git diff, git show, branch ops. Pair with the filesystem server for code review agents.
— official GitHub integration covering issues, PRs, comments, and repo search. Requires a PAT; scope it carefully because the server runs with whatever permissions that token has.
— arbitrary shell command execution. Powerful and dangerous. We allow this only in sandboxed dev containers, never on a developer's host machine without an allowlist.

Browsers and web automation

— headless Chromium driven by Puppeteer. Reliable for scraping and form-filling, slow to spin up (~1-2s per launch).
— Microsoft's Playwright-based server with better selector handling and accessibility tree support. Our default for browser agents in 2026.
— wraps the Browser-Use library, optimized for vision-language models that want to "see" the page rather than parse the DOM.

Productivity and SaaS

— official Notion server, supports page reads, database queries, and writes. Auth via internal integration token.
— official Linear server. Issues, projects, comments, cycles. Excellent schemas; the descriptions are tuned for LLM consumption.
— read channels, post messages, search history. The write permission is the one that gets people in trouble; start read-only.

Cloud infrastructure

— the official AWS MCP server suite. Separate servers for cost analysis, CloudWatch logs, S3, Lambda, ECS. Authenticate with standard AWS credentials.
— Google Cloud equivalent covering GCS, BigQuery, Cloud Run, and IAM listings.
— Cloudflare-maintained server for Workers, KV, R2, and zone management.

Search and retrieval

— Brave Search API wrapper. Cheap, decent quality, no scraping.
— Tavily search and extract. Optimized for agentic workflows that want clean Markdown back, not raw HTML.
— neural search; better for "find papers about X" style queries than keyword-heavy ones.

MCP marketplaces and discovery

A directory pattern emerged in 2025 and matured through 2026. Today there are three main ways to discover and install MCP servers:

The official GitHub repo — hosts the reference implementations. This is the source of truth for "is there an official server for X?" and the servers here get the most testing.
Smithery — is the most-used community registry: searchable index, one-line install commands, version metadata, and basic security ratings. Think of it as npm for MCP servers. It tends to host more experimental and community-built servers than the official repo.
Claude's built-in MCP marketplace — lets users browse and install vetted MCP servers directly from the Claude Desktop UI without touching a config file. Great for non-technical users; the vetted catalog is smaller than Smithery's.

For TypeScript-based servers, npm install is also fully supported — most servers publish to npm under scopes like @modelcontextprotocol/* or vendor-specific scopes (@notionhq/*, @cloudflare/*). Python servers are typically distributed via uvx or pipx.

Build vs Buy: Which MCP Server Should You Write?

The single most common mistake we see in 2026 MCP projects is teams building an MCP server when they should have adopted one. The second most common is the reverse — adopting a generic community server when a 200-line custom wrapper would have been safer and faster.

Here is the decision framework we use with clients.

Buy (adopt an existing server) when

A reputable official or popular community server exists for the tool. Postgres, MongoDB, GitHub, Slack, Notion, Linear, AWS, GCP, Cloudflare, Brave Search — these all have well-maintained servers that you should not be rewriting. Adopting one is typically a 30-minute job: install, configure auth, add to your client's MCP config.
The tool's API is stable and public. If the upstream API changes rarely and the existing server tracks it, you inherit free maintenance.
Your use case is read-heavy or covers standard operations. Stock servers cover 80% of standard CRUD; that is usually all you need.

Build (write your own) when

You have a proprietary internal API. Your billing service, your internal feature flag system, your domain-specific data pipeline — no one else has these. A thin MCP server wrapping your existing REST or gRPC endpoints is the right answer.
You need a security-isolated wrapper around a public service. Sometimes the official server exposes too much surface area. For example, the GitHub MCP server with a broad PAT can do almost anything; a custom wrapper that only exposes read_pr_comments and post_review_comment against a single repo is far safer for an automated reviewer agent.
You need domain-specific tool descriptions. A generic Postgres server exposes query and list_tables. An MCP server for your specific analytics warehouse can expose get_revenue_by_cohort(start_date, end_date, segment) — a tool description tuned to your business will dramatically improve agent reliability versus making the agent compose SQL from scratch.
You need server-side caching, rate-limit handling, or batching. Wrapping a flaky third-party API in your own MCP server lets you add retries, cache responses, and merge multiple agent calls into one upstream request.

Real cost numbers

From our practice in 2026:

Adopting an existing server is typically 30 minutes to a few hours: install, configure credentials, add to your MCP client config, test with a real agent task, document for the team.
Building a custom MCP server from scratch is typically 1-3 days of work for a senior engineer for a small, well-scoped server (5-15 tools, one upstream API, basic auth). This includes the server code, integration tests, dockerization, and deployment to your internal infrastructure.
Building and maintaining a large internal MCP platform — multiple servers, shared auth, audit logging, monitoring — is a multi-week engineering project. Most teams should not do this until they have at least 3-4 custom servers in production.

The trap to avoid: building a custom server for a tool that already has a perfectly good community one, "because we wanted to control it." You end up with a worse server and a maintenance liability.

Production Gotchas We Have Hit (and You Will)

The protocol is simple. Running MCP in production is where reality intrudes.

The security model is "your local permissions"

MCP servers, especially those running over stdio on a developer's machine, execute with the full permissions of the user who launched the host. That means a filesystem server with a misconfigured root can read your SSH keys. A shell server can rm -rf your home directory. A GitHub server with a broad PAT can delete repositories.

The mitigations are not exotic, but they have to be deliberate:

Default to read-only servers whenever the use case allows. Most agentic workflows do not need write access on day one.
Run write-capable servers in containers or sandboxes — even on developer machines. A Docker container with mounted scopes is far safer than direct host access.
Scope credentials tightly. A GitHub PAT for an MCP server should be repo-specific, not org-wide. AWS credentials should be role-based with the narrowest IAM policy that works.
Audit log every tool call. For production servers, log who called what with which arguments. When something breaks (or worse, when someone exfiltrates data), the log is the difference between a five-minute investigation and a five-day one.

Tool sprawl makes agents worse, not better

Counterintuitive but well-documented in 2026: an agent with 50 MCP tools available is less reliable than the same agent with 10. The model has to discriminate between similar-looking tools, picks the wrong one, and burns context window on tool descriptions it never uses.

Our heuristic: aim for 5-15 tools per agent. If you find yourself wiring 30+ tools into a single agent, you probably want to split the work across multiple specialist agents (each with its own focused MCP server set) rather than one generalist with everything available.

Version pinning is not optional

The MCP server ecosystem is moving fast, and breaking changes happen — sometimes intentionally (a tool gets renamed for clarity), sometimes accidentally (an argument schema changes shape). If your production agent depends on notion-mcp-server and you let it auto-upgrade, you will eventually wake up to an agent that suddenly fails on every Notion call.

Pin versions explicitly in your MCP client config. For npm-distributed servers, that means @notionhq/notion-mcp-server@1.4.2, not latest. For Python servers installed via uvx, use the --from flag with a specific version tag. Treat MCP server upgrades like any other dependency upgrade: bump in a branch, run agent regression tests, then roll out.

Latency: synchronous round-trips compound

Every MCP tool call is a synchronous round-trip: the agent emits a tool-use, the client routes it to the server, the server executes, the result returns, the agent thinks again. Even with a fast local server, you are looking at. Remote HTTP/SSE servers add network latency on top.

For agents that need to do dozens of small operations, this compounds badly. Two patterns that work:

Batch operations into single tool calls. Instead of a read_file tool that the agent calls 20 times, expose a read_files(paths: string[]) tool. The agent picks the paths it wants, you do the I/O once.
Prefer expressive resources over chatty tools. If an agent needs a project's full directory tree, exposing it as a single resource the agent reads once beats making it call list_dir recursively.

Version compatibility across clients

The MCP spec is stable but evolving. Clients implement different revisions: Claude Desktop tends to be on the latest spec, Cursor caught up rapidly in early 2026, and ChatGPT's Apps SDK ships full tool support on top of MCP plus its own UI component model — though resource and prompt support is still maturing relative to the reference implementations. If you build a server that uses MCP resources or sampling, test it against every client your users will run it under.

Which AI clients support MCP (May 2026)

Client	MCP Support Level	Notes
Claude Desktop	Full	The reference implementation. Tools, resources, prompts, sampling.
Claude Code	Full	First-class MCP support in the CLI; configured via `~/.claude/settings.json`.
Cursor	Full	Added native MCP support in early 2026; configured in Cursor settings.
ChatGPT (Apps SDK)	Tools + UI components	Built on MCP; full tool support plus Apps SDK-specific UI component rendering. Resources and prompts are still maturing.
Cline	Full	Open-source VS Code agent; one of the earliest non-Anthropic MCP clients.
Continue	Full	Open-source IDE assistant for VS Code and JetBrains.
Zed	Full	The Zed editor supports MCP servers via its assistant panel.
Goose (Block)	Full	Block's open-source agent runtime; heavy MCP integration.
Windsurf (Codeium)	Partial / Full	MCP support added during 2026; check current docs for parity with Cursor.
LibreChat, Open WebUI	Community plugins	Available via community extensions rather than first-party.

If you are choosing a client for your team based on MCP needs, see our breakdown in Claude vs GPT vs Gemini (2026) and Cursor vs GitHub Copilot vs Windsurf.

Before building a custom MCP server, check whether a pre-built one already exists for your use case. The ecosystem is growing weekly.

When to Use MCP

MCP makes the most sense when:

You are building agents that need to access multiple external tools
You want model portability — the ability to switch LLMs without rewriting integrations
You are building a platform where multiple agents or users need access to the same tools
You want to leverage the growing ecosystem of pre-built MCP servers

MCP adds unnecessary complexity when:

Your agent uses a single tool with a simple API call
You are locked into one model provider with no plans to switch
Your use case does not involve external tool use

Getting Started with MCP

Start with pre-built servers — Check the MCP ecosystem for existing servers that match your tools. Most common integrations (databases, CRMs, communication tools) already have MCP servers.
Build custom servers for proprietary tools — If your agent needs access to internal APIs or custom business logic, build a lightweight MCP server following the pattern above.
Test with multiple models — Verify that your MCP servers work correctly with different AI models (GPT-4o, Claude, Gemini) to confirm true portability.

If you are building AI agents and want to implement MCP-based tool integration, ZTABS can help. Our team has production experience with MCP, LangChain, and custom agent architectures across 25+ industries. Contact us for a free consultation.

What Problem Does MCP Solve?

Before MCP, connecting an AI agent to external tools required custom integration code for every model-tool combination.

Before MCP

GPT-4o → Custom integration → CRM
GPT-4o → Custom integration → Database
GPT-4o → Custom integration → Email
Claude → Different custom integration → CRM
Claude → Different custom integration → Database
Claude → Different custom integration → Email

6 custom integrations for 2 models and 3 tools. The number of integrations grows as models × tools.

With MCP

GPT-4o → MCP Client → MCP Server → CRM
 → Database
 → Email
Claude → MCP Client → MCP Server → CRM
 → Database
 → Email

3 MCP servers, and any MCP-compatible model can use all of them. Adding a new model requires zero additional integration work.

How MCP Works

MCP uses a client-server architecture inspired by the Language Server Protocol (LSP) that powers code editors like VS Code.

Core components

MCP Host — The application that runs the AI model and needs to access external tools. This could be a chat interface, an agentic workflow, or an IDE.

MCP Client — The protocol layer inside the host that communicates with MCP servers. It discovers available tools, sends requests, and processes responses.

MCP Server — A lightweight service that exposes specific tools or data sources through the MCP standard. Each server provides a defined set of capabilities.

Communication flow

The MCP client connects to one or more MCP servers
The client asks each server to list its available tools (capabilities discovery)
The AI model receives the tool descriptions and decides which tools to call
The client sends tool execution requests to the appropriate server
The server executes the action and returns structured results
The AI model uses the results to continue reasoning

What MCP servers expose

MCP servers can provide three types of capabilities:

Tools — Functions the AI can call to perform actions (search, create records, send emails, run queries). These are the most common capability.

Resources — Data the AI can read but not modify (documents, database records, configuration files). Resources provide context without allowing side effects.

Prompts — Pre-built prompt templates that the server provides to guide the AI's behavior for specific tasks. These help ensure consistent interactions with complex tools.

MCP vs Function Calling

If you have built AI agents before, you are already familiar with function calling (tool use) in GPT-4o, Claude, or Gemini. MCP builds on top of function calling, not replaces it.

Dimension	Native Function Calling	MCP
Scope	Model-specific tool definitions	Universal standard across models
Discovery	You define tools in the prompt	Client auto-discovers tools from servers
Portability	Tied to one model's API format	Works across any MCP-compatible model
Authentication	You manage per-tool auth	MCP server handles auth internally
Transport	HTTP/API	JSON-RPC over stdio, HTTP/SSE, or WebSocket
Ecosystem	Custom per project	Growing ecosystem of pre-built MCP servers

Think of function calling as the low-level mechanism and MCP as the high-level standard that makes function calling portable and discoverable.

MCP vs A2A vs ACP

MCP is not the only protocol in the agentic AI space. Understanding how the three main protocols relate helps you choose the right one.

Model Context Protocol (MCP)

What it does: Agent-to-tool communication. Connects AI agents to external tools and data.

Created by: Anthropic (donated to the Linux Foundation's Agentic AI Foundation in December 2025)

Use when: You need your agent to use external tools, APIs, databases, or data sources.

Agent-to-Agent Protocol (A2A)

What it does: Agent-to-agent communication. Enables different AI agents to discover each other's capabilities and coordinate work.

Created by: Google

Agent Communication Protocol (ACP)

What it does: Standardized message exchange between agents. Focuses on how agents negotiate task ownership and share context.

Created by: Community-driven, gaining traction in enterprise environments

Use when: You need detailed control over how agents hand off work, share state, and negotiate responsibilities.

How they work together

These protocols are complementary, not competing:

MCP handles the vertical integration — connecting agents to tools and data
A2A handles the horizontal integration — connecting agents to each other
ACP handles the messaging layer — standardizing how agents communicate

A production multi-agent system might use MCP for tool access, A2A for agent discovery, and internal orchestration (via LangGraph or CrewAI) for coordination logic.

Building an MCP Server

An MCP server is surprisingly simple to build. Here is a minimal example in Python that exposes a weather lookup tool.

from mcp.server import Server
from mcp.types import Tool, TextContent

server = Server("weather-server")

@server.list_tools()
async def list_tools():
 return [
 Tool(
 name="get_weather",
 description="Get the current weather for a city",
 inputSchema={
 "type": "object",
 "properties": {
 "city": {
 "type": "string",
 "description": "City name (e.g., Houston, TX)"
 }
 },
 "required": ["city"]
 }
 )
 ]

@server.call_tool()
async def call_tool(name: str, arguments: dict):
 if name == "get_weather":
 city = arguments["city"]
 weather_data = await fetch_weather_api(city)
 return [TextContent(
 type="text",
 text=f"Weather in {city}: {weather_data['temp']}°F, {weather_data['condition']}"
 )]

if __name__ == "__main__":
 server.run()

This server:

Registers a get_weather tool with a description and input schema
When called, fetches weather data and returns it as structured text
Any MCP-compatible AI model can discover and use this tool automatically

MCP server in TypeScript

import { Server } from "@modelcontextprotocol/sdk/server";
import { StdioServerTransport } from "@modelcontextprotocol/sdk/server/stdio";

const server = new Server({
 name: "weather-server",
 version: "1.0.0"
});

server.setRequestHandler("tools/list", async () => ({
 tools: [{
 name: "get_weather",
 description: "Get the current weather for a city",
 inputSchema: {
 type: "object",
 properties: {
 city: { type: "string", description: "City name" }
 },
 required: ["city"]
 }
 }]
}));

server.setRequestHandler("tools/call", async (request) => {
 if (request.params.name === "get_weather") {
 const { city } = request.params.arguments;
 const data = await fetchWeatherAPI(city);
 return {
 content: [{ type: "text", text: `Weather in ${city}: ${data.temp}°F` }]
 };
 }
});

const transport = new StdioServerTransport();
await server.connect(transport);

MCP in Production: Key Considerations

Security

MCP servers handle real tools with real consequences — database writes, email sends, financial transactions. Production MCP implementations need:

Authentication — Verify the identity of the MCP client before granting access
Authorization — Define which tools each client can access and what parameters are allowed
Input validation — Sanitize all inputs before executing tool logic
Audit logging — Log every tool invocation with timestamps, inputs, outputs, and the requesting agent
Rate limiting — Prevent runaway agents from overwhelming your systems

Transport options

MCP supports multiple transport mechanisms:

Transport	Best For	Latency
stdio	Local development, CLI tools	Lowest
HTTP/SSE	Web applications, remote servers	Low-moderate
WebSocket	Real-time bidirectional communication	Low

For most production deployments, HTTP/SSE provides the best balance of performance and compatibility.

Error handling

Agents need to handle tool failures gracefully. A well-designed MCP server returns structured errors that help the agent decide whether to retry, try an alternative approach, or escalate to a human.

@server.call_tool()
async def call_tool(name: str, arguments: dict):
 try:
 result = await execute_tool(name, arguments)
 return [TextContent(type="text", text=result)]
 except RateLimitError:
 return [TextContent(type="text", text="Rate limit exceeded. Try again in 60 seconds.")]
 except AuthorizationError:
 return [TextContent(type="text", text="Not authorized to perform this action.")]
 except Exception as e:
 return [TextContent(type="text", text=f"Tool execution failed: {str(e)}")]

The MCP Server Ecosystem in 2026

Here is the operator's tour of the May 2026 ecosystem, organized by category. We have run most of these in production for clients or internally; the gotchas are real.

Databases and data warehouses

— the canonical Postgres server (the original lived in modelcontextprotocol/servers and was moved to modelcontextprotocol/servers-archived in May 2025 alongside other early reference servers; community-maintained forks and several vendor-built alternatives are now the active path), read-only by default with a connection string. Great for letting agents run analytical queries against a replica. Gotcha: it does not enforce row-level security on its own; point it at a read replica with a least-privilege user.
— official MongoDB-maintained server that supports both Atlas and self-hosted clusters. Exposes find, aggregate, listCollections, and (optionally) write ops behind a flag.
— warehouse-side servers for analytics agents. Watch the cost: an agent that runs SELECT * against a 2 TB table will happily burn your Snowflake credits.

Developer tools

— file read/write/list scoped to an allowlisted root. This is the single most used MCP server in our practice; almost every coding agent gets it.
— git log, git diff, git show, branch ops. Pair with the filesystem server for code review agents.
— official GitHub integration covering issues, PRs, comments, and repo search. Requires a PAT; scope it carefully because the server runs with whatever permissions that token has.
— arbitrary shell command execution. Powerful and dangerous. We allow this only in sandboxed dev containers, never on a developer's host machine without an allowlist.

Browsers and web automation

— headless Chromium driven by Puppeteer. Reliable for scraping and form-filling, slow to spin up (~1-2s per launch).
— Microsoft's Playwright-based server with better selector handling and accessibility tree support. Our default for browser agents in 2026.
— wraps the Browser-Use library, optimized for vision-language models that want to "see" the page rather than parse the DOM.

Productivity and SaaS

— official Notion server, supports page reads, database queries, and writes. Auth via internal integration token.
— official Linear server. Issues, projects, comments, cycles. Excellent schemas; the descriptions are tuned for LLM consumption.
— read channels, post messages, search history. The write permission is the one that gets people in trouble; start read-only.

Cloud infrastructure

— the official AWS MCP server suite. Separate servers for cost analysis, CloudWatch logs, S3, Lambda, ECS. Authenticate with standard AWS credentials.
— Google Cloud equivalent covering GCS, BigQuery, Cloud Run, and IAM listings.
— Cloudflare-maintained server for Workers, KV, R2, and zone management.

Search and retrieval

— Brave Search API wrapper. Cheap, decent quality, no scraping.
— Tavily search and extract. Optimized for agentic workflows that want clean Markdown back, not raw HTML.
— neural search; better for "find papers about X" style queries than keyword-heavy ones.

MCP marketplaces and discovery

A directory pattern emerged in 2025 and matured through 2026. Today there are three main ways to discover and install MCP servers:

The official GitHub repo — hosts the reference implementations. This is the source of truth for "is there an official server for X?" and the servers here get the most testing.
Smithery — is the most-used community registry: searchable index, one-line install commands, version metadata, and basic security ratings. Think of it as npm for MCP servers. It tends to host more experimental and community-built servers than the official repo.
Claude's built-in MCP marketplace — lets users browse and install vetted MCP servers directly from the Claude Desktop UI without touching a config file. Great for non-technical users; the vetted catalog is smaller than Smithery's.

Build vs Buy: Which MCP Server Should You Write?

Here is the decision framework we use with clients.

Buy (adopt an existing server) when

A reputable official or popular community server exists for the tool. Postgres, MongoDB, GitHub, Slack, Notion, Linear, AWS, GCP, Cloudflare, Brave Search — these all have well-maintained servers that you should not be rewriting. Adopting one is typically a 30-minute job: install, configure auth, add to your client's MCP config.
The tool's API is stable and public. If the upstream API changes rarely and the existing server tracks it, you inherit free maintenance.
Your use case is read-heavy or covers standard operations. Stock servers cover 80% of standard CRUD; that is usually all you need.

Build (write your own) when

You have a proprietary internal API. Your billing service, your internal feature flag system, your domain-specific data pipeline — no one else has these. A thin MCP server wrapping your existing REST or gRPC endpoints is the right answer.
You need a security-isolated wrapper around a public service. Sometimes the official server exposes too much surface area. For example, the GitHub MCP server with a broad PAT can do almost anything; a custom wrapper that only exposes read_pr_comments and post_review_comment against a single repo is far safer for an automated reviewer agent.
You need domain-specific tool descriptions. A generic Postgres server exposes query and list_tables. An MCP server for your specific analytics warehouse can expose get_revenue_by_cohort(start_date, end_date, segment) — a tool description tuned to your business will dramatically improve agent reliability versus making the agent compose SQL from scratch.
You need server-side caching, rate-limit handling, or batching. Wrapping a flaky third-party API in your own MCP server lets you add retries, cache responses, and merge multiple agent calls into one upstream request.

Real cost numbers

From our practice in 2026:

Adopting an existing server is typically 30 minutes to a few hours: install, configure credentials, add to your MCP client config, test with a real agent task, document for the team.
Building a custom MCP server from scratch is typically 1-3 days of work for a senior engineer for a small, well-scoped server (5-15 tools, one upstream API, basic auth). This includes the server code, integration tests, dockerization, and deployment to your internal infrastructure.
Building and maintaining a large internal MCP platform — multiple servers, shared auth, audit logging, monitoring — is a multi-week engineering project. Most teams should not do this until they have at least 3-4 custom servers in production.

The trap to avoid: building a custom server for a tool that already has a perfectly good community one, "because we wanted to control it." You end up with a worse server and a maintenance liability.

Production Gotchas We Have Hit (and You Will)

The protocol is simple. Running MCP in production is where reality intrudes.

The security model is "your local permissions"

The mitigations are not exotic, but they have to be deliberate:

Default to read-only servers whenever the use case allows. Most agentic workflows do not need write access on day one.
Run write-capable servers in containers or sandboxes — even on developer machines. A Docker container with mounted scopes is far safer than direct host access.
Scope credentials tightly. A GitHub PAT for an MCP server should be repo-specific, not org-wide. AWS credentials should be role-based with the narrowest IAM policy that works.
Audit log every tool call. For production servers, log who called what with which arguments. When something breaks (or worse, when someone exfiltrates data), the log is the difference between a five-minute investigation and a five-day one.

Tool sprawl makes agents worse, not better

Version pinning is not optional

Latency: synchronous round-trips compound

For agents that need to do dozens of small operations, this compounds badly. Two patterns that work:

Batch operations into single tool calls. Instead of a read_file tool that the agent calls 20 times, expose a read_files(paths: string[]) tool. The agent picks the paths it wants, you do the I/O once.
Prefer expressive resources over chatty tools. If an agent needs a project's full directory tree, exposing it as a single resource the agent reads once beats making it call list_dir recursively.

Version compatibility across clients

Which AI clients support MCP (May 2026)

Client	MCP Support Level	Notes
Claude Desktop	Full	The reference implementation. Tools, resources, prompts, sampling.
Claude Code	Full	First-class MCP support in the CLI; configured via `~/.claude/settings.json`.
Cursor	Full	Added native MCP support in early 2026; configured in Cursor settings.
ChatGPT (Apps SDK)	Tools + UI components	Built on MCP; full tool support plus Apps SDK-specific UI component rendering. Resources and prompts are still maturing.
Cline	Full	Open-source VS Code agent; one of the earliest non-Anthropic MCP clients.
Continue	Full	Open-source IDE assistant for VS Code and JetBrains.
Zed	Full	The Zed editor supports MCP servers via its assistant panel.
Goose (Block)	Full	Block's open-source agent runtime; heavy MCP integration.
Windsurf (Codeium)	Partial / Full	MCP support added during 2026; check current docs for parity with Cursor.
LibreChat, Open WebUI	Community plugins	Available via community extensions rather than first-party.

If you are choosing a client for your team based on MCP needs, see our breakdown in Claude vs GPT vs Gemini (2026) and Cursor vs GitHub Copilot vs Windsurf.

Before building a custom MCP server, check whether a pre-built one already exists for your use case. The ecosystem is growing weekly.

When to Use MCP

MCP makes the most sense when:

You are building agents that need to access multiple external tools
You want model portability — the ability to switch LLMs without rewriting integrations
You are building a platform where multiple agents or users need access to the same tools
You want to leverage the growing ecosystem of pre-built MCP servers

MCP adds unnecessary complexity when:

Your agent uses a single tool with a simple API call
You are locked into one model provider with no plans to switch
Your use case does not involve external tool use

Getting Started with MCP

Start with pre-built servers — Check the MCP ecosystem for existing servers that match your tools. Most common integrations (databases, CRMs, communication tools) already have MCP servers.
Build custom servers for proprietary tools — If your agent needs access to internal APIs or custom business logic, build a lightweight MCP server following the pattern above.
Test with multiple models — Verify that your MCP servers work correctly with different AI models (GPT-4o, Claude, Gemini) to confirm true portability.

What Problem Does MCP Solve?

Before MCP

With MCP

How MCP Works

Core components

Communication flow

What MCP servers expose

MCP vs Function Calling

MCP vs A2A vs ACP

Model Context Protocol (MCP)

Agent-to-Agent Protocol (A2A)

Agent Communication Protocol (ACP)

How they work together

Building an MCP Server

MCP server in TypeScript

MCP in Production: Key Considerations

Security

Transport options

Error handling

The MCP Server Ecosystem in 2026

Databases and data warehouses

Developer tools

Browsers and web automation

Productivity and SaaS

Cloud infrastructure

Search and retrieval

MCP marketplaces and discovery

Build vs Buy: Which MCP Server Should You Write?

Buy (adopt an existing server) when

Build (write your own) when

Real cost numbers

Production Gotchas We Have Hit (and You Will)

The security model is "your local permissions"

Tool sprawl makes agents worse, not better

Version pinning is not optional

Latency: synchronous round-trips compound

Version compatibility across clients

Which AI clients support MCP (May 2026)

When to Use MCP

Getting Started with MCP

Explore Related Solutions

Need Help Building Your Project?

Related Articles

AI Browser Automation in 2026: ChatGPT Agent, Computer Use, and What Actually Ships

AI Cost Optimization at Scale: How We Cut LLM Bills 60% Without Quality Loss

Blockchain Development in 2026: What's Actually Worth Building

What Problem Does MCP Solve?

Before MCP

With MCP

How MCP Works

Core components

Communication flow

What MCP servers expose

MCP vs Function Calling

MCP vs A2A vs ACP

Model Context Protocol (MCP)

Agent-to-Agent Protocol (A2A)

Agent Communication Protocol (ACP)

How they work together

Building an MCP Server

MCP server in TypeScript

MCP in Production: Key Considerations

Security

Transport options

Error handling

The MCP Server Ecosystem in 2026

Databases and data warehouses

Developer tools

Browsers and web automation

Productivity and SaaS

Cloud infrastructure

Search and retrieval

MCP marketplaces and discovery

Build vs Buy: Which MCP Server Should You Write?

Buy (adopt an existing server) when

Build (write your own) when

Real cost numbers

Production Gotchas We Have Hit (and You Will)

The security model is "your local permissions"

Tool sprawl makes agents worse, not better