MCP in production: what we actually wired up at Tuscan and what broke

Model Context Protocol servers are no longer a proof-of-concept curiosity. Agencies running AI-assisted pipelines on stacks like Claude, Make.com, and Vercel are wiring MCP into production, and the rough edges are becoming visible fast.

What MCP actually is in a working pipeline

Anthropic's Model Context Protocol gives language models a standardized way to call external tools, read structured context, and take actions across services. In practice, that means a Claude-based agent can be granted access to a Slack workspace, a CMS API, or a search console, and call those tools natively during a run rather than relying on brittle prompt-stuffed instructions.

The protocol specification, published by Anthropic in late 2024, defines three primitives: tools, resources, and prompts. Tools are callable functions. Resources are readable data. Prompts are reusable instruction templates the server exposes to the client. Most production MCP servers in the wild implement tools only, leaving resources and prompts largely unused as of mid-2025.

Which MCP servers are proving useful in agency work

The most reliable MCP servers in active agency use tend to cluster around communication and data retrieval. Slack MCP, GitHub MCP, and Google Analytics MCP are frequently cited in developer threads as stable enough for unattended runs. A developer writing on the Anthropic Discord described Slack MCP as "the one that just works," contrasting it with several data-write servers that require manual token refresh handling.

Search and content retrieval servers, including a Serper-backed web search MCP and various CMS-adjacent adapters, are also seeing real use. Agencies building content pipelines on headless CMS platforms report that a thin MCP wrapper over a REST API is often faster to ship than a full LangChain tool integration. One thread on the r/ClaudeAI subreddit described a setup where a custom MCP server wraps a Supabase client, letting Claude query and write content rows without any prompt-side SQL construction.

Where the ecosystem still needs glue code

Authentication is the most common friction point reported across community forums and developer write-ups. OAuth flows in particular create a gap: MCP servers expect a token at startup, but many agency automation runs are headless and scheduled, meaning token expiry mid-run causes silent failures rather than clean errors. A Reddit thread from April 2025 described this as "the biggest operational gap," with several developers writing custom token-refresh wrappers as a workaround until the protocol adds native refresh support.

Environment isolation is a second recurring issue. MCP servers run as local processes or hosted services, and in cloud execution environments, such as GitHub Actions runners or sandboxed serverless functions, outbound HTTP from the MCP server layer is often blocked by firewall rules that the agent layer does not inherit. Developers have reported that Google Places calls and Wikipedia lookups fail silently in these environments, requiring either a proxy layer or a fallback to a less-restricted image or data source. The fix is not complex, but it is not documented in the official MCP quickstart.

Version drift between the MCP SDK and the Claude API client is a third friction point. Because both are moving fast, a minor SDK bump can break tool call serialization in ways that only surface at runtime. Several developers recommend pinning both dependencies and running a smoke-test tool call as the first step of any automated workflow that depends on MCP.

Patterns that reduce production failures

Teams reporting stable MCP deployments tend to share a few common practices. First, they scope MCP servers narrowly. Rather than exposing an entire platform API as a tool surface, they define one MCP server per logical task, such as one for content publishing and one for analytics retrieval. This reduces the blast radius when a single server goes offline or throws an unexpected error.

Second, they treat MCP tool calls as fallible by default. In Claude's tool use API, a tool can return an error result rather than a success, and well-structured agents are built to handle that gracefully, either retrying, logging, or falling through to a default. Developers who wire MCP tools as if they are guaranteed to succeed report harder-to-debug failures than those who treat every external call as potentially unavailable.

Third, they keep a thin orchestration layer, whether a Make.com scenario, an n8n workflow, or a GitHub Actions job, above the agent so that failures surface as observable events rather than silent non-runs. The MCP layer handles tool execution, but the pipeline layer handles visibility and alerting.

What is still missing from the MCP ecosystem

A few categories of tooling are conspicuously absent or immature. Paid media APIs, including Meta Ads and Google Ads, have no widely adopted MCP wrappers as of this writing. Agencies that want Claude to read campaign performance data are still building custom tool integrations rather than pulling from a shared server. E-commerce platforms are in a similar state. Shopify MCP exists as a community project but lacks the maintenance track record that production use demands.

Testing infrastructure is also thin. There is no standard way to mock MCP server responses for local development, which means developers are either running live calls during testing or maintaining parallel stub implementations. A few open-source projects are attempting to fill this gap, but none have emerged as a community standard.

MCP is genuinely useful in production for the narrow category of tools it currently handles well. Agencies building on it should pick stable, read-oriented servers first, plan for auth edge cases before they hit them, and keep a human-readable log layer outside the agent itself. The protocol is moving quickly enough that gaps closing in the next two quarters is a reasonable expectation, not wishful thinking.

MCP in production: what we actually wired up at Tuscan and what broke

What MCP actually is in a working pipeline

Which MCP servers are proving useful in agency work

Where the ecosystem still needs glue code

Patterns that reduce production failures

What is still missing from the MCP ecosystem

More writing

What the OpenAI Partner Network actually means for small agencies

What's actually in my .claude/skills directory (and why you should have one)

Nvidia RTX Spark and the case for on-prem AI for SMB clients

Anthropic Mythos hits the EU: what it signals about Claude's enterprise roadmap

Qwen 3.6 vs Claude vs GPT: When Local Models Actually Make Sense for Agency Work

The Eternal Sloptember: How Tuscan Filters AI Slop Out of Client Content

Start a project.

Start a project.