by cartesia-ai
Enables clients to call Cartesia's API for speech localization, text‑to‑audio synthesis, voice infilling, and voice swapping directly from local MCP servers.
Provides a local MCP (Model Context Protocol) server that bridges applications such as Claude Desktop, Cursor, and OpenAI agents with Cartesia's speech AI platform. It supports listing voices, converting text to audio, localizing speech to different languages, infilling missing audio segments, and re‑voicing existing files.
pip install cartesia-mcp
which cartesia-mcp # get the absolute path to the executable
CARTESIA_API_KEY
(and optionally OUTPUT_DIRECTORY
).Q: Do I need a paid Cartesia account? A: No, a free tier provides 20,000 credits per month, sufficient for most development and testing.
Q: Which languages are supported for localization? A: All languages offered by Cartesia’s API can be used; refer to the Cartesia documentation for the full list.
Q: Can I run the server without installing Python packages?
A: The server is distributed as a Python package; installation via pip
is required.
Q: How do I specify the output folder for generated audio?
A: Set the OUTPUT_DIRECTORY
environment variable in the client configuration; if omitted, files are saved in the current working directory.
The Cartesia MCP server provides a way for clients such as Cursor, Claude Desktop, and OpenAI agents to interact with Cartesia's API. Users can localize speech, convert text to audio, infill voice clips etc.
Ensure that you have created an account on Cartesia, there is a free tier with 20,000 credits per month. Once in the Cartesia playground, create an API key under API Keys --> New.
pip install cartesia-mcp
which cartesia-mcp # absolute path to executable
Add the following to claude_desktop_config.json
which can be found through Settings --> Developer --> Edit Config.
{
"mcpServers": {
"cartesia-mcp": {
"command": "<absolute-path-to-executable>",
"env": {
"CARTESIA_API_KEY": "<insert-your-api-key-here>",
"OUTPUT_DIRECTORY": // directory to store generated files (optional)
}
}
}
}
Try asking Claude to
Create either a .cursor/mcp.json
in your project or a global ~/.cursor/mcp.json
. The same config as for Claude can be used.
Please log in to share your review and rating for this MCP.
Discover more MCP servers with similar functionality and use cases
by ahujasid
AbletonMCP is an integration that connects Ableton Live with Claude AI, enabling prompt-assisted music production and live session manipulation through natural language commands.
by Simon-Kansara
ableton-live-mcp-server is a Python-based server that implements the Model Context Protocol (MCP) to enable communication between Large Language Models (LLMs) and Ableton Live. It provides a robust interface for LLMs to control various aspects of Ableton Live, facilitating AI-powered music production.
by glassBead-tc
audius-mcp-atris is a Model Context Protocol (MCP) server that enables Large Language Models (LLMs) to interact with the Audius music platform. It provides an extensive set of tools for natural language interaction, allowing users to discover music, manage content, and analyze data on Audius through conversational AI.
by netdata
Real-time, per‑second infrastructure monitoring platform that provides instant insights, auto‑discovery, edge‑based machine‑learning anomaly detection, and lightweight visualizations without requiring complex configuration.
by zed-industries
Provides real-time collaborative editing powered by Rust, enabling developers to edit code instantly across machines with a responsive, GPU-accelerated UI.
by github
Enables AI agents, assistants, and chatbots to interact with GitHub via natural‑language commands, providing read‑write access to repositories, issues, pull requests, workflows, security data and team activity.
by daytonaio
Provides a secure, elastic sandbox environment for executing AI‑generated code with isolated runtimes and sub‑90 ms provisioning.
by RooCodeInc
An autonomous coding agent that lives inside VS Code, capable of generating, refactoring, debugging code, managing files, running terminal commands, controlling a browser, and adapting its behavior through custom modes and instructions.
by chaitin
Provides a self‑hosted web application firewall and reverse‑proxy that filters, monitors, and blocks malicious HTTP/S traffic, protecting web applications from attacks such as SQL injection, XSS, brute‑force, bot abuse, and various code injections.