关键词 "speech synthesizer" 的搜索结果, 共 24 条, 只显示前 480 条
Advanced MCP server providing comprehensive text transformation and formatting tools. TextToolkit offers over 40 specialized utilities for case conversion, encoding/decoding, formatting, analysis, and
TTS MCP Server
MCP Server Tool for Text To Speech
An advanced MCP server for Home Assistant. 🔋 Batteries included.
MCP server for macOS text-to-speech functionality
Mirror of
Model Context Protocol for Text-to-Speech
A Streamlit-based chatbot interface powered by OpenAI GPT-4o that intelligently routes user input to custom MCP tools such as GPT chat, image generation, Supabase queries, and text-to-speech.
Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech and video generation APIs.
This project is a proof of concept for running a local-first multi-agent system using: 🤖 Local LLMs via Ollama 🧩 Simple function/tool-call detection using <tool_call>... 🔍 Brave Search API or optional
Official MiniMax Model Context Protocol (MCP) JavaScript implementation that provides seamless integration with MiniMax's powerful AI capabilities including image generation, video generation, text-to
A Model Context Protocol (MCP) server that enables AI assistants to generate images, text, and audio through the Pollinations APIs. Supports customizable parameters, image saving, and multiple model o
A Model Context Protocol (MCP) server that provides ASR(Automatic Speech Recognition) capabilities using the whisper engine. This server exposes TTS functionality through MCP tools, making it easy to
A MCP server that provides audio transcription capabilities using OpenAI's Whisper API
Kokoro Text to Speech (TTS) MCP Server
A high-performance speech recognition MCP server based on Faster Whisper, providing efficient audio transcription capabilities.
Model Control Protocol (MCP) server for ElevenLabs Scribe ASR API
<p>Overview Spark-TTS 是由出门问问(Mobvoi)联合多所顶尖学术机构(如香港科技大学、上海交通大学)最新推出的新一代语音合成模型,其核心创新在于BiCodec编码技术和与文本大模型的结构统一性,利用大型语言模型 (LLM) 的强大功能实现高度准确且自然的语音合成。</p> <p>Spark-TTS is an advanced text
Muyan-TTS,一款低成本、具备良好二次开发支持的模型并完全开源,以方便学术界和小型应用团队的音频技术爱好者。 当前开源的Muyan-TTS版本由于训练数据规模有限,致使其仅对英语语种呈现出良好的支持效果。不过,得益于与之同步开源的详尽训练方法,从事相关行业的开发者能够依据自身实际业务场景,灵活地对Muyan-TTS进行功能升级与定制化改造。 01. H
只显示前20页数据,更多请搜索
Showing 457 to 480 of 484 results