Speaker recognition - 梅斯AI导航站

Azure AI Vision Face MCP-Server

Hosts the Azure AI Vision Face Liveness MCP server

SSE-based Server and mobile Angular App

MCP server for image recognition with an Angular mobile client app.

Mcp Mindmesh

Claude 3.7 Swarm with Field Coherence: A Model Context Protocol (MCP) server that orchestrates multiple specialized Claude 3.7 Sonnet instances in a quantum-inspired swarm. It creates a field coherenc

MindMesh MCP Server

Claude 3.7 Swarm with Field Coherence: A Model Context Protocol (MCP) server that orchestrates multiple specialized Claude 3.7 Sonnet instances in a quantum-inspired swarm. It creates a field coherenc

Asr_mcp_server

A Model Context Protocol (MCP) server that provides ASR (Automatic Speech Recognition) capabilities using the Whisper engine. This server exposes ASR functionality through MCP tools, making it easy to

Whisper Speech Recognition MCP Server

A high-performance speech recognition MCP server based on Faster Whisper, providing efficient audio transcription capabilities.

Entity Identification

Determines whether two sets of data refer to the same entity.

MCP Image Recognition Server

An MCP server that provides image recognition 👀 capabilities using Anthropic and OpenAI vision APIs

Spark-TTS

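Overview: Spark-TTS is a new-generation speech synthesis model recently released by Mobvoi (出门问问) together with several leading academic institutions, such as the Hong Kong University of Science and Technology and Shanghai Jiao Tong University. Its core innovations are the BiCodec encoding technique and a structure unified with text large language models, so it draws on the capabilities of large language models (LLMs) to achieve highly accurate and natural speech synthesis. Spark-TTS is an advanced text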

Muyan-TTS

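Muyan-TTS is a low-cost, fully open-source model with good support for secondary development, intended for audio technology enthusiasts in academia and small application teams. Because its training data is limited in scale, the currently open-sourced version of Muyan-TTS performs well only for English. However, thanks to the detailed training methodology open-sourced alongside it, developers in related fields can flexibly upgrade and customize Muyan-TTS to fit their own business scenarios. 01. H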

Lovart

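Lovart, the world's first design Agent. Three features of the Lovart experience: 1. End-to-end design and execution from a single sentence. Earlier text-to-image tools handled only the "generate an image" step. A design Agent, by contrast, acts like a "design executive officer", covering the entire visual workflow from creative breakdown to professional delivery. From intent breakdown → task chain → finished product, one sentence handles it all. A single run can execute up to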
