
Spark-TTS
<p>Overview Spark-TTS 是由出门问问(Mobvoi)联合多所顶尖学术机构(如香港科技大学、上海交通大学)最新推出的新一代语音合成模型,其核心创新在于BiCodec编码技术和与文本大模型的结构统一性,利用大型语言模型 (LLM) 的强大功能实现高度准确且自然的语音合成。</p> <p>Spark-TTS is an advanced text

Zeemo
Zeemo AI is a powerful tool for captioning videos with accurate and fast audio to text transcription.

Final Round AI
Real-time AI copilot for interviewees

DubbingX
智声云配(DubbingX) 是 AI 智能配音工具,提供语音合成(TTS)、音色迁移、歌声转换等多种功能。工具支持中文、英文、日文、粤语等多语言,拥有近2500种情绪语态,支持高度定制,满足游戏、影视、动漫、有声书等多场景需求。工具音色版权合规,支持商用,能显著降低配音成本。智声云配结合专业高校和全球配音演员资源,致力于为用户提供高质量、多样化的音频解决方案。 智声云配官网:https://d

Tactiq
Tactiq is a top transcription tool for online meetings, offering real-time transcription and meeting summaries.

VNSplit
Receive AI summaries of voice notes instead of listening to whole messages with VNSplit.

Tarteel
Recite the Quran confidently with live feedback and AI assistance.

Dicte.ai
AI app for recording and transcribing meetings effectively.

LeVo
LeVo是腾讯AI实验室推出的AI唱歌模型,具备强大的音色克隆能力,仅需3秒音频即可精准复制目标音色,包括音调、情感和韵律,无需大量训练数据。LeVo支持分轨生成,可分别生成人声和伴奏音轨,为后期编辑提供便利。技术架构基于语言模型(LM),结合LeLM和音乐编解码器,能并行生成音轨,音质表现接近行业领先水平,在歌词对齐能力上表现卓越。 LeVo的项目地址 项目官网: https://lev

OLOCR
OLOCR provides unlimited OCR for images and PDFs, allowing users to extract text easily.

Intellisay
Efficiently plan your day with voice.

VOMO AI
Convert voice to organized notes effortlessly.

Cockatoo
Cockatoo is an AI-powered transcription service that provides accurate text and subtitle conversion in multiple languages.

Voz AI Voice Note Taker
Automated note-taking and transcription tool for lectures and videos.

Transkriptor
Convert audio and video to text with Transkriptor's powerful AI.

Shoonya AI
AI models optimized for commerce and retail applications.

AI-Powered WhatsApp Assistant
AI WhatsApp assistant for business communication and automation.

Deepdub
Dubbing and voice over localization at scale.

superwhisper
SuperWhisper is a voice-to-text app powered by AI for macOS.

Audyo
Audyo is a platform that allows users to edit and create audio like writing a document.

VoiceCanvas
VoiceCanvas 是开源的多语言语音合成平台。基于 AI 技术提供高质量的文字转语音服务,支持超过 50 种语言,集成 OpenAI TTS、AWS Polly 和 MiniMax 等多种语音服务。VoiceCanvas 提供个人声音克隆功能,用户上传几秒音频样本能创建个性化声音。VoiceCanvas适合内容创作者、教育工作者和企业用户,显著提升语音内容制作效率。 VoiceCanvas

AiMakeSong
AiMakeSong 是基于人工智能的音乐和歌曲生成平台,支持用户通过简单的文本输入或歌词创作来生成高质量的音乐作品。用户可以选择将文字描述转化为音乐,或者将自己创作的歌词转化为完整的歌曲。平台提供了多种音乐风格和声音选项,包括流行、摇滚、说唱、古典等,以及男性、女性或乐器声音,满足不同用户的需求。 AiMakeSong的主要功能 文本转音乐:用户可以通过描述自己的音乐想法,将这些想法

Mictoo
Mictoo is a free tool for transcribing audio and video into text.

ScriptMe
ScriptMe provides fast and accurate transcriptions and subtitling in multiple languages.
只显示前20页数据