Toolverse

ElevenLabs

Create the most realistic speech with our AI audio tools in 1000s of voices and 32 languages. Easy to use API's and SDK's. Scalable, secure, and customizable voice solutions tailored for enterprise needs. Pioneering research in Text to Speech and AI Voice Generation.

Code assistantsVoice & speech

EasyAnnounce

EasyAnnounce is an automation platform specialized in Public Address (PA) announcements and international name pronunciation. It is designed for environments like airports, hospitals, and resorts where clear communication is vital. The platform uses a purpose-built name...

Code assistantsVoice & speech

NemoVideo

NemoVideo is a professional AI video editing agent designed to help users create viral content through natural language conversations. It acts as an intelligent production assistant that handles the entire video workflow—from hunting viral trends and analyzing patterns to...

Video generationWriting & contentVoice & speech

Doctor Handwriting Reader AI

Doctor Handwriting Reader AI is a specialized tool designed to decode and interpret messy, handwritten medical prescriptions and notes. Using advanced AI-powered OCR, it converts difficult-to-read clinical handwriting into clear, structured text. The platform not only...

ProductivityVoice & speech

Cheetu AI

Cheetu AI delivers real-time transcription, live translation, and instant AI summaries for every meeting,lecture, or interview.

ProductivityVoice & speech

Vois

Vois is a professional desktop AI voice studio designed for high-quality audio production. It allows users to transform scripts, ebooks, articles, and podcasts into natural-sounding speech using over 63 expressive voices and voice cloning technology. Unlike cloud-based...

Audio & musicVoice & speech

Speakoala

Speakoala is an AI-powered text-to-speech (TTS) reading assistant designed to help users consume digital content through listening. It can read any website, email, and local document (including PDF, DOCX, and EPUB) with natural, lifelike voices. The tool supports over 70...

ProductivityVoice & speechAI agents

Ghostype

Ghostype is a context-aware AI voice interface specifically designed for macOS. It functions as an invisible AI layer that bridges the speed gap between speaking and typing by converting voice to polished text in real-time. The tool is uniquely aware of the active...

Writing & contentProductivityTranslation

VocoSpeech

VocoSpeech is a native macOS application designed for high-quality, offline AI voice generation and instant voice cloning. It serves as a local alternative to cloud-based services like ElevenLabs, running 100% on Apple Silicon to ensure that sensitive audio data remains...

Voice & speech

SpotScribe

SpotScribe is an AI-powered platform designed to convert Spotify podcasts into text effortlessly. It allows users to extract accurate transcripts, generate concise AI summaries, and interact with podcast episodes through an AI chat interface. The tool supports high-precision...

Audio & musicProductivityVoice & speech

Video to Text AI

Video to Text AI is an advanced transcription platform that utilizes state-of-the-art machine learning and speech recognition algorithms to convert spoken content from videos and audio into accurate written text. It supports over 55 languages and can process various video...

Writing & contentVoice & speech

FlowSpeech

FlowSpeech is an AI-powered text-to-speech (TTS) studio designed to convert text into highly realistic, human-like audio. It distinguishes itself through context-aware technology that understands the sentiment, timing, and nuance of a script. The platform offers advanced...

Voice & speech

SurfSense

SurfSense is a highly customizable AI research agent, connected to external sources such as search engines, Google Drive, Slack, Microsoft Teams, Linear, Jira, ClickUp, Confluence, BookStack, Gmail, Notion, YouTube, GitHub, Discord, Airtable, Google Calendar, Luma,...

Image generationCode assistantsWriting & content

fluents AI

Fluents.ai is a unified Intelligent Virtual Agent (IVA) platform engineered for human-grade voice interactions at enterprise scale. By automating both inbound and outbound calling workflows with sub-second latency, it provides a seamless, natural experience that eliminates...

ChatbotsProductivityVoice & speech

YTVidHub

YTVidHub is a professional bulk YouTube subtitle downloader and transcript extractor designed for high-volume data collection. It allows users to extract subtitles from entire playlists and channels in a single click, supporting formats like SRT, VTT, and clean TXT. The...

ProductivityResearch & analysisVoice & speech

Dictato

Dictato is a private, fast voice-to-text dictation application specifically built for macOS. It allows users to transcribe speech directly into any application—such as Gmail, Slack, or VS Code—using a global hotkey. The app operates 100% on-device, meaning no audio data is...

Writing & contentVoice & speech

Reloop

Reloop is an AI-powered UGC (User-Generated Content) video generator designed to create high-converting video ads without requiring complex prompting or technical skills. It features a conversational creative agent that handles video production end-to-end—from understanding...

Image generationVideo generationMarketing

Guideless

Guideless is an AI-powered documentation and video guide platform designed to transform browser-based workflows into professional, narrated videos. It eliminates the frustration of manual video editing by using a Chrome extension to capture clicks and automatically generating...

Video generationWriting & contentProductivity

trnscrb

trnscrb is a local meeting transcription tool for macOS that lives in the menu bar and automatically detects meetings on platforms like Zoom, Google Meet, Microsoft Teams, Slack, and FaceTime. It utilizes OpenAI's Whisper model to perform on-device transcription via...

ProductivityVoice & speech

Prism

Prism is an all-in-one AI video creation platform designed for making short-form content without needing multiple external tools. It allows users to generate image and video assets using various state-of-the-art models like Sora, Kling, and Veo, organize them into projects,...

Image generationVideo generationMarketing

Stage Captions

Stage Captions is a professional, browser-based real-time closed captioning software designed for live events, conferences, and broadcasts. It utilizes an advanced AI engine to deliver production-ready live transcription with industry-leading low latency. The platform allows...

Writing & contentVoice & speech

Obi

Obi, developed by Cor (Corellian Systems), is a voice AI agent designed for customer onboarding and user activation. It functions like a live video call, using voice and on-screen awareness to guide users through product setups, share best practices, and answer questions in...

Voice & speechAI agentsCustomer support

PopAir

PopAir is a native macOS AI copilot designed for speed and seamless system-wide integration. Built with SwiftUI to consume significantly less RAM than Electron-based apps, it serves as a unified hub for leading AI models including GPT, Claude, Gemini, and DeepSeek. The...

Image generationWriting & contentProductivity

TalkToPost

TalkToPost is an AI-driven platform specifically designed to convert raw voice notes into professional, high-engagement content for LinkedIn, X (Twitter), and Reddit. It functions by transcribing spoken ideas, analyzing the user's tone and structure, and then...

MarketingVoice & speech

Showing 24 of 1,933 tools