
ElevenLabs
Create the most realistic speech with our AI audio tools in 1000s of voices and 32 languages. Easy to use API's and SDK's. Scalable, secure, and customizable voice solutions tailored for enterprise needs. Pioneering research in Text to Speech and AI Voice Generation.

EasyAnnounce
EasyAnnounce is an automation platform specialized in Public Address (PA) announcements and international name pronunciation. It is designed for environments like airports, hospitals, and resorts where clear communication is vital. The platform uses a purpose-built name...

NemoVideo
NemoVideo is a professional AI video editing agent designed to help users create viral content through natural language conversations. It acts as an intelligent production assistant that handles the entire video workflow—from hunting viral trends and analyzing patterns to...

Doctor Handwriting Reader AI
Doctor Handwriting Reader AI is a specialized tool designed to decode and interpret messy, handwritten medical prescriptions and notes. Using advanced AI-powered OCR, it converts difficult-to-read clinical handwriting into clear, structured text. The platform not only...
Cheetu AI
Cheetu AI delivers real-time transcription, live translation, and instant AI summaries for every meeting,lecture, or interview.

Vois
Vois is a professional desktop AI voice studio designed for high-quality audio production. It allows users to transform scripts, ebooks, articles, and podcasts into natural-sounding speech using over 63 expressive voices and voice cloning technology. Unlike cloud-based...
Speakoala
Speakoala is an AI-powered text-to-speech (TTS) reading assistant designed to help users consume digital content through listening. It can read any website, email, and local document (including PDF, DOCX, and EPUB) with natural, lifelike voices. The tool supports over 70...

Ghostype
Ghostype is a context-aware AI voice interface specifically designed for macOS. It functions as an invisible AI layer that bridges the speed gap between speaking and typing by converting voice to polished text in real-time. The tool is uniquely aware of the active...
VocoSpeech
VocoSpeech is a native macOS application designed for high-quality, offline AI voice generation and instant voice cloning. It serves as a local alternative to cloud-based services like ElevenLabs, running 100% on Apple Silicon to ensure that sensitive audio data remains...
SpotScribe
SpotScribe is an AI-powered platform designed to convert Spotify podcasts into text effortlessly. It allows users to extract accurate transcripts, generate concise AI summaries, and interact with podcast episodes through an AI chat interface. The tool supports high-precision...
Video to Text AI
Video to Text AI is an advanced transcription platform that utilizes state-of-the-art machine learning and speech recognition algorithms to convert spoken content from videos and audio into accurate written text. It supports over 55 languages and can process various video...

FlowSpeech
FlowSpeech is an AI-powered text-to-speech (TTS) studio designed to convert text into highly realistic, human-like audio. It distinguishes itself through context-aware technology that understands the sentiment, timing, and nuance of a script. The platform offers advanced...

SurfSense
SurfSense is a highly customizable AI research agent, connected to external sources such as search engines, Google Drive, Slack, Microsoft Teams, Linear, Jira, ClickUp, Confluence, BookStack, Gmail, Notion, YouTube, GitHub, Discord, Airtable, Google Calendar, Luma,...

fluents AI
Fluents.ai is a unified Intelligent Virtual Agent (IVA) platform engineered for human-grade voice interactions at enterprise scale. By automating both inbound and outbound calling workflows with sub-second latency, it provides a seamless, natural experience that eliminates...
YTVidHub
YTVidHub is a professional bulk YouTube subtitle downloader and transcript extractor designed for high-volume data collection. It allows users to extract subtitles from entire playlists and channels in a single click, supporting formats like SRT, VTT, and clean TXT. The...
Dictato
Dictato is a private, fast voice-to-text dictation application specifically built for macOS. It allows users to transcribe speech directly into any application—such as Gmail, Slack, or VS Code—using a global hotkey. The app operates 100% on-device, meaning no audio data is...

Reloop
Reloop is an AI-powered UGC (User-Generated Content) video generator designed to create high-converting video ads without requiring complex prompting or technical skills. It features a conversational creative agent that handles video production end-to-end—from understanding...

Guideless
Guideless is an AI-powered documentation and video guide platform designed to transform browser-based workflows into professional, narrated videos. It eliminates the frustration of manual video editing by using a Chrome extension to capture clicks and automatically generating...
trnscrb
trnscrb is a local meeting transcription tool for macOS that lives in the menu bar and automatically detects meetings on platforms like Zoom, Google Meet, Microsoft Teams, Slack, and FaceTime. It utilizes OpenAI's Whisper model to perform on-device transcription via...

Prism
Prism is an all-in-one AI video creation platform designed for making short-form content without needing multiple external tools. It allows users to generate image and video assets using various state-of-the-art models like Sora, Kling, and Veo, organize them into projects,...

Stage Captions
Stage Captions is a professional, browser-based real-time closed captioning software designed for live events, conferences, and broadcasts. It utilizes an advanced AI engine to deliver production-ready live transcription with industry-leading low latency. The platform allows...

Obi
Obi, developed by Cor (Corellian Systems), is a voice AI agent designed for customer onboarding and user activation. It functions like a live video call, using voice and on-screen awareness to guide users through product setups, share best practices, and answer questions in...

PopAir
PopAir is a native macOS AI copilot designed for speed and seamless system-wide integration. Built with SwiftUI to consume significantly less RAM than Electron-based apps, it serves as a unified hub for leading AI models including GPT, Claude, Gemini, and DeepSeek. The...
TalkToPost
TalkToPost is an AI-driven platform specifically designed to convert raw voice notes into professional, high-engagement content for LinkedIn, X (Twitter), and Reddit. It functions by transcribing spoken ideas, analyzing the user's tone and structure, and then...
Showing 24 of 1,933 tools