
Replicate
Run any AI model in the cloud via API — pay per second of GPU time, no infrastructure to manage.
Quick info
- Pricing
- Freemium
- Categories
- Automation
- Last updated
- May 6, 2026
Description
Replicate hosts thousands of community-uploaded AI models behind a single API. Pick Flux for images, Whisper for transcription, Llama for chat, or your own fine-tune; each call returns a result and bills per second of GPU. Strong for prototyping pipelines that combine multiple models without renting your own GPUs.




