
Replicate AI
A cloud API platform for running, fine-tuning, and deploying open-source machine learning models with ease.
About Replicate AI
Replicate is a comprehensive cloud-based API platform that enables users to run, fine-tune, and deploy a wide range of open-source machine learning models. With support for thousands of models contributed by the community, it offers production-ready APIs for tasks like image creation, video synthesis, captioning, speech and music generation, and image restoration. Users can deploy models at scale with minimal code and automatic resource management.
How to Use
Easily run pre-trained models with a single line of code, fine-tune models using your own data, or deploy custom models with Cog. The platform automatically scales to meet demand, and billing is based on actual compute usage.
Features
Use Cases
Best For
Pros
Cons
Pricing Plans
Choose the perfect plan for your needs. All plans include 24/7 support and regular updates.
anthropic/claude-3.7-sonnet
Advanced Claude model with hybrid reasoning capabilities, ideal for complex AI tasks
black-forest-labs/flux-1.1-pro
High-quality text-to-image model combining prompt accuracy and diverse outputs for creative projects
black-forest-labs/flux-dev
12-billion-parameter transformer capable of generating detailed images from text descriptions
black-forest-labs/flux-schnell
Fastest image generation model optimized for local development and personal use
deepseek-ai/deepseek-r1
Reinforcement-learned reasoning model comparable to OpenAI's GPT-3
google/veo-2
State-of-the-art video generation model that accurately follows instructions and replicates real-world physics
ideogram-ai/ideogram-v3-quality
Premium Ideogram v3 model producing highly realistic images with creative and consistent styles
recraft-ai/recraft-v3
Leading text-to-image model capable of generating detailed images and long texts, proven SOTA performance
wavespeedai/wan-2.1-i2v-480p
Optimized inference model for converting images to videos at 480p resolution
wavespeedai/wan-2.1-i2v-720p
High-resolution image-to-video model at 720p, pushing the limits of video synthesis
CPU
Central processing unit for cost-effective model execution
Nvidia A100 (80GB) GPU
High-performance GPU for intensive machine learning workloads
2x Nvidia A100 (80GB) GPU
Dual GPU setup for large-scale AI model training and inference
4x Nvidia A100 (80GB) GPU
Quad GPU configuration for demanding AI applications
8x Nvidia A100 (80GB) GPU
Extreme GPU setup for intensive AI model deployment
Nvidia H100 GPU
Next-generation GPU optimized for AI and deep learning tasks
Nvidia L40S GPU
Versatile GPU suitable for diverse machine learning workloads
2x Nvidia L40S GPU
Dual L40S setup for enhanced AI processing power
4x Nvidia L40S GPU
Quad L40S configuration for scalable AI applications
8x Nvidia L40S GPU
High-capacity GPU array for large-scale AI deployment
Nvidia T4 GPU
Cost-effective GPU for inference and small-scale training
2x Nvidia H100 GPU
Dual H100 GPUs for advanced AI workflows
4x Nvidia H100 GPU
Powerful GPU setup for high-demand AI tasks
8x Nvidia H100 GPU
Maximum GPU configuration for large-scale AI processing
Frequently Asked Questions
Find answers to common questions about Replicate AI
