

prunaai / z-image-turbo
Z-Image Turbo is a super fast text-to-image model of 6B parameters developed by Tongyi-MAI.
19.6M runs


krthr / clip-embeddings
Generate CLIP (clip-vit-large-patch14) text & image embeddings
51.5M runs


prunaai / p-image-edit
A sub 1 second 0.01$ multi-image editing model built for production use cases. For image generation, check out p-image here: https://replicate.com/prunaai/p-image
12M runs


beautyyuyanli / multilingual-e5-large
multilingual-e5-large: A multi-language text embedding model
58.6M runs
Generate videos using xAI's Grok Imagine Video model
670 runs

Agentic image model optimized for robust, high-precision generations supporting font control
2.9K runs
Latest video model from Pixverse with astonishing physics
2.1K runs

Moonshot AI's latest open model. It unifies vision and text, thinking and non-thinking modes, and single-agent and multi-agent execution into one model
2.6K runs

Google's most intelligent model built for speed with frontier intelligence, superior search, and grounding
118.3K runs

A unified Text-to-Speech demo featuring three powerful modes: Voice, Clone and Design
20.4K runs

Very fast image generation and editing model. 4 steps distilled, sub-second inference for production and near real-time applications.
2.5M runs
Kling 2.6 Pro: Top-tier image-to-video with cinematic visuals, fluid motion, and native audio generation
109.1K runs

openai/gpt-image-1.5OpenAI's latest image generation model with better instruction following and adherence to prompts
2.3M runs

bytedance/seedream-4.5Seedream 4.5: Upgraded Bytedance image model with stronger spatial understanding and world knowledge
2.9M runs

Google's state of the art image generation and editing model 🍌🍌
12.5M runs

Compose a song from a prompt or a composition plan
2.5K runs
Official models are always on, maintained, and have predictable pricing.

Remove dust and scratches from old photos

Image colorization model from Topaz Labs

Minimax Speech 2.8 Turbo: Turn text into natural, expressive speech with voice cloning, emotion control, and support for 40+ languages

Minimax Speech 2.8 HD focuses on high-fidelity audio generation with features like studio-grade quality, flexible emotion control, multilingual support, and voice cloning capabilities
Generate videos using xAI's Grok Imagine Video model

Anthropic's most intelligent model with state-of-the-art coding, reasoning, and agentic capabilities
Image-to-video generation with optional audio, multi-shot narrative support, and faster inference
Create avatar videos with realistic humans, animals, cartoons, or stylized characters

Render product images with 100% accuracy and environmental blending

Agentic image model optimized for robust, high-precision generations supporting font control

Agentic image model optimized for high-quality, fast generations supporting font control
Latest video model from Pixverse with astonishing physics

A version of FLUX.2 [klein] 9B-base that supports fast fine-tuned lora inference

A version of FLUX.2 [klein] 4B-base that supports fast fine-tuned lora inference
Use audio input with an image or prompt to generate videos

Moonshot AI's latest open model. It unifies vision and text, thinking and non-thinking modes, and single-agent and multi-agent execution into one model

Google's most intelligent model built for speed with frontier intelligence, superior search, and grounding

A unified Text-to-Speech demo featuring three powerful modes: Voice, Clone and Design

4 step distilled version of FLUX.2 [klein]. A foundation model for maximum flexibility and control

Un-distilled version of FLUX.2 [klein]. A foundation model for maximum flexibility and control
Use AI to generate images & photos with an API
Use AI to caption videos with an API
Use AI for text-to-speech or to clone your voice via API
Use AI to generate images from a face with an API
Use AI to generate videos with an API
Use AI to upscale images with super resolution with an API
Use AI to generate music with an API
Use AI to edit any image via API
Use AI to transcribe speech to text via API
Use AI For Optical Character Recognition (OCR) to extract text from images via API
Use AI to remove backgrounds from images and videos with an API
FLUX AI models: advanced image generation & editing via API
Use AI to restore images via API
Use AI to enhance videos via API - Replicate
Detect NSFW content in images and text
Classify text by sentiment, topic, intent, or safety
Identify speakers from audio and video inputs
Replace faces across images with natural-looking results.
Transform rough sketches into polished visuals
Generate custom emojis from text or images
Create anime-style characters, scenes, and animations
Try AI Models for free: video generation, image generation, upscaling, and photo restoration
Explore Large Language Models (LLMs) for chat, generation & NLP tasks via API
Use AI to Generate Videos from Images with API
Use AI to generate lipsync videos with an API
Use AI to create 3D content with an API
Chat with images for understanding, captioning & detection via API
Use AI to control image generation with an API
Embedding models for AI search and analysis
Use AI to edit your videos with an API
Use AI object detection and segmentation models to distinguish objects in images & videos
Official AI models: Always available, stable, and predictably priced
Flux fine-tunes: build and run custom AI image models via API
Kontext fine-tunes: Build custom AI image models with an API
Create songs with voice cloning models via API
AI media utilities: auto-caption, watermark, frame extraction & more via API
Browse the diverse range of qwen-image fine-tunes the community has custom-trained on Replicate.
WAN family of models: powerful image-to-video & text-to-video models
Use AI To Caption Images with an API


topazlabs / dust-and-scratch-v2
Remove dust and scratches from old photos
12 runs


topazlabs / image-colorization
Image colorization model from Topaz Labs
18 runs


visoar / ace-step-1.5
Music generation
76 runs

minimax / speech-2.8-turbo
Minimax Speech 2.8 Turbo: Turn text into natural, expressive speech with voice cloning, emotion control, and support for 40+ languages
117 runs

minimax / speech-2.8-hd
Minimax Speech 2.8 HD focuses on high-fidelity audio generation with features like studio-grade quality, flexible emotion control, multilingual support, and voice cloning capabilities
265 runs
xai / grok-imagine-video
Generate videos using xAI's Grok Imagine Video model
670 runs


annaclaradsg20 / annanovo
9 runs

anthropic / claude-opus-4.6
Anthropic's most intelligent model with state-of-the-art coding, reasoning, and agentic capabilities
759 runs


thanhnew2001test / nghitts3
NghiTTS API for Vietnamese
16 runs
wan-video / wan2.6-i2v-flash
Image-to-video generation with optional audio, multi-shot narrative support, and faster inference
312 runs


geopti / sam-audio-large
SAM-Audio is a foundation model for isolating any sound in audio using text
117 runs


geopti / sam-audio-base
A foundation model for isolating any sound in audio using text, visual, or temporal prompts
27 runs