Skip to content

Replicate

Replicate

Replicate is a cloud AI platform that lets you run machine-learning models via API without setting up or managing servers and GPUs. With the Replicate integration in SquadOS, your agents can create predictions with any public model on the platform, fine-tune models, manage deployments and files, and orchestrate complete generative AI pipelines.

This tool uses an API key (API_KEY) to connect.

You will need the following fields:

FieldRequiredDescription
api_keyYesAPI token generated in your Replicate account, used to authenticate all requests.
  1. Go to replicate.com and sign in (or create a free account).
  2. Navigate to Account settings → API tokens (or go directly to replicate.com/account/api-tokens).
  3. Click Create token, give the token a name, and confirm.
  4. Copy the generated token — this is the value to enter in the api_key field when connecting in SquadOS.
  1. Go to Tools in the side menu (/admin/tools).
  2. Open the Available tab and search for Replicate.
  3. Click the card to open the details modal and hit Connect.
  4. You’re taken to the secure connection page hosted by Composio, where you enter the API token obtained above.
  5. Once done, you’re sent back to SquadOS with the account connected and the tool available for your agents. (Connection-flow details in Organization Tools.)

REPLICATE_ACCOUNT_GET

Tool to get authenticated account information. Use when you need to retrieve details about the account associated with the API token.

NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_CANCEL_PREDICTION

Tool to cancel a prediction that is still running. Use when you need to stop an in-progress prediction to free up resources or halt execution.

NameTypeRequiredDescription
prediction_idstringYesThe ID of the prediction to cancel
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_COLLECTIONS_GET

Tool to get a specific collection of models by its slug. Use when you need detailed information about a collection and its models.

NameTypeRequiredDescription
collection_slugstringYesThe slug identifier of the collection, e.g., ‘super-resolution’ or ‘text-to-image’
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_COLLECTIONS_LIST

Tool to list all collections of models. Use when you need to retrieve available model collections. Collections are curated groupings of related models. Response includes only collection metadata (name, slug, description), not individual models within each collection; use REPLICATE_MODELS_GET for per-model details. Response may include a non-null next field indicating additional pages; follow it to enumerate all collections.

NameTypeRequiredDescription
cursorstringNoCursor for pagination. Use the cursor from ‘next’ URL of previous response.
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_CREATE_MODEL

Tool to create a new Replicate model with specified owner, name, visibility, and hardware. Use when you need to create a destination model before launching LoRA/fine-tune training.

NameTypeRequiredDescription
namestringYesThe name of the model. This must be unique among all models owned by the user or organization.
ownerstringYesThe name of the user or organization that will own the model. This must be the same as the user or organization making the API request.
hardwarestringYesThe SKU for the hardware used to run the model. Possible values can be retrieved from the hardware.list endpoint.
paper_urlstringNoA URL for the model’s paper
github_urlstringNoA URL for the model’s source code on GitHub
visibilitystringYesWhether the model should be public or private. Public models are viewable and runnable by anyone; private models are restricted to owners.
descriptionstringNoA description of the model
license_urlstringNoA URL for the model’s license
cover_image_urlstringNoA URL for the model’s cover image. This should be an image file.
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_CREATE_PREDICTION

Tool to create a prediction for a Replicate Deployment. IMPORTANT: This action ONLY works with Replicate Deployments (persistent instances you create and manage), NOT public models. Deployments are created via REPLICATE_DEPLOYMENTS_CREATE. To run public models (e.g., ‘meta/llama-2-70b-chat’, ‘stability-ai/sdxl’), use REPLICATE_MODELS_PREDICTIONS_CREATE instead. Use ‘wait_for’ to wait until the prediction completes.

NameTypeRequiredDescription
inputobjectYesJSON object of inputs for the model. Keys and types must exactly match the deployment model’s schema; consult REPLICATE_MODELS_README_GET or REPLICATE_MODELS_EXAMPLES_LIST first. For file inputs, verify file IDs via REPLICATE_FILES_LIST before submitting.
webhookstringNoHTTPS URL to receive webhook callbacks
wait_forintegerNoSeconds to wait for prediction completion; adds a ‘Prefer: wait=<n>’ header if provided. Valid range is 1–60; if the prediction does not complete within this window, poll for status or use a webhook instead.
deployment_namestringYesName of the deployment. NOTE: This is the deployment name, NOT a public model name. Deployments are created via REPLICATE_DEPLOYMENTS_CREATE. To run public models, use REPLICATE_MODELS_PREDICTIONS_CREATE instead.
deployment_ownerstringYesUsername or organization that owns the deployment. NOTE: This is for Replicate Deployments only, NOT public models. To run public models, use REPLICATE_MODELS_PREDICTIONS_CREATE instead.
webhook_events_filterarrayNoList of events to trigger webhook calls
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_DEPLOYMENTS_CREATE

Tool to create a new deployment with specified model, version, hardware, and scaling parameters. Use when you need to deploy a model for production use with auto-scaling.

NameTypeRequiredDescription
namestringYesThe deployment’s identifier
modelstringYesFull model name in format owner/name
versionstringYesThe 64-character model version ID to deploy
hardwarestringYesHardware SKU from the hardware.list endpoint
max_instancesintegerYesMaximum instances for auto-scaling
min_instancesintegerYesMinimum instances for auto-scaling
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_DEPLOYMENTS_DELETE

Tool to delete a deployment from your account. Use when you need to remove a deployment. Deployments must be offline and unused for at least 15 minutes before deletion.

NameTypeRequiredDescription
deployment_namestringYesThe name of the deployment to delete. Deployment must be offline and unused for at least 15 minutes before deletion.
deployment_ownerstringYesThe username or organization that owns the deployment
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_DEPLOYMENTS_GET

Tool to get deployment details by owner and name. Use when you need information about a specific deployment including its release configuration and hardware settings.

NameTypeRequiredDescription
deployment_namestringYesThe name of the deployment
deployment_ownerstringYesThe name of the user or organization that owns the deployment
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_DEPLOYMENTS_LIST

Tool to list all deployments associated with the account. Use when you need to retrieve deployment configurations and their latest releases.

NameTypeRequiredDescription
cursorstringNoCursor for pagination. Use the cursor from ‘next’ URL of previous response.
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_FILES_CREATE

Tool to create or upload a file to Replicate. Use when you need to upload file content with optional metadata.

NameTypeRequiredDescription
contentobjectYesFile to upload.
metadataobjectNoUser-provided metadata associated with the file (must be valid JSON)
content_typestringNoThe content/MIME type for the file
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_FILES_DELETE

Tool to delete a file by its ID. Use when you need to remove a file from storage. Returns 204 No Content on success.

NameTypeRequiredDescription
file_idstringYesThe unique identifier of the file to delete
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_FILES_GET

Tool to get details of a file by its ID. Use when you need to inspect uploaded file information before further operations. Returned URLs may be short-lived; download or persist needed files promptly after retrieval.

NameTypeRequiredDescription
file_idstringYesThe ID of the file to retrieve. Must originate from a prior upload or prediction output. Use REPLICATE_FILES_LIST to confirm the ID exists before calling this tool.
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_FILES_LIST

Tool to retrieve a paginated list of uploaded files. Use to view all files created by the authenticated user or organization. Files are sorted with most recent first. Pagination is cursor-based: follow the next cursor until empty to retrieve all files. Limit requests to 1–2/second to avoid 429 Too Many Requests errors. Use to validate current file_ids before passing to prediction tools, as stale file_ids cause runtime errors.

NameTypeRequiredDescription
cursorstringNoCursor for pagination. Use the cursor from ‘next’ URL of previous response.
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_GET_PREDICTION

Tool to get the status and output of a prediction by its ID. Use when you need to check on a running prediction or retrieve the results of a completed prediction.

NameTypeRequiredDescription
prediction_idstringYesThe ID of the prediction to get
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_HARDWARE_LIST

Tool to list available hardware SKUs for models and deployments. Use when you need to see what hardware options are available on the Replicate platform.

NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_MODELS_EXAMPLES_LIST

Tool to list example predictions for a specific model. Use when you want to retrieve author-provided illustrative examples after identifying the model. Returned examples are minimal working payloads; cross-reference with REPLICATE_MODELS_README_GET before calling REPLICATE_CREATE_PREDICTION to satisfy strict input validation.

NameTypeRequiredDescription
cursorstringNoCursor for pagination. Use the cursor from ‘next’ URL of previous response.
model_namestringYesThe name of the model to list examples for. Must exactly match the Replicate URL slug (case-sensitive).
model_ownerstringYesThe name of the user or organization that owns the model. Must exactly match the Replicate URL slug (case-sensitive).
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_MODELS_GET

Tool to get details of a specific model by owner and name. Consult the returned input schema before constructing any prediction request — each model defines its own required/optional fields (e.g., prompt, aspect_ratio, version); missing or unknown keys cause validation errors. Model schemas and available versions may change over time; recheck before production use.

NameTypeRequiredDescription
model_namestringYesThe name of the model, e.g., ‘hello-world’. Must be an exact, case-sensitive Replicate slug (lowercase, hyphenated); e.g., use ‘hello-world’ not ‘Hello World’.
model_ownerstringYesThe username or organization that owns the model, e.g., ‘replicate’. Must be an exact, case-sensitive Replicate slug (lowercase, hyphenated); e.g., use ‘replicate’ not ‘Replicate’.
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_MODELS_LIST

Tool to list public models with pagination and sorting. Use when you need to browse available models or find models sorted by creation date.

NameTypeRequiredDescription
cursorstringNoPagination cursor for navigating through results. Use the ‘next’ URL from previous responses to fetch the next page of results.
sort_bystring (model_created_at | latest_version_created_at)NoField to sort models by. Options are ‘model_created_at’ (when model was created) or ‘latest_version_created_at’ (when latest version was created). Defaults to ‘latest_version_created_at’ if not specified.
sort_directionstring (asc | desc)NoSort direction - ‘asc’ for ascending or ‘desc’ for descending. Defaults to ‘desc’ (newest first) if not specified.
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_MODELS_PREDICTIONS_CREATE

Tool to create a prediction using an official Replicate model. Use when you need to run inference with a specific model using its owner and name. Supports synchronous waiting (up to 60 seconds) and webhooks for async notifications.

NameTypeRequiredDescription
inputobjectYesModel inputs as JSON object. Schema varies by model - each model has its own set of required fields, optional parameters, and validation constraints (e.g., max dimensions for image models). IMPORTANT: Before making predictions, use the REPLICATE_MODELS_GET action to fetch the model’s openapi_schema which contains all valid input parameters and their constraints. For image models like flux-1.1-pro, common constraints include max width/height limits (e.g., 1440px), valid aspect ratios, and output format options.
webhookstringNoHTTPS URL to receive POST requests when prediction updates occur
wait_forintegerNoSeconds to wait for prediction completion (1-60); adds a ‘Prefer: wait=<n>’ header to block and wait synchronously
model_namestringYesThe name of the model as it appears on Replicate. Must be an exact match. Find valid model names by searching at replicate.com or using the search/models_list actions. The full model identifier is {model_owner}/{model_name}.
model_ownerstringYesThe username or organization that owns the model on Replicate. Find valid owners by searching models at replicate.com or using the search/models_list actions. Examples: ‘meta’, ‘stability-ai’, ‘black-forest-labs’, ‘bytedance’, ‘google’.
cancel_afterstringNoMaximum execution duration with optional unit suffixes (s/m/h); minimum 5 seconds. Example: ’60s’ or ‘5m’.
webhook_events_filterarrayNoFilter webhook triggers by event type: ‘start’, ‘output’, ‘logs’, or ‘completed’
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_MODELS_README_GET

Tool to get the README content for a model in Markdown format. Consult alongside REPLICATE_MODELS_EXAMPLES_LIST before calling REPLICATE_CREATE_PREDICTION — Replicate enforces strict JSON schemas on model inputs and returns 422 errors for incorrect keys or types. Use after retrieving model details when you want to view its documentation.

NameTypeRequiredDescription
model_namestringYesThe name of the model, e.g., ‘hello-world’
model_ownerstringYesThe username or organization that owns the model, e.g., ‘replicate’. Must exactly match the Replicate URL slug (case-sensitive).
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_MODELS_VERSIONS_GET

Tool to get a specific version of a model. Use when you need details about a particular model version including its schema and metadata.

NameTypeRequiredDescription
model_namestringYesThe name of the model, e.g., ‘hello-world’
version_idstringYesThe 64-character identifier of the version
model_ownerstringYesThe name of the user or organization that owns the model, e.g., ‘replicate’
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_MODELS_VERSIONS_LIST

Tool to list all versions of a specific model. Use when you need to see all available versions of a model, sorted by newest first.

NameTypeRequiredDescription
cursorstringNoCursor for pagination. Use the cursor from ‘next’ URL of previous response.
model_namestringYesThe name of the model, e.g., ‘hello-world’
model_ownerstringYesThe username or organization that owns the model, e.g., ‘replicate’
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_PREDICTIONS_CREATE

Tool to create a prediction to run a model by version ID. Use when you have a specific model version identifier and need to run inference with provided inputs. Supports synchronous waiting and webhook notifications.

NameTypeRequiredDescription
inputobjectYesModel inputs as JSON object. Schema varies by model. Files should be HTTP URLs (for files >256kb or reusable) or data URLs (for files ≤256kb)
streambooleanNoDeprecated. Streaming availability is now included in response urls
versionstringYesModel version identifier. Accepts formats: {owner}/{name}, {owner}/{name}:{version_id}, or {version_id} alone
webhookstringNoHTTPS URL to receive POST requests when prediction updates occur. Body mirrors the get prediction response
wait_forintegerNoSeconds (1-60) to hold connection open until model finishes. Adds ‘Prefer: wait=<n>’ header
cancel_afterstringNoMaximum runtime before automatic cancellation. Accepts durations with optional suffixes (s/m/h). Minimum: 5 seconds. Examples: ’30s’, ‘5m’, ‘1h30m45s’
webhook_events_filterarrayNoControls webhook triggers. Options: ‘start’, ‘output’, ‘logs’, ‘completed’. Output/logs events throttled to once per 500ms
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_PREDICTIONS_LIST

Tool to list all predictions for the authenticated user or organization with pagination. Use when you need to retrieve prediction history or filter predictions by creation date.

NameTypeRequiredDescription
cursorstringNoCursor for pagination. Use the cursor from ‘next’ URL of previous response.
created_afterstringNoInclude only predictions created at or after this date-time, in ISO 8601 format (e.g., ‘2024-01-01T00:00:00Z’). Useful for filtering recent predictions.
created_beforestringNoInclude only predictions created before this date-time, in ISO 8601 format (e.g., ‘2024-12-31T23:59:59Z’). Useful for filtering older predictions.
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_SEARCH

Tool to search for models, collections, and docs using text queries (beta). Use when you need to find relevant models or collections based on keywords or descriptions.

NameTypeRequiredDescription
limitintegerNoMaximum number of model results to return. Must be between 1 and 50. Defaults to 20 if not specified.
querystringYesThe search query string to execute. Can search across models, collections, and documentation.
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_TRAININGS_CANCEL

Tool to cancel an ongoing training operation in Replicate. Use when you need to stop a training job that is in progress.

NameTypeRequiredDescription
training_idstringYesThe identifier of the training operation to be terminated
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_TRAININGS_CREATE

Tool to create a training job for a specific model version. Use when you need to fine-tune a model with custom training data. Supports webhook notifications for training status updates.

NameTypeRequiredDescription
namestringYesName of the model to train
inputobjectYesJSON object containing inputs to the Cog model’s train() function. Schema varies by model
ownerstringYesName of the user or organization owning the model
webhookstringNoHTTPS URL to receive POST requests when training updates occur
version_idstringYesThe 64-character model version identifier to use for training
destinationstringYesDestination model to push to in format {owner}/{name}
webhook_events_filterarrayNoList of event types that trigger webhook calls. Options: ‘start’, ‘output’, ‘logs’, ‘completed’
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_TRAININGS_LIST

Tool to list all training jobs for the authenticated user or organization with pagination. Use when you need to retrieve training history or check the status of training jobs.

NameTypeRequiredDescription
cursorstringNoCursor for pagination. Use the cursor from ‘next’ URL of previous response.
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_UPDATE_MODELS

Tool to update metadata for a model including description, URLs, and README. Use when you need to modify a model’s visibility, documentation, or associated links.

NameTypeRequiredDescription
readmestringNoThe README content of the model
paper_urlstringNoA URL for the model’s paper
github_urlstringNoA URL for the model’s source code on GitHub
model_namestringYesThe name of the model to update
descriptionstringNoA description of the model
license_urlstringNoA URL for the model’s license
model_ownerstringYesThe name of the user or organization that owns the model
weights_urlstringNoA URL for the model’s weights
NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not

REPLICATE_WEBHOOKS_SECRET_GET

Tool to get the signing secret for the default webhook. Use when you need to retrieve the secret key used to verify webhook authenticity.

NameTypeRequiredDescription
datastringYesData from the action execution
errorstringNoError if any occurred during the execution of the action
successfulbooleanYesWhether or not the action execution was successful or not