Skip to content

Tavily

Tavily

Tavily is a search and information retrieval platform built for AI agents. It offers specialized endpoints for web search, page content extraction, deep website crawling, and URL structure mapping — all with precise controls for depth, domain, and format. With the Tavily integration in SquadOS, your agents gain access to up-to-date web information without the complexity of managing scrapers, proxies, or rate limits.

This tool uses an API key (API_KEY) to connect.

You will need the following fields:

FieldRequiredDescription
api_keyYesAPI key generated in the Tavily dashboard at app.tavily.com, used to authenticate all requests.
  1. Go to app.tavily.com/home and create an account (or log in if you already have one).
  2. In the dashboard, locate the API Keys section.
  3. Click the ”+” button next to the API Keys section to generate a new key.
  4. Fill in a name for the key, choose the type (Development for up to 100 req/min; Production for up to 1,000 req/min), and optionally set a monthly credit limit.
  5. Copy the generated key — this is the value to use in the api_key field when connecting in SquadOS.
  1. Go to Tools in the side menu (/admin/tools).
  2. Open the Available tab and search for Tavily.
  3. Click the card to open the details and hit Connect.
  4. You’re taken to the secure connection page hosted by Composio, where you enter the API key obtained above.
  5. Once done, you’re sent back to SquadOS with the account connected and the tool available to agents. (Connection-flow details in Organization Tools.)

TAVILY_CRAWL

Tool to perform intelligent graph-based website crawling with parallel path exploration and content extraction. Use when you need to traverse and extract content from multiple pages of a website following specific patterns or instructions. Supports depth/breadth controls, domain filtering, and natural language instructions for guided crawling.

NameTypeRequiredDescription
urlstringYesRoot URL to begin the crawl. Can be provided with or without protocol (e.g., docs.tavily.com or https://docs.tavily.com).
limitintegerNoTotal number of links to process before stopping the crawl.
formatstring ("markdown" | "text")NoFormat of the extracted content: "markdown" or "text".
timeoutintegerNoMaximum wait time in seconds for the crawl operation. Range: 10–150.
max_depthintegerNoMaximum crawl depth from base URL. Range: 1–5. Depth of 1 means only direct links from the root URL.
max_breadthintegerNoMaximum number of links to follow per page level.
instructionsstringNoNatural language guidance for the crawler to find specific pages or content. Using instructions increases cost to 2 credits per 10 pages. Example: "Find all pages about the Python SDK".
select_pathsarrayNoList of regex patterns for specific URL paths to include. Example: ['/docs/.*', '/api/.*'].
exclude_pathsarrayNoList of regex patterns to skip certain URL paths. Example: ['/admin/.*', '/private/.*'].
extract_depthstring ("basic" | "advanced")NoExtraction level for content: "basic" for standard extraction or "advanced" for deeper analysis.
include_usagebooleanNoIf true, includes credit usage information in the response.
allow_externalbooleanNoIf true, includes links to external domains in the crawl.
include_imagesbooleanNoIf true, includes images in the crawl results.
select_domainsarrayNoList of regex patterns for domain filtering. Only URLs matching these patterns will be crawled.
exclude_domainsarrayNoList of regex patterns to exclude certain domains from the crawl.
include_faviconbooleanNoIf true, includes favicon URLs in the results.
chunks_per_sourceintegerNoMaximum content snippets per source (max 500 chars each). Range: 1–5.
NameTypeRequiredDescription
datastringYesData from the action execution.
errorstringNoError message if execution failed.
successfulbooleanYesWhether the action executed successfully.

TAVILY_EXTRACT

Tool to extract and parse web page content from specified URLs using Tavily’s extract endpoint. Use when you need to retrieve clean, structured content from web pages with optional image extraction and content reranking based on query relevance.

NameTypeRequiredDescription
urlsstringYesURL(s) to extract content from. Can be a single URL string or a list of URL strings.
querystringNoUser intent for reranking extracted content chunks. Helps prioritize the most relevant extracted content based on the query.
formatstring ("markdown" | "text")NoContent format for extraction: "markdown" or "text". Default is "markdown".
timeoutnumberNoMaximum wait time in seconds for the extraction request. Must be between 1.0 and 60.0 seconds. Default is 30.0.
extract_depthstring ("basic" | "advanced")NoExtraction depth level: "basic" for standard extraction or "advanced" for more in-depth extraction. Default is "basic".
include_usagebooleanNoIf true, includes credit usage information in the response.
include_imagesbooleanNoIf true, includes a list of image URLs found in the extracted content.
include_faviconbooleanNoIf true, includes the favicon URL for each result.
chunks_per_sourceintegerNoMaximum number of relevant chunks to extract per source. Must be between 1 and 5. Default is 3.
NameTypeRequiredDescription
datastringYesData from the action execution.
errorstringNoError message if execution failed.
successfulbooleanYesWhether the action executed successfully.

TAVILY_GET_USAGE

Tool to retrieve API key and account usage statistics from Tavily. Use when you need to check credit consumption, limits, and per-endpoint usage for search, extract, crawl, map, and research operations.

NameTypeRequiredDescription
datastringYesData from the action execution.
errorstringNoError message if execution failed.
successfulbooleanYesWhether the action executed successfully.

TAVILY_MAP

Tool to map a website and discover its pages. Use when you need to scan a website and get a structured list of URLs/pages it contains without extracting full content.

NameTypeRequiredDescription
urlstringYesThe root URL to begin mapping (e.g., docs.tavily.com). This is the starting point from which the crawler will discover and map pages.
limitintegerNoTotal number of links to process before stopping. Minimum is 1. Default is 50.
timeoutintegerNoMaximum number of seconds to wait for the mapping to complete. Range: 10–150. Default is 150.
max_depthintegerNoHow far from the base URL the crawler explores. Range: 1–5. Default is 1.
max_breadthintegerNoThe number of links to follow per page level. Minimum is 1. Default is 20.
instructionsstringNoNatural language directions for the crawler to guide its exploration. Using this parameter increases cost to 2 credits per 10 pages instead of 1.
select_pathsarrayNoList of regex patterns for specific URL paths to include (e.g., '/docs/.*' to only include documentation paths).
exclude_pathsarrayNoList of regex patterns to skip certain URL paths (e.g., '/admin/.*' to exclude admin pages).
include_usagebooleanNoIf true, includes credit usage details in the response. Default is false.
allow_externalbooleanNoIf true, includes external domain links in the results. Default is true.
select_domainsarrayNoList of regex patterns for domain targeting. Only URLs matching these domain patterns will be included.
exclude_domainsarrayNoList of regex patterns to exclude certain domains from the mapping results.
NameTypeRequiredDescription
datastringYesData from the action execution.
errorstringNoError message if execution failed.
successfulbooleanYesWhether the action executed successfully.

TAVILY_SEARCH

Use this to perform a web search via the Tavily API; offers controls for search depth, content types, result count, and domain filtering. Requires an active Tavily connection (401 = auth failure). Rate limit: ~2 req/s; apply exponential backoff on HTTP 429. Results are nested under response_data.results (not a flat list).

NameTypeRequiredDescription
querystringYesThe search query string to find relevant information online. No native date-filter exists; embed time hints directly in the query string. For broad coverage, issue multiple focused queries rather than one broad query.
max_resultsintegerNoMaximum number of search results to return. Large values combined with include_raw_content=true produce very large payloads.
search_depthstring ("basic" | "advanced")NoSpecifies search depth: "basic" (standard, 1 API Credit) or "advanced" (in-depth, 2 API Credits).
include_answerbooleanNoIf true, attempts to include a direct answer to the query (suitable for factual questions). The answer field can be null; treat response_data.results array as primary evidence.
include_imagesbooleanNoIf true, includes links to relevant images in search results.
exclude_domainsarrayNoA list of domain names (e.g., ['exclude.com', 'othersite.net']) to exclude from search results; results from these domains will be filtered out.
include_domainsarrayNoA list of specific domain names (e.g., ['example.com', 'website.org']) to restrict the search to; only results from these domains are returned.
include_raw_contentbooleanNoIf true, includes raw content from visited websites (e.g., unprocessed HTML or text) in search results. Without this, results may be short snippets that omit critical detail.
NameTypeRequiredDescription
datastringYesData from the action execution.
errorstringNoError message if execution failed.
successfulbooleanYesWhether the action executed successfully.