November 18, 2025

# Introducing Parallel FindAll

Parallel's new FindAll API turns natural language queries into custom datasets from the web. It finds entities like companies, people, or locations based on your criteria, then enriches them with structured data—all with citations. FindAll Pro achieves 61% recall, 3x better than competitors.

Tags:Product,Benchmarks

Reading time: 4 min

Today, we're announcing the newest product in our suite of **Web Agent APIs**: the **FindAll API**.

**FindAll** is the best way to create your own custom database from the web, with just a simple natural language query. It’s available now to try in the Parallel Developer Platform[Developer Platform].

## Turn the web into your own structured dataset

**FindAll** finds any set of entities (companies, people, events, locations, houses, etc.) based on a set of match criteria. For example, with **FindAll, **you can run a natural language query like “Find all dental practices located in Ohio that have 4+ star Google reviews.”

Find all dental practicies in ohio with a 4+ star rating on google — An example of a FindAll query

This is a powerful way to discover the complete long tail of interesting entities from the web and filter them down with match criteria that are personalized to your unique use case. The result is an extensible tool that can produce high-quality datasets on demand, as opposed to buying static, stale, and generic datasets.

## How FindAll works

FindAll executes a three-stage pipeline optimized for both coverage and efficiency:

**1. Generate candidates from web data: FindAll** searches across our proprietary web index to identify potential entities matching your query. Unlike traditional search, which returns a fixed result set, **FindAll** generates candidates dynamically based on your specific criteria.

**2. Evaluate against match conditions: **Each candidate is evaluated against your match conditions using multi-hop reasoning across web sources. Only candidates which satisfy all conditions reach matched status and are included in the results. This staged approach means you only pay to process entities that actually matter.

**3. Extract Structured Enrichments: **For matched entities, **FindAll** automatically orchestrates our **Task API**[**Task API**] to extract any additional fields you've specified— from basic attributes like revenue and employee count to complex data points like the strategic initiatives a company is prioritizing.

Illustration demonstrating deep research API concepts, web search capabilities, or AI agent integration features

Every data point returned includes comprehensive verification through our **Basis framework**[**Basis framework**]— citations linking to source materials, detailed reasoning for match decisions, relevant excerpts from web pages, and calibrated confidence scores. This granular attribution enables human-in-the-loop workflows for verifiability and provenance.

## State-of-the-art performance

To test the performance of **FindAll**, we created our own benchmark of 40 complex multi-criteria queries covering public companies, startups, SMBs, specialized entities, and people (e.g., executives, researchers, and professionals). Recall measures the proportion of all correct matches within the entire competitive set of successfully identified entities.

Some sample questions:

- "Find all former McKinsey & Company consultants who are currently employed in C-level or VP positions at healthcare technology startups with Series A or later funding" — combines employment history, current role level, industry focus, and funding stage.
- "Find all wedding venues in Florida with capacity between 150-300 guests that offer both indoor and outdoor ceremony options, provide in-house catering, and have availability in 2025" — combines location, capacity range, facility features, service offerings, and temporal availability.
- "Find all climate technology startups that have active pilot programs with Fortune 500 companies, raised pre-Series A funding, and focus on carbon capture or renewable energy storage" — combines industry focus, corporate partnerships, funding stage, and specific technology areas.

****

**FindAll Pro** achieves state-of-the-art results with 61% recall, ~3X higher than OpenAI Deep Research, Anthropic Deep Research, and Exa. Higher recall means that Parallel **FindAll **finds more correct matches for a given query.** FindAll** **Base** also achieves 30% recall while being the lowest cost on the market, making it the most cost-effective yet performant option.

WISER-FindAll

Recall (%)

FindAll Pro61.3% / 1430CPM

FindAll Core52.5% / 230CPM

FindAll Base30.3% / 60CPM

OpenAI Deep Research21% / 250CPM

Exa19.2% / 110CPM

Anthropic Deep Research15.3% / 1000CPM

COST (CPM)

RECALL (%)

Loading chart...

CPM: USD per 1000 requests. Cost is shown on a Log scale.

Parallel

Others

+Methodology

### Benchmark

This benchmark, created by Parallel, contains 40 complex multi-criteria queries covering public companies, startups, SMBs, specialized entities, and people (e.g., executives, researchers, professionals).

### Methodology

To measure recall we take the number of correct matches / total entities in the ground truth dataset. The ground truth dataset is created by taking the union of all correct matches across the competitor set. Cost is calculated as the average cost to find 1000 correct matches.

### Testing dates

Nov 13th-17th, 2025

### Benchmark

### Methodology

### Testing dates

Nov 13th-17th, 2025

### Parallel-FindAll

| Series   | Model                   | Cost (CPM) | Recall (%) |
| -------- | ----------------------- | ---------- | ---------- |
| Parallel | FindAll Base            | 60         | 30.3       |
| Parallel | FindAll Core            | 230        | 52.5       |
| Parallel | FindAll Pro             | 1430       | 61.3       |
| Others   | OpenAI Deep Research    | 250        | 21         |
| Others   | Anthropic Deep Research | 1000       | 15.3       |
| Others   | Exa                     | 110        | 19.2       |

CPM: USD per 1000 requests. Cost is shown on a Log scale.

### Benchmark

### Methodology

### Testing dates

Nov 13th-17th, 2025

FindAll can be used to find a broad set of entities across a range of criteria. There are many powerful and diverse use cases we’ve seen:

- **Finding sales leads that match your ICP**: “Find all F500 companies with a senior AI leader that joined the company in the last 6 months”
- **Finding acquisition targets as a hedge fund**: "Find all residential roofing companies in Charlotte, NC with 10-50 employees"
- **Finding public companies to invest in**: "Find all S&P 500 companies that cited tariffs as a key risk in their latest 10-K"
- **Finding competitors to keep track of**: "Find all productivity tools targeting remote teams that launched in the last year"
- **Creating market maps**: "Find all AI infrastructure providers that raised Series B in the last 6 months"
- **Finding potential suppliers and factories**: "Find all semiconductor equipment manufacturers with facilities in Southeast Asia."
- **Researching regulatory environments: **“Find all environmental lawsuits in the United States where a court ruling was reached in 2025”

## Get started creating entire datasets from the web

The **FindAll** API is available today. Get started with our Developer Platform[Developer Platform] or dive into the documentation[documentation].

### Create a FindAll run

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
import requests

url = "https://api.parallel.ai/v1beta/findall/runs"

payload = {
    "objective": "<string>",
    "entity_type": "<string>",
    "match_conditions": [
        {
            "name": "<string>",
            "description": "Company must have SOC2 Type II certification (not Type I). Look for evidence in: trust centers, security/compliance pages, audit reports, or press releases specifically mentioning 'SOC2 Type II'. If no explicit SOC2 Type II mention is found, consider requirement not satisfied."
        }
    ],
    "generator": "base",
    "match_limit": 123
}
headers = {
    "x-api-key": "<api-key>",
    "Content-Type": "application/json"
}

response = requests.post(url, json=payload, headers=headers)

print(response.json())``` import requests
 
url = "https://api.parallel.ai/v1beta/findall/runs"
 
payload = {
    "objective": "<string>",
    "entity_type": "<string>",
    "match_conditions": [
        {
            "name": "<string>",
            "description": "Company must have SOC2 Type II certification (not Type I). Look for evidence in: trust centers, security/compliance pages, audit reports, or press releases specifically mentioning 'SOC2 Type II'. If no explicit SOC2 Type II mention is found, consider requirement not satisfied."
        }
    ],
    "generator": "base",
    "match_limit": 123
}
headers = {
    "x-api-key": "<api-key>",
    "Content-Type": "application/json"
}
 
response = requests.post(url, json=payload, headers=headers)
 
print(response.json())
```

## About Parallel Web Systems

Parallel develops critical web search infrastructure for AI. Our suite of web search and agent APIs is built on a rapidly growing proprietary index of the global internet. These solutions transform human tasks that previously took days and weeks into agentic tasks that now take seconds and minutes.

Fortune 100 and 500 companies use Parallel’s web intelligence APIs in insurance, finance, and retail, as well as AI-first businesses like Clay, Starbridge, and Sourcegraph.

## Ready to get started?

Try Parallel[Try Parallel]Contact sales[Contact sales]

Are you an agent? Read this to onboard Parallel[Are you an agent? Read this to onboard Parallel]

By Parallel

November 18, 2025

## Related Posts77

Jul 20, 2026

- [Building a vendor intelligence system with Parallel](https://parallel.ai/blog/vendor-intelligence-system)

Tags:Developers

Author: By Sahith Jagarlamudi

Jul 16, 2026

# Introducing Parallel FindAll

## Turn the web into your own structured dataset

## How FindAll works

## State-of-the-art performance

### Benchmark

### Methodology

### Testing dates

### Benchmark

### Methodology

### Testing dates

### Parallel-FindAll

### Benchmark

### Methodology

### Testing dates

FindAll can be used to find a broad set of entities across a range of criteria. There are many powerful and diverse use cases we’ve seen:

## Get started creating entire datasets from the web

## About Parallel Web Systems

## Ready to get started?

## Related Posts77

- [Building a vendor intelligence system with Parallel](https://parallel.ai/blog/vendor-intelligence-system)

- [Parallel and Google Cloud Announce Partnership for Agentic Web Search on Gemini Enterprise Agent Platform](https://parallel.ai/blog/google-cloud-partnership)

- [$5 in free Parallel credits, every month](https://parallel.ai/blog/free-tier-parallel)

- [Introducing Parallel Search Turbo](https://parallel.ai/blog/parallel-search-turbo)

- [How Nooks cut web search costs 70.5% by switching to Parallel](https://parallel.ai/blog/case-study-nooks)

- [How Build created live geofenced alerts powered by Parallel for institutional real estate](https://parallel.ai/blog/case-study-build)

- [OpenClaw now has free, LLM-optimized web search by default powered by Parallel](https://parallel.ai/blog/free-web-search-openclaw)

- [Introducing real-time Entity Search](https://parallel.ai/blog/entity-search-company)

- [How we enrich & triage inbound leads using the Parallel Task API](https://parallel.ai/blog/enrich-triage-inbound-leads-parallel-task-api)

- [How AirOps creates citation-worthy content at scale, powered by Parallel](https://parallel.ai/blog/case-study-airops)

- [Introducing Index by Parallel](https://parallel.ai/blog/introducing-index-by-parallel)

- [Parallel Monitor API: New processor tiers, snapshots and event streams, and Basis on every event](https://parallel.ai/blog/monitor-api)

- [How we built parallelmpp.dev](https://parallel.ai/blog/parallel-mpp-dev)

- [How Actively's Per Account Agents use Parallel to turn the entire web into a proactive sales intelligence layer](https://parallel.ai/blog/case-study-actively)

- [Parallel Raises at $2 Billion Valuation to Scale Web Infrastructure for Agents](https://parallel.ai/blog/series-b)

- [Building a free CLI agent with Pi, Ollama, Gemma 4, and Parallel](https://parallel.ai/blog/free-CLI-agent)

- [Parallel Search is now free for agents via MCP](https://parallel.ai/blog/free-web-search-mcp)

- [Upgrades to the Parallel Search & Extract APIs](https://parallel.ai/blog/parallel-search-api)

- [How Finch is scaling plaintiff law with AI agents that research like associates](https://parallel.ai/blog/case-study-finch)

- [Genpact and Parallel Web Systems Partner to Drive Tangible Efficiency from AI Systems](https://parallel.ai/blog/genpact-parallel-partnership)

- [How Genpact helps top US insurers cut contents claims processing times in half with Parallel ](https://parallel.ai/blog/case-study-genpact)

- [A new deep research frontier on DeepSearchQA with the Task API Harness](https://parallel.ai/blog/deep-research)

- [How Modal saves tens of thousands annually by building in-house GTM pipelines with Parallel](https://parallel.ai/blog/case-study-modal)

- [How Opendoor uses Parallel as the enterprise grade web research layer powering its AI-native real estate operations](https://parallel.ai/blog/case-study-opendoor)

- [Introducing stateful web research agents with multi-turn conversations](https://parallel.ai/blog/task-api-interactions)

- [Parallel is live on Tempo, now available natively to agents with the Machine Payments Protocol](https://parallel.ai/blog/tempo-stripe-mpp)

- [How Parallel helped Kepler build AI that finance professionals can actually trust](https://parallel.ai/blog/case-study-kepler)

- [Introducing the Parallel CLI](https://parallel.ai/blog/parallel-cli)

- [How Profound helps brands win AI Search with high-quality web research and content creation powered by Parallel](https://parallel.ai/blog/case-study-profound)

- [How Harvey is expanding legal AI internationally with Parallel](https://parallel.ai/blog/case-study-harvey)

- [How Tabstack by Mozilla enables agents to navigate the web with Parallel’s best-in-class web search](https://parallel.ai/blog/case-study-tabstack)

- [Parallel Web Tools and Agents now available across Vercel AI Gateway, AI SDK, and Marketplace](https://parallel.ai/blog/vercel)

- [Authenticated page access for the Parallel Task API](https://parallel.ai/blog/authenticated-page-access)

- [Introducing structured outputs for the Monitor API](https://parallel.ai/blog/structured-outputs-monitor)

- [Introducing research models with Basis for the Parallel Chat API](https://parallel.ai/blog/research-models-chat)

- [Build a real-time fact checker with Parallel and Cerebras](https://parallel.ai/blog/cerebras-fact-checker)

- [Parallel Task API achieves state-of-the-art accuracy on DeepSearchQA](https://parallel.ai/blog/deepsearch-qa)

- [Introducing Granular Basis for the Task API](https://parallel.ai/blog/granular-basis-task-api)

- [How Amp’s coding agents build better software with Parallel Search](https://parallel.ai/blog/case-study-amp)

- [Latency improvements on the Parallel Task API ](https://parallel.ai/blog/task-api-latency)

- [Introducing Parallel Extract](https://parallel.ai/blog/introducing-parallel-extract)

- [Introducing Parallel Monitor](https://parallel.ai/blog/monitor-api-beta)

- [Parallel raises $100M Series A to build web infrastructure for agents](https://parallel.ai/blog/series-a)

- [How Macroscope reduced code review false positives with Parallel](https://parallel.ai/blog/case-study-macroscope)

- [Introducing Parallel Search](https://parallel.ai/blog/parallel-search-api-beta)

- [Parallel processors set new price-performance standard on SealQA benchmark](https://parallel.ai/blog/benchmarks-task-api-sealqa)

- [Introducing LLMTEXT, an open source toolkit for the llms.txt standard](https://parallel.ai/blog/LLMTEXT-for-llmstxt)

- [How Starbridge powers public sector GTM with state-of-the-art web research](https://parallel.ai/blog/case-study-starbridge)

- [Building a market research platform with Parallel Deep Research](https://parallel.ai/blog/cookbook-market-research-platform-with-parallel)

- [How Lindy brings state-of-the-art web research to automation flows](https://parallel.ai/blog/case-study-lindy)

- [Introducing the Parallel Task MCP Server](https://parallel.ai/blog/parallel-task-mcp-server)

- [Introducing the Core2x Processor for improved compute control on the Task API](https://parallel.ai/blog/core2x-processor)

- [How Day AI merges private and public data for business intelligence](https://parallel.ai/blog/case-study-day-ai)

- [Full Basis framework for all Task API Processors](https://parallel.ai/blog/full-basis-framework-for-task-api)

- [Building a real-time streaming task manager with Parallel](https://parallel.ai/blog/cookbook-sse-task-manager-with-parallel)

- [How Gumloop built a new AI automation framework with web intelligence as a core node](https://parallel.ai/blog/case-study-gumloop)

- [Introducing the TypeScript SDK](https://parallel.ai/blog/typescript-sdk)

- [Building a serverless competitive intelligence platform with MCP + Task API](https://parallel.ai/blog/cookbook-competitor-research-with-reddit-mcp)

- [Introducing Parallel Deep Research reports](https://parallel.ai/blog/deep-research-reports)

- [A new pareto-frontier for Deep Research price-performance](https://parallel.ai/blog/deep-research-benchmarks)

- [Building a Full-Stack Search Agent with Parallel and Cerebras](https://parallel.ai/blog/cookbook-search-agent)