# How Finch is scaling plaintiff law with AI agents that research like associates

Finch replaced a three-step search-extract-validate pipeline with a single Parallel Task API call, cutting per-query costs by 96% and eliminating the defensive engineering overhead that came with an unreliable vendor.

Tags:Case Study

Reading time: 3 min

How Finch is scaling plaintiff law with AI agents that research like associates

## Key highlights

- **90% cost reduction:** per-query cost dropped after migrating to the Task API
- **Zero 5xx errors or timeouts** across three-plus months in production
- **Single-call structured outputs** replaced a three-step search, extract, and validate chain

**Tiered processing** lets Finch match cost and accuracy to query complexity at the individual task level

## About Finch

Finch builds the back office that plaintiff law firms never had. The company pairs an in-house legal team with AI agents to handle the research, case preparation, and administrative work that bogs down attorneys. Lawyers focus on settlements and litigation; Finch handles everything upstream. Their current focus is personal injury pre-litigation, with plans to expand into other plaintiff law areas.

The problem Finch targets is a supply-side bottleneck: 75% of Americans who need legal help can't get it. Law firms hit capacity limits and refer cases out rather than hiring more staff to handle the work.

## The problem

Finch's AI agents run structured research workflows at scale, pulling case-relevant facts from the web, enriching records, and feeding structured outputs into the firm's data models. Before Parallel, Finch powered these workflows through another search API provider.

That vendor's API contract was unreliable. Fields appeared and disappeared between responses with no warning. Finch's engineers wrote defensive validation layers and permissive schemas to keep production from breaking. The search quality compounded the problem: results came back as free text that required a second extraction pass before the data could enter Finch's structured pipelines. Every query became a three-step chain: search, extract, validate.

The engineering overhead of maintaining defensive wrappers around an inconsistent API consumed time that should have gone toward Finch's core product.

## The solution

Finch migrated to Parallel's Task API in November and now runs all web research through it. They break these two into two key workflows:

- **Deterministic workflows.** These are repeatable queries with versioned schemas, the bread-and-butter of legal back-office research. The same categories of questions run against different cases, pushing structured data directly into Finch's data models with a stable output contract. Finch started here because the work is high-volume, well-defined, and predictable, which made the Lite Processor a clean fit for cost and latency.
- **Production agents that write their own Task API specs.** These agents construct their own structured output schemas at runtime with variable depths, depending on the complexity of the research question in front of them.

Finch is also exploring Parallel's Monitor API for passive web monitoring as a future addition to their pipeline, so that agents can kick off research themselves, without human intervention.

## The impact

A 90% drop in per-task cost. Finch now runs higher volumes than before at a fraction of the spend.

> **"We went from spending engineering cycles on defensive validation to spending them on product. The Task API gives us structured outputs with citations in a single call, which is exactly what production legal AI needs."**

> **— Ben Weems, CTO, Finch**

After three-plus months in production, Finch has seen zero 5xx errors or timeouts from Parallel. The team that previously built defensive validation layers around an unreliable API now builds product features instead.

Parallel's tiered pricing lets Finch match compute to complexity across their entire operation. Deterministic workflows run on Lite. Production agents use Core. The tradeoffs between speed, accuracy, and cost are explicit at each tier, so Finch's engineers can make informed decisions at the query level rather than paying a flat rate for capabilities they don't always need

# How Finch is scaling plaintiff law with AI agents that research like associates

## **Key highlights**

## **About Finch**

## **The problem**

## **The solution**

## **The impact**

## Related Posts68

- [How AirOps creates citation-worthy content at scale, powered by Parallel](https://parallel.ai/blog/case-study-airops)

- [Introducing Index by Parallel](https://parallel.ai/blog/introducing-index-by-parallel)

- [Parallel Monitor API: New processor tiers, snapshots and event streams, and Basis on every event](https://parallel.ai/blog/monitor-api)

- [How we built parallelmpp.dev](https://parallel.ai/blog/parallel-mpp-dev)

- [How Actively's Per Account Agents use Parallel to turn the entire web into a proactive sales intelligence layer](https://parallel.ai/blog/case-study-actively)

- [Parallel Raises at $2 Billion Valuation to Scale Web Infrastructure for Agents](https://parallel.ai/blog/series-b)

- [Building a free CLI agent with Pi, Ollama, Gemma 4, and Parallel](https://parallel.ai/blog/free-CLI-agent)

- [Parallel Search is now free for agents via MCP](https://parallel.ai/blog/free-web-search-mcp)

- [Upgrades to the Parallel Search & Extract APIs](https://parallel.ai/blog/parallel-search-api)

- [Genpact and Parallel Web Systems Partner to Drive Tangible Efficiency from AI Systems](https://parallel.ai/blog/genpact-parallel-partnership)

- [How Genpact helps top US insurers cut contents claims processing times in half with Parallel ](https://parallel.ai/blog/case-study-genpact)

- [A new deep research frontier on DeepSearchQA with the Task API Harness](https://parallel.ai/blog/deep-research)

- [How Modal saves tens of thousands annually by building in-house GTM pipelines with Parallel](https://parallel.ai/blog/case-study-modal)

- [How Opendoor uses Parallel as the enterprise grade web research layer powering its AI-native real estate operations](https://parallel.ai/blog/case-study-opendoor)

- [Introducing stateful web research agents with multi-turn conversations](https://parallel.ai/blog/task-api-interactions)

- [Parallel is live on Tempo, now available natively to agents with the Machine Payments Protocol](https://parallel.ai/blog/tempo-stripe-mpp)

- [How Parallel helped Kepler build AI that finance professionals can actually trust](https://parallel.ai/blog/case-study-kepler)

- [Introducing the Parallel CLI](https://parallel.ai/blog/parallel-cli)

- [How Profound helps brands win AI Search with high-quality web research and content creation powered by Parallel](https://parallel.ai/blog/case-study-profound)

- [How Harvey is expanding legal AI internationally with Parallel](https://parallel.ai/blog/case-study-harvey)

- [How Tabstack by Mozilla enables agents to navigate the web with Parallel’s best-in-class web search](https://parallel.ai/blog/case-study-tabstack)

- [Parallel Web Tools and Agents now available across Vercel AI Gateway, AI SDK, and Marketplace](https://parallel.ai/blog/vercel)

- [Authenticated page access for the Parallel Task API](https://parallel.ai/blog/authenticated-page-access)

- [Introducing structured outputs for the Monitor API](https://parallel.ai/blog/structured-outputs-monitor)

- [Introducing research models with Basis for the Parallel Chat API](https://parallel.ai/blog/research-models-chat)

- [Build a real-time fact checker with Parallel and Cerebras](https://parallel.ai/blog/cerebras-fact-checker)

- [Parallel Task API achieves state-of-the-art accuracy on DeepSearchQA](https://parallel.ai/blog/deepsearch-qa)

- [Introducing Granular Basis for the Task API](https://parallel.ai/blog/granular-basis-task-api)

- [How Amp’s coding agents build better software with Parallel Search](https://parallel.ai/blog/case-study-amp)

- [Latency improvements on the Parallel Task API ](https://parallel.ai/blog/task-api-latency)

- [Introducing Parallel Extract](https://parallel.ai/blog/introducing-parallel-extract)

- [Introducing Parallel FindAll](https://parallel.ai/blog/introducing-findall-api)

- [Introducing Parallel Monitor](https://parallel.ai/blog/monitor-api-beta)

- [Parallel raises $100M Series A to build web infrastructure for agents](https://parallel.ai/blog/series-a)

- [How Macroscope reduced code review false positives with Parallel](https://parallel.ai/blog/case-study-macroscope)

- [Introducing Parallel Search](https://parallel.ai/blog/parallel-search-api-beta)

- [Parallel processors set new price-performance standard on SealQA benchmark](https://parallel.ai/blog/benchmarks-task-api-sealqa)

- [Introducing LLMTEXT, an open source toolkit for the llms.txt standard](https://parallel.ai/blog/LLMTEXT-for-llmstxt)

- [How Starbridge powers public sector GTM with state-of-the-art web research](https://parallel.ai/blog/case-study-starbridge)

- [Building a market research platform with Parallel Deep Research](https://parallel.ai/blog/cookbook-market-research-platform-with-parallel)

- [How Lindy brings state-of-the-art web research to automation flows](https://parallel.ai/blog/case-study-lindy)

- [Introducing the Parallel Task MCP Server](https://parallel.ai/blog/parallel-task-mcp-server)

- [Introducing the Core2x Processor for improved compute control on the Task API](https://parallel.ai/blog/core2x-processor)

- [How Day AI merges private and public data for business intelligence](https://parallel.ai/blog/case-study-day-ai)

- [Full Basis framework for all Task API Processors](https://parallel.ai/blog/full-basis-framework-for-task-api)

- [Building a real-time streaming task manager with Parallel](https://parallel.ai/blog/cookbook-sse-task-manager-with-parallel)

- [How Gumloop built a new AI automation framework with web intelligence as a core node](https://parallel.ai/blog/case-study-gumloop)

- [Introducing the TypeScript SDK](https://parallel.ai/blog/typescript-sdk)

- [Building a serverless competitive intelligence platform with MCP + Task API](https://parallel.ai/blog/cookbook-competitor-research-with-reddit-mcp)

- [Introducing Parallel Deep Research reports](https://parallel.ai/blog/deep-research-reports)

- [A new pareto-frontier for Deep Research price-performance](https://parallel.ai/blog/deep-research-benchmarks)

- [Building a Full-Stack Search Agent with Parallel and Cerebras](https://parallel.ai/blog/cookbook-search-agent)

- [Webhooks for the Parallel Task API](https://parallel.ai/blog/webhooks)

- [Introducing Parallel: Web Search Infrastructure for AIs ](https://parallel.ai/blog/introducing-parallel)

- [Introducing SSE for Task Runs](https://parallel.ai/blog/sse-for-tasks)

- [A new line of advanced Processors: Ultra2x, Ultra4x, and Ultra8x ](https://parallel.ai/blog/new-advanced-processors)

- [Introducing Auto Mode for the Parallel Task API](https://parallel.ai/blog/task-api-auto-mode)

- [A state-of-the-art search API purpose-built for agents](https://parallel.ai/blog/search-api-benchmark)

- [Parallel Search MCP Server in Devin](https://parallel.ai/blog/parallel-search-mcp-in-devin)

- [Introducing Tool Calling via MCP Servers](https://parallel.ai/blog/mcp-tool-calling)

- [Introducing the Parallel Search MCP Server ](https://parallel.ai/blog/search-mcp-server)

- [Introducing Source Policy](https://parallel.ai/blog/source-policy)

- [The Parallel Task Group API](https://parallel.ai/blog/task-group-api)

- [State of the Art Deep Research APIs](https://parallel.ai/blog/deep-research-browsecomp)

- [Parallel Search API is now available in alpha](https://parallel.ai/blog/search-api-alpha)

- [Introducing the Parallel Chat API ](https://parallel.ai/blog/chat-api)

- [Introducing Basis with Calibrated Confidences ](https://parallel.ai/blog/introducing-basis-with-calibrated-confidences)

- [Introducing the Parallel Task API](https://parallel.ai/blog/parallel-task-api)

## Key highlights

## About Finch

## The problem

## The solution

## The impact