Skip to content

Performance & Cost

For document parsing, performance and cost are tightly coupled: your default mode, how many pages you process, and whether you route by document type.

  • PDFs are priced per page (1 credit per page)
  • fast/auto/ocr changes latency and success rate
  • maxPages is the simplest cost/time cap
Section titled “Recommended: learn your distribution first”
  1. sample ~20 PDFs
  2. estimate how many are text-native vs scanned vs table-heavy
  3. pick defaults per class instead of one-size-fits-all
  • stage 1: auto + maxPages: 5 for quick probing
  • stage 2: for scanned/complex docs, switch to ocr or increase maxPages
  • defaulting everything to ocr without sampling
  • not keeping failure examples for regression testing

Related: