Specialized OCR for the messy reality of legal documents. We handle what generic providers can’t: handwriting, poor scans, fax headers, and complex tables.

Quick example

curl -X POST https://api.case.dev/ocr/v1/process \
  -H "Authorization: Bearer $CASEDEV_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{}'

| Feature | Why it matters for your app |
| --- | --- |
| Handwriting Recognition | Extract notes and annotations from uploaded documents |
| Table Reconstruction | Preserve structure for financial statements and forms |
| Bates Stamp Handling | Identify and index reference numbers separately |
| Searchable PDF (HOCR) | Return documents with text layers your users can search |

Engine Selection

Choose based on your users’ document types:

| Engine | Best for | Speed |
| --- | --- | --- |
| doctr | Standard documents. High speed, good accuracy for typed text. | Fast |
| paddleocr | Tables and forms. Best-in-class table structure recognition. | Slower |
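
The engine choice above can be wrapped in a small helper for scripts. This is a minimal sketch, assuming your application already tags each upload with a coarse document type. The function name `pick_engine` and the labels `table`, `form`, and `typed` are hypothetical; only the engine names `doctr` and `paddleocr` come from the comparison above.

```shell
#!/usr/bin/env bash
# Hypothetical helper: map a coarse document type to an OCR engine name.
# Only the engine names come from the docs; the type labels are assumptions.
pick_engine() {
  case "$1" in
    table|form) echo "paddleocr" ;;  # table and form structure, slower
    *)          echo "doctr" ;;      # standard typed documents, fast
  esac
}
```

The result could then be passed to the CLI's `--engine` flag, for example `--engine "$(pick_engine form)"`. Defaulting to `doctr` keeps the common case fast and reserves the slower engine for documents that need table structure.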

Output formats

| Format | Description |
| --- | --- |
| text | Plain text extraction |
| json | Structured output with coordinates, confidence scores |
| pdf | Searchable PDF (original with text layer) |

Endpoints

Process

POST /ocr/v1/process — Submit a document for OCR

Status

GET /ocr/v1/:id — Check processing status

Download

GET /ocr/v1/:id/download/:type — Download results
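
The three endpoints suggest a submit, check, download flow. The sketch below covers the last two steps with plain `curl`, under stated assumptions: the base URL is taken from the quick example, the function names are hypothetical, and `:type` is assumed to accept the output format names `text`, `json`, and `pdf`. The shape of the status response is not documented on this page, so `ocr_status` returns the raw body without interpreting it.

```shell
#!/usr/bin/env bash
# Base URL from the quick example; override OCR_BASE_URL when testing.
OCR_BASE_URL="${OCR_BASE_URL:-https://api.case.dev}"

# Build the status URL for a job id (GET /ocr/v1/:id).
ocr_status_url() {
  printf '%s/ocr/v1/%s\n' "$OCR_BASE_URL" "$1"
}

# Build the download URL (GET /ocr/v1/:id/download/:type).
# Assumption: :type is one of the output formats text, json, pdf.
ocr_download_url() {
  case "$2" in
    text|json|pdf)
      printf '%s/ocr/v1/%s/download/%s\n' "$OCR_BASE_URL" "$1" "$2" ;;
    *)
      echo "unknown download type: $2" >&2
      return 1 ;;
  esac
}

# Print the raw status response for a job; no response fields are assumed.
ocr_status() {
  curl -fsS -H "Authorization: Bearer $CASEDEV_API_KEY" "$(ocr_status_url "$1")"
}

# Save a result to a local file: ocr_download <id> <type> <output-path>
ocr_download() {
  local url
  url="$(ocr_download_url "$1" "$2")" || return 1
  curl -fsS -H "Authorization: Bearer $CASEDEV_API_KEY" -o "$3" "$url"
}
```

With a job id from the Process response, `ocr_status "$JOB_ID"` prints the current status body and `ocr_download "$JOB_ID" pdf result.pdf` saves the searchable PDF. Validating the type before calling `curl` avoids sending a request that is likely to be rejected.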

Common patterns

With a webhook callback

casedev ocr:v1 process \
  --document-url "$DOCUMENT_URL" \
  --callback-url "https://your-app.com/webhooks/ocr-complete"

From S3

casedev ocr:v1 process \
  --document-url "s3://your-bucket/documents/upload.pdf"

With table extraction

casedev ocr:v1 process \
  --document-url "$DOCUMENT_URL" \
  --engine paddleocr \
  --features.tables '{"format": "csv"}'

Vault

Store OCR’d documents and make them searchable with semantic search.

LLMs

Analyze extracted text with AI: summarize, classify, and extract entities.