Claude Agent Skill · by Dkyazzentwatwa

Ocr Document Processor

The ocr-document-processor skill extracts text and structured data from scanned images, PDFs, and handwritten documents using optical character recognition (OCR

Install
Terminal · npx
$npx skills add https://github.com/dkyazzentwatwa/chatgpt-skills --skill ocr-document-processor
Works with Paperclip

How Ocr Document Processor fits into a Paperclip company.

Ocr Document Processor drops into any Paperclip agent that handles this kind of work. Assign it to a specialist inside a pre-configured PaperclipOrg company and the skill becomes available on every heartbeat — no prompt engineering, no tool wiring.

S
SaaS FactoryPaired

Pre-configured AI company — 18 agents, 18 skills, one-time purchase.

$27$59
Explore pack
Source file
SKILL.md32 lines
Expand
---name: ocr-document-processordescription: Extract text and structure from scans, images, and scanned PDFs. Use for OCR, searchable PDFs, table extraction, receipt parsing, and business card parsing.--- # OCR Document Processor Handle OCR-heavy inputs where text must be recovered from images or scanned pages. ## Use This For - OCR on images and scanned PDFs- Searchable PDF export- Structured extraction to text, markdown, JSON, or HTML- Table extraction from scanned material- Receipt parsing and business card parsing ## Workflow 1. Decide whether plain OCR, structured extraction, or document-specific parsing is needed.2. Preprocess noisy inputs before extraction when skew, blur, or shadows are present.3. Use `scripts/ocr_processor.py` for core OCR tasks.4. Use the focused helpers when the input is specialized:   - `scripts/business_card_scanner.py`   - `scripts/receipt_scanner.py`5. Return confidence caveats when the source is low quality, rotated, handwritten, or multilingual. ## Guardrails - Prefer explicit language selection when accuracy matters.- Do not claim fields are exact when OCR confidence is weak.- Route non-scanned digital PDFs to `document-converter-suite` instead of OCR by default.