Extract text from scanned PDFs using optical character recognition. All processing runs locally in your browser.
Extract text from scanned documents
Optical Character Recognition (OCR) turns scanned documents and image-based PDFs into searchable, selectable text. If you've ever scanned a paper document and ended up with a PDF you can't search or copy text from, this tool solves that problem. It analyzes each page image, recognizes the characters, and produces a new PDF with a searchable text layer.
Searchable Text Layer
Adds invisible text over each page so you can search, select, and copy text from the resulting PDF.
Multi-Language Support
Select from multiple languages to improve recognition accuracy for non-English documents.
Powered by Tesseract.js
Uses Tesseract.js, a JavaScript and WebAssembly port of the open-source Tesseract OCR engine, running entirely in your browser.
No Server Upload
Your scanned documents stay private. The OCR engine runs locally in your browser, so your files are never uploaded to a server.
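To make the searchable text layer more concrete, here is a minimal sketch of one step it typically involves: mapping a recognized word's bounding box from image pixels to PDF page coordinates so the invisible text lines up with the scanned image. It is an illustration, not this tool's actual code. The `PixelBox` and `PdfRect` types and the `pixelBoxToPdfRect` function are hypothetical names, and the sketch assumes the page was rasterized at a known DPI, that OCR boxes use a top-left origin, and that the PDF page uses the default bottom-left origin with 72 points per inch.

```typescript
// Hypothetical shapes for illustration; these are not Tesseract.js types.
interface PixelBox {
  x0: number; // left edge, in pixels from the left of the image
  y0: number; // top edge, in pixels from the top of the image
  x1: number; // right edge
  y1: number; // bottom edge
}

interface PdfRect {
  x: number; // left edge, in points from the left of the page
  y: number; // bottom edge, in points from the bottom of the page
  width: number;
  height: number;
}

// Convert an OCR bounding box (pixels, top-left origin) into a PDF
// rectangle (points, bottom-left origin). Assumes the image covers the
// whole page and was rendered at `dpi` pixels per inch.
function pixelBoxToPdfRect(
  box: PixelBox,
  imageHeightPx: number,
  dpi: number
): PdfRect {
  if (dpi <= 0) {
    throw new RangeError("dpi must be positive");
  }
  // Default PDF user space has 72 points per inch.
  const toPoints = (px: number): number => (px * 72) / dpi;
  return {
    x: toPoints(box.x0),
    // Flip the vertical axis: PDF y grows upward from the page bottom.
    y: toPoints(imageHeightPx - box.y1),
    width: toPoints(box.x1 - box.x0),
    height: toPoints(box.y1 - box.y0),
  };
}

// Example: a word on a US Letter page scanned at 300 DPI (2550 x 3300 px).
const wordRect = pixelBoxToPdfRect(
  { x0: 300, y0: 600, x1: 900, y1: 660 },
  3300,
  300
);
console.log(wordRect);
```

A PDF writer could then draw the recognized word inside that rectangle using text rendering mode 3 (invisible), which keeps the text searchable and selectable without changing how the page looks.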