Basic OCR

Extract text from scanned or image-based PDFs and download the results as plain text (.txt) or Word (.docx). Everything is processed in your browser.

How to use it

  1. Drop your scanned PDF into the tool and choose the document language.
  2. The tool runs OCR on each page and extracts the text.
  3. Download results as .txt or .docx from the results panel.
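
As a rough illustration of steps 2 and 3, the sketch below joins per-page text into a single plain-text result. The function name joinPages and the page-separator format are assumptions made for this example; the tool's actual .txt layout may differ.

```typescript
// Hypothetical helper: combines the text extracted from each page
// into one plain-text string, with a labeled separator per page.
function joinPages(pageTexts: string[]): string {
  return pageTexts
    .map(function (text, index) {
      // Pages are numbered from 1; surrounding whitespace is trimmed.
      return "--- Page " + (index + 1) + " ---\n" + text.trim();
    })
    .join("\n\n");
}

// Example: two pages of extracted text.
const combined = joinPages(["First page text. ", " Second page text."]);
console.log(combined);
```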

About this tool

  • Works well on clearly printed, high-contrast scans with typed text in common languages.
  • Best results come from clear scans at 300 DPI or higher.
  • Handwriting, decorative fonts, and low-resolution scans will reduce accuracy.
  • Complex tables and multi-column layouts may lose their structure in the extracted text.
  • Large files process more slowly on low-end devices.

Common questions

Does this tool send my PDF anywhere?

No. OCR runs entirely in your browser using Tesseract.js. Your file never leaves your device.

What output formats can I download?

Extracted text is available as .txt (plain text) or .docx (Word). Choose the format in the results panel.

Can this read handwriting?

Not reliably. The tool works best on clear typed text. Handwriting, low-contrast scans, and decorative fonts will reduce accuracy.

Is there a file size limit?

The tool accepts PDFs up to 25 MB. Within that limit, larger files take longer to process, and how much longer depends on your device.
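
For readers curious how a limit like this can be enforced before any processing starts, here is a minimal sketch. The names MAX_PDF_BYTES and checkPdfSize are hypothetical, and it assumes 1 MB means 1024 × 1024 bytes; the tool itself may count differently.

```typescript
// Assumed limit from the text above: 25 MB, counted as 25 * 1024 * 1024 bytes.
const MAX_PDF_BYTES = 25 * 1024 * 1024;

interface SizeCheck {
  ok: boolean;
  message: string;
}

// Hypothetical helper: decides whether a file of the given size is accepted.
function checkPdfSize(sizeInBytes: number): SizeCheck {
  if (!isFinite(sizeInBytes) || sizeInBytes < 0) {
    return { ok: false, message: "Invalid file size." };
  }
  if (sizeInBytes > MAX_PDF_BYTES) {
    const sizeMb = (sizeInBytes / (1024 * 1024)).toFixed(1);
    return { ok: false, message: "File is " + sizeMb + " MB; the limit is 25 MB." };
  }
  return { ok: true, message: "File size is within the limit." };
}
```

In a browser, the size would typically come from the size property of the selected File object, for example checkPdfSize(file.size).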

Does it work offline?

Yes. Once the page and language data are cached in your browser, you can process files without an internet connection.
