Basic OCR

Extract text from scanned or image-based PDFs and download the result as plain text (.txt) or Word (.docx). All processing happens in your browser.

How to use it

Works best on clearly printed, high-contrast scans of typed text in common languages.
Scan at 300 DPI or higher for the best results.
Handwriting, decorative fonts, and low-resolution scans reduce accuracy.
Complex tables and multi-column layouts may lose their structure in the extracted text.
Large files process more slowly, especially on low-end devices.
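
To make the 300 DPI guidance concrete, here is a rough sketch of the arithmetic behind it. It assumes a page is rasterized to pixels before OCR, which this page does not describe. PDF page sizes are measured in points (72 per inch), so the pixel size at a given DPI follows directly. The function names are hypothetical and are not part of the tool.

```typescript
// PDF page dimensions are expressed in points; 1 point = 1/72 inch.
const POINTS_PER_INCH = 72;

interface PixelSize {
  width: number;
  height: number;
}

// Pixel dimensions of a page (given in points) when rendered at targetDpi.
// Hypothetical helper for illustration only.
function pixelSizeAtDpi(widthPt: number, heightPt: number, targetDpi: number): PixelSize {
  if (!(targetDpi > 0)) {
    throw new RangeError("targetDpi must be positive");
  }
  return {
    width: Math.round((widthPt * targetDpi) / POINTS_PER_INCH),
    height: Math.round((heightPt * targetDpi) / POINTS_PER_INCH),
  };
}

// Approximate memory for one uncompressed RGBA bitmap (4 bytes per pixel).
function rgbaBytes(size: PixelSize): number {
  return size.width * size.height * 4;
}

// US Letter is 612 x 792 points (8.5 x 11 inches).
const letterAt300 = pixelSizeAtDpi(612, 792, 300);
console.log(letterAt300); // { width: 2550, height: 3300 }
console.log(rgbaBytes(letterAt300)); // 33660000 bytes, roughly 34 MB per page
```

The second figure suggests why large, multi-page files can be slow on low-end devices: under these assumptions, each Letter page at 300 DPI is tens of megabytes of pixel data before any recognition starts.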

Does this tool send my PDF anywhere?

No. OCR runs entirely in your browser using Tesseract.js. Your file never leaves your device.

What output formats can I download?

Extracted text is available as .txt (plain text) or .docx (Word). Choose from the results panel.

Can this read handwriting?

Not reliably. The tool works best on clear typed text. Handwriting, low-contrast scans, and decorative fonts will reduce accuracy.

Is there a file size limit?

The tool accepts PDFs up to 25 MB. Within that limit, larger files take longer to process, depending on your device.
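
As an illustration of how a 25 MB limit might be checked before processing starts, here is a minimal sketch. It assumes "25 MB" means 25 × 1024 × 1024 bytes; the tool may instead use 25,000,000 bytes. `checkPdfSize` and `FileLike` are hypothetical names, not the tool's actual code. In a browser, a `File` object from a file input has the same `name` and `size` properties.

```typescript
// Assumption: the limit is counted in binary megabytes (25 * 1024 * 1024 bytes).
const MAX_PDF_BYTES = 25 * 1024 * 1024;

// Minimal shape shared with the browser's File object.
interface FileLike {
  name: string;
  size: number; // size in bytes
}

interface SizeCheck {
  ok: boolean;
  message: string;
}

// Hypothetical pre-flight check run before any OCR work begins.
function checkPdfSize(file: FileLike, maxBytes: number = MAX_PDF_BYTES): SizeCheck {
  const sizeMb = (file.size / (1024 * 1024)).toFixed(1);
  if (file.size > maxBytes) {
    const limitMb = (maxBytes / (1024 * 1024)).toFixed(0);
    return {
      ok: false,
      message: `${file.name} is ${sizeMb} MB, which is over the ${limitMb} MB limit.`,
    };
  }
  return { ok: true, message: `${file.name} (${sizeMb} MB) is within the limit.` };
}

console.log(checkPdfSize({ name: "scan.pdf", size: 3 * 1024 * 1024 }).message);
console.log(checkPdfSize({ name: "archive.pdf", size: 40 * 1024 * 1024 }).message);
```

Checking the size up front avoids starting a long OCR run on a file that would be rejected anyway.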

Does it work offline?

Yes. Once the page and language data are cached in your browser, you can process files without an internet connection.

Scanned PDFs are just images of text. To edit them in Word, you first need to extract the text using OCR. Here is the practical workflow.

If you cannot select text in a PDF, it is probably an image-based file. Here is how to extract the text so you can copy, search, or edit it.

A practical workflow for turning a shoebox of paper receipts into organized, searchable PDFs. Useful for tax season and expense tracking.

Your phone is already a high-quality scanner. Here is how to capture documents cleanly and turn them into a professional-looking PDF.