Are my documents safe?

Yes. OCR runs entirely in your browser: your files are never uploaded to a server and never leave your device. This matters for scans of contracts, ID documents, or receipts.

Which languages are supported?

English and Polish are enabled by default. You can add German, Ukrainian, French, Spanish, and 100+ more; language models download on demand and are cached in the browser.
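As a rough illustration of how a language selection might be passed to the engine: Tesseract identifies languages by short codes (for example `eng`, `pol`, `deu`) and accepts several at once joined with `+`. The sketch below is an assumption about how such a string could be built; the `LANG_CODES` map, `DEFAULT_LANGS`, and `buildLangString` are hypothetical names, not taken from this tool's code.

```typescript
// Hypothetical mapping from display names to Tesseract language codes.
const LANG_CODES: Record<string, string> = {
  English: "eng",
  Polish: "pol",
  German: "deu",
  Ukrainian: "ukr",
  French: "fra",
  Spanish: "spa",
};

// Assumed defaults, mirroring the FAQ answer (English and Polish).
const DEFAULT_LANGS: string[] = ["English", "Polish"];

// Builds a "+"-joined code string such as "eng+pol+deu".
function buildLangString(extra: string[] = []): string {
  const codes: string[] = [];
  for (const name of DEFAULT_LANGS.concat(extra)) {
    const code = LANG_CODES[name];
    if (code === undefined) {
      throw new Error(`Unknown language: ${name}`);
    }
    // Skip duplicates so a default language is not listed twice.
    if (codes.indexOf(code) === -1) {
      codes.push(code);
    }
  }
  return codes.join("+");
}

console.log(buildLangString(["German"])); // "eng+pol+deu"
```

Only the languages in the resulting string would need a model download, which fits the on-demand behavior described above.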

Does it support PDF files?

Yes. You can upload multi-page PDFs. The tool renders each page and runs OCR on the pages sequentially, then combines the text from all pages into a single output.
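The page-by-page flow can be sketched as a simple sequential loop. This is a minimal sketch, not the tool's actual implementation: `ocrAllPages` and the `RecognizePage` callback are hypothetical, and the callback stands in for whatever renders a page and runs OCR on it. Joining pages with a blank line is also an assumption.

```typescript
// Hypothetical callback: renders one page and returns its recognized text.
type RecognizePage = (pageNumber: number) => Promise<string>;

// Runs OCR on pages 1..pageCount one at a time and joins the results.
async function ocrAllPages(
  pageCount: number,
  recognizePage: RecognizePage
): Promise<string> {
  const texts: string[] = [];
  for (let page = 1; page <= pageCount; page++) {
    // Awaiting inside the loop keeps the pages strictly sequential.
    const text = await recognizePage(page);
    texts.push(text.trim());
  }
  // Assumed separator: one blank line between pages.
  return texts.join("\n\n");
}

// Usage with a stub recognizer in place of a real OCR call.
ocrAllPages(2, async (page) => `Text of page ${page}`).then((combined) => {
  console.log(combined);
});
```

Processing one page at a time keeps memory use bounded to a single rendered page, at the cost of total time growing with the page count.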

How long does the first run take?

The first run downloads the OCR engine (~4 MB) and a language model (~10-15 MB per language). These files are cached, so subsequent runs skip the download and start right away.
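To make the sizes concrete, the first-run download can be estimated from the figures above. The function and constant names below are hypothetical; the numbers are the approximate ones quoted in the answer, so the result is only a rough range.

```typescript
// Approximate sizes quoted in the FAQ answer (in MB).
const ENGINE_MB = 4;
const MODEL_MB_MIN = 10;
const MODEL_MB_MAX = 15;

// Estimates the first-run download range for a given number of languages.
function firstRunDownloadMb(languageCount: number): { min: number; max: number } {
  if (!Number.isInteger(languageCount) || languageCount < 0) {
    throw new RangeError("languageCount must be a non-negative integer");
  }
  return {
    min: ENGINE_MB + MODEL_MB_MIN * languageCount,
    max: ENGINE_MB + MODEL_MB_MAX * languageCount,
  };
}

// Two languages (for example English and Polish): roughly 24 to 34 MB.
console.log(firstRunDownloadMb(2)); // { min: 24, max: 34 }
```

How long that takes depends on the connection speed; once the files are cached, none of it is downloaded again.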

Can I crop a region of the image before OCR?

Yes. In selection mode you can crop a region before running OCR, which is useful for extracting a single paragraph from a scan or the data in a specific row.
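A selection drawn on a scaled-down preview has to be mapped back to the pixels of the full-size image before OCR. The sketch below shows one way to do that mapping; `Rect` and `toImageRect` are hypothetical names, and whether this tool crops the image itself or passes a rectangle to the engine is not stated here. The output shape (left, top, width, height) resembles the `rectangle` option that Tesseract.js accepts in `recognize`.

```typescript
interface Rect {
  left: number;
  top: number;
  width: number;
  height: number;
}

// Maps a selection in preview (display) coordinates to image pixel
// coordinates, clamping the result to the image bounds.
function toImageRect(
  sel: Rect,
  displayW: number,
  displayH: number,
  naturalW: number,
  naturalH: number
): Rect {
  const scaleX = naturalW / displayW;
  const scaleY = naturalH / displayH;
  const clamp = (v: number, lo: number, hi: number) => Math.max(lo, Math.min(hi, v));

  const left = clamp(Math.round(sel.left * scaleX), 0, naturalW);
  const top = clamp(Math.round(sel.top * scaleY), 0, naturalH);
  const right = clamp(Math.round((sel.left + sel.width) * scaleX), left, naturalW);
  const bottom = clamp(Math.round((sel.top + sel.height) * scaleY), top, naturalH);

  return { left, top, width: right - left, height: bottom - top };
}

// A 1600x1200 scan shown at 400x300: every preview pixel covers 4 image pixels.
console.log(toImageRect({ left: 50, top: 100, width: 200, height: 40 }, 400, 300, 1600, 1200));
// { left: 200, top: 400, width: 800, height: 160 }
```

Clamping keeps a selection that is dragged past the preview edge from producing a rectangle outside the image.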

OCR

Extract text from images and PDF files. The tool runs Tesseract.js in your browser, supports 100+ languages, and exports to TXT or DOCX. Everything runs client-side.

Supported input: JPG, PNG, WebP, and PDF. Drag and drop a file or paste a screenshot (Ctrl+V), then click Recognize; the recognized text appears in the output panel.

Free OCR online — Extract text from images, scans, and PDF

How it works

Features

Frequently Asked Questions

Are my documents safe?

Which languages are supported?

How well does it handle Polish diacritics?

Does it support multi-page PDFs?

How long does the first run take?
