Integrated OCR and PDF to Word Converter

Translate images with OCR

Wordbee includes a tool that extracts text from images in formats such as PNG, JPG, TIFF, ICO, BMP and many more.

The extracted text is returned as an HTML file, ready to be edited or translated.

Image to text sample

Let us look at a sample image and the text extracted from it. The result is real text, no longer an image, so you can copy and paste it.

Uploaded image:

Extracted text:

好一朵美丽的茉莉花
好一朵美丽的茉莉花
芬芳美丽满枝桠
又香又白人人夸
让我来将你摘下
送给别人家
茉莉花呀茉莉花

Convert image files to text

Go to a project and open the document library. Upload or drag & drop your image file:

Click one or more images to select them, then click the OCR link above the files. The OCR tool opens:

Choose one of the OCR systems and hit "Process files". The results are saved next to the images:

To quickly check whether the text was recognized correctly, right-click one of the HTML files and select the "Open With" > "Web Browser" option:
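If you have downloaded several result files and prefer to review the recognized text outside the browser, a small script can pull the visible text out of each HTML file. The following is a minimal sketch using only the Python standard library. It assumes the result is ordinary HTML; the exact markup Wordbee produces is not described here, so the class name `TextExtractor` and the function `html_to_lines` are illustrative helpers, not part of Wordbee.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text chunks, skipping <script> and <style> content."""

    SKIPPED_TAGS = {"script", "style"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._skip_depth = 0  # > 0 while inside a skipped tag
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and self._skip_depth == 0:
            self.chunks.append(text)


def html_to_lines(html):
    """Return the non-empty text chunks found in an HTML string."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.chunks


# Hypothetical stand-in for an OCR result file (assumed structure).
sample = (
    "<html><head><style>p{}</style></head>"
    "<body><p>芬芳美丽满枝桠</p><p>又香又白人人夸</p></body></html>"
)
print("\n".join(html_to_lines(sample)))
```

To run it on a downloaded file, read the file into a string first, for example with `pathlib.Path("result.html").read_text(encoding="utf-8")`. Both the file name and the UTF-8 encoding are assumptions here.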

Handwritten text (English only)

One of the integrated OCR systems, provided by Microsoft, can extract handwritten English text.

Sample image:

Converted to:

chapter
Mr. Sherlock Holmes
In the year 1878 I took my degree of Doctor
of medicine of the university of London
and proceeded to Netley to go through
the course precribed for surgeons . Having
completed my studies there . I was duly attached
to the Fifth Northumberland Fusiliers as
Assistant
Surgeon

Enabling OCR systems

Integrated OCR capabilities are currently available through both Google and Microsoft. You will need to obtain credentials from either provider. Both offer free plans, and beyond that, usage is billed based on consumption.

Go to Settings > Image to Text (OCR) and enable the systems you need:

What about PDF files?

If the PDF was created with a word processor and is not a scan, you can use the Wordbee PDF Converter tool.

For PDFs with scanned pages, you can save the individual pages as image files using a screenshot or screen capture tool, then run those images through OCR like any other image. A native PDF-to-text conversion feature may be added in a future update.

PDF to Word Converter

With its built-in PDF to Word Converter, Wordbee Translator allows you to translate searchable PDF documents seamlessly, no third-party tools required. Convert and process your documents in one streamlined step.

The process is straightforward: simply upload your searchable PDF files to your Wordbee platform and process them like any other file. The built-in PDF Converter will instantly extract the content ready for translation.

The output can be delivered in either .docx or .pdf format. Please note that some layout information may be lost during conversion, so a DTP task is recommended to ensure the translated document faithfully preserves the formatting of the original.

Text embedded within images is not extracted, as the converter does not support optical character recognition (OCR).
