Tutorial 007: How to Extract Text from Images Using AI

Learn to simplify your workflow with 0CodeKit’s Picture Text Recognition. Our clear tutorial helps you extract text from scans and images effortlessly.

Published
December 1, 2024

Digitization has simplified a great deal in the last few years. For instance, it has made the data in physical documents available on digital devices. However, it comes with limitations, such as not being able to recognize and extract information from certain formats (scans or images). Luckily, OCR (optical character recognition) technology has changed this picture. In fact, we have a feature that is perfect for this kind of situation. In this blog post, we present the Picture Text Recognition endpoint, which extracts body text from images and scans.

Not a fan of reading? No problem! Check out our quick, easy-to-follow video tutorial to learn everything you need!

Setting Up the Picture Text Recognition Module

The first step is to upload the desired document to Dropbox or Google Drive. Then, we sign up for or log into one of the automation platforms where this feature is available (Make, Zapier, or n8n).

After that, we set up the first Dropbox/Google Drive module and choose the feature called "Watch Files", which monitors a specified folder and triggers whenever a file is uploaded. Next, we add a second Dropbox/Google Drive module with the feature "Download File" so that 0CodeKit can access the document.
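For readers curious about what a "Watch Files" trigger does conceptually, here is a minimal sketch in Python. It is not how Dropbox, Google Drive, or any automation platform implements the feature. It assumes a local folder standing in for the cloud folder, and the function name `find_new_files` is hypothetical. The idea is simply to remember which files were already seen and report only the new ones on each check.

```python
import tempfile
from pathlib import Path
from typing import List, Set


def find_new_files(folder: Path, seen: Set[str]) -> List[Path]:
    """Return files in `folder` that were not seen on earlier checks."""
    new_files = sorted(
        p for p in folder.iterdir() if p.is_file() and p.name not in seen
    )
    seen.update(p.name for p in new_files)  # remember them for the next check
    return new_files


# Demo: a throwaway local folder stands in for the watched cloud folder.
with tempfile.TemporaryDirectory() as tmp:
    folder = Path(tmp)
    seen: Set[str] = set()

    (folder / "scan-1.png").write_bytes(b"fake image bytes")
    first_check = [p.name for p in find_new_files(folder, seen)]

    # Nothing was uploaded in between, so this check reports no new files.
    second_check = [p.name for p in find_new_files(folder, seen)]

    (folder / "scan-2.png").write_bytes(b"fake image bytes")
    third_check = [p.name for p in find_new_files(folder, seen)]
```

In the no-code scenario, the platform handles this bookkeeping for us, which is why the module only needs a folder to watch.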

Once the Dropbox/Google Drive modules have been set up, we add the 0CodeKit app and select the feature "Create temporary URL to file", which lets 0CodeKit access the document via a URL. Here, we only have to click on the option "Dropbox/Google Drive - Download a File".

Now, we add the last module: another 0CodeKit module with the feature "Detect Text in a Picture with OCR", which extracts all text bodies found in an image. Here, we only need to enter the item "Temporary File URL" into the Image URL field and click on "Save". Finally, we can execute the scenario, and we will receive a list of all text found as the output.
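The data flow through the modules described above can be sketched in Python as follows. This is only an illustration of the order of the steps, not the 0CodeKit API. The names `DownloadedFile`, `run_scenario`, `fake_create_temporary_url`, and `fake_detect_text` are all hypothetical, the URL is a placeholder, and the two "fake" functions are stubs that return canned values instead of calling any real service.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List


@dataclass
class DownloadedFile:
    """What the 'Download File' module hands to the next step."""
    name: str
    data: bytes


def run_scenario(
    files: Iterable[DownloadedFile],
    create_temporary_url: Callable[[DownloadedFile], str],
    detect_text: Callable[[str], List[str]],
) -> Dict[str, List[str]]:
    """Chain the steps: file -> temporary URL -> list of detected text."""
    results: Dict[str, List[str]] = {}
    for file in files:
        image_url = create_temporary_url(file)     # "Create temporary URL to file"
        results[file.name] = detect_text(image_url)  # "Detect Text in a Picture with OCR"
    return results


# Stubs (assumptions for illustration only; no real service is called).
def fake_create_temporary_url(file: DownloadedFile) -> str:
    return f"https://example.invalid/tmp/{file.name}"


def fake_detect_text(image_url: str) -> List[str]:
    file_name = image_url.rsplit("/", 1)[-1]
    return [f"text found in {file_name}"]


output = run_scenario(
    [DownloadedFile(name="invoice.png", data=b"fake image bytes")],
    fake_create_temporary_url,
    fake_detect_text,
)
```

Passing the two steps in as functions mirrors how the scenario is wired: each module only needs the output of the previous one, so the "Temporary File URL" item is the only thing the OCR step has to receive.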

To Wrap Things Up

Stop extracting text manually from scans or images! Use our Picture Text Recognition feature instead.

If you would like to know how to use other 0CodeKit features, head to our YouTube channel for more tutorials.
