Product
/
Mar
23
,
2021
Ankur Goyal
Share
For document-heavy industries — and what industries aren’t? — OCR is no longer enough. Impira AutoML is the answer to automate manual data entry tasks in minutes.
Optical character recognition (OCR) technologies — while still great at identifying characters from PDFs, images, and handwritten notes — can never be successful alone, but rather rely on other technologies to add structure to make this data usable. Luckily today, when you pair OCR with automated machine learning (AutoML) you can save time and resources. AutoML allows anyone (your accountant, insurance agent, IT specialist, or healthcare receptionist) to set up and modify a unique machine learning (ML) model for a set of documents without needing to know a single line of code. With Impira AutoML, you can simply train your machine learning model by highlighting the text you want to extract. To make a change or improve the accuracy of your model, all you need to do is confirm your data extraction matches within the platform. This is you actively training and improving the OCR that powers your machine learning model in the background.
Leveraging Impira AutoML, subject matter experts can now set up their own automation workflow with little to no expertise in the technologies that power Impira.
A collection is the main feature within Impira for organizing and grouping files together. Each collection has its own dedicated ML model, and that model will be customized to work on extracting data from only the files within that collection.
Upload your files to your new collection, and Impira will immediately begin the initial OCR processing. This process will detect words and letters embedded within your file. Once complete, you’ll see that each file has its own row in the table. We recommend starting off by adding at least 5 or more similar files.
The beauty of AutoML is that you can teach and retrain an ML model to continually process your files through a simple user interface without the knowledge of code. Simply open a file, highlight the text you want to extract, and watch the machine learning model pull out your data from the rest of your files. As you confirm more data extraction matches, this will retrain and improve the accuracy of your ML model.
That’s it, you’re done. It’s really that simple: Create a collection, upload, and highlight.
We have more detailed How to guides that can walk you through a hands-on demo. If you’re in the insurance industry, ACORD forms are likely the bane of your existence. Read our How to guide on automating ACORD forms here.
Try Impira AutoML's easy to use interface to set up custom ML models for all your data extraction needs.