Last updated: July 21, 2021

Supported documents

Documents that work well with Impira AutoML

Impira's automated machine learning (AutoML) system is built to automate data entry from your documents. Unlike OCR templates, Impira uses machine learning that draws on geometric information to identify the values you want. For example, a value's location on the page and its proximity to certain nearby words are clues that our system uses to identify the right values. This geometric information allows Impira AutoML to excel at structured and semi-structured documents like standard and custom forms, invoices, purchase orders, paystubs, bills, tax documents, and more.
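To make the idea of geometric clues concrete, here is a minimal toy sketch in Python. It is not Impira's model or API: the `Word` class, the `nearest_value` and `looks_like_amount` helpers, and the sample coordinates are all hypothetical, and a plain nearest-neighbor rule stands in for the learned behavior described above. It only illustrates how proximity to a nearby label word can single out the right value among OCR'd words.

```python
from dataclasses import dataclass
from math import hypot


@dataclass
class Word:
    """A hypothetical OCR result: the text plus the center of its bounding box."""
    text: str
    x: float
    y: float


def looks_like_amount(word):
    """Return True if the word parses as a number once '$' and ',' are removed."""
    cleaned = word.text.replace("$", "").replace(",", "")
    try:
        float(cleaned)
        return True
    except ValueError:
        return False


def nearest_value(words, label, is_candidate):
    """Return the candidate word closest to the first word matching `label`."""
    anchor = next((w for w in words if w.text.lower() == label.lower()), None)
    if anchor is None:
        return None
    candidates = [w for w in words if w is not anchor and is_candidate(w)]
    if not candidates:
        return None
    # Euclidean distance between box centers is the only "geometric clue" here.
    return min(candidates, key=lambda w: hypot(w.x - anchor.x, w.y - anchor.y))


# Made-up page layout for illustration only.
words = [
    Word("Invoice", 50, 20), Word("1042", 120, 20),
    Word("Subtotal", 400, 660), Word("$1,100.00", 480, 660),
    Word("Total", 400, 700), Word("$1,250.00", 480, 700),
]

match = nearest_value(words, "Total", looks_like_amount)
print(match.text)  # prints "$1,250.00"
```

A fixed rule like this breaks as soon as a layout shifts, which is the kind of brittleness a learned approach is meant to avoid.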

You can start extracting information from your documents in minutes using just one file as your first example. Impira AutoML then applies what it learned to the rest of the files in your collection. After you review the results Impira pulled from the rest of your documents, you can export your table as a CSV or use our API to shape the data you want to see.
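As a rough sketch of what you might do with an exported CSV, the Python snippet below flags rows with a missing value for review and sums the rest. The column names (`file_name`, `invoice_date`, `total`) and the sample rows are assumptions made for illustration; a real export will use the field names defined in your own collection.

```python
import csv
import io

# Stand-in for an exported file; the column names and rows are hypothetical.
exported = io.StringIO(
    "file_name,invoice_date,total\n"
    "inv-001.pdf,2021-07-01,1250.00\n"
    "inv-002.pdf,2021-07-03,\n"
    "inv-003.pdf,2021-07-09,310.50\n"
)

rows = list(csv.DictReader(exported))

# Rows with an empty total are worth a manual look before using the data.
needs_review = [r["file_name"] for r in rows if not r["total"].strip()]

# Sum only the rows that have a value.
grand_total = sum(float(r["total"]) for r in rows if r["total"].strip())

print(needs_review)  # prints ['inv-002.pdf']
print(grand_total)   # prints 1560.5
```

To run this against a real file, replace the `io.StringIO` stand-in with `open("export.csv", newline="")`, where `export.csv` is a placeholder for your own file name.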


Impira can extract data from a wide variety of forms. Impira AutoML models can extract multiple field types, including text, dates, numbers, and checkboxes. AutoML is also built to handle the handwriting, rotation, and zoom variations that you're likely to encounter with scans and images.


Impira AutoML handles the complexity and diversity inherent to processing invoices for accounts payable or spend analytics. Impira supports field types such as numbers, dates, and text.

Tips for success

For tips on how to get the most out of Impira’s AutoML, check out this quick start guide.

Documents that Impira AutoML doesn’t support (yet)

Because Impira AutoML is optimized to use geometric information, there are several types of use cases that our AutoML doesn’t currently support:

  • Extracting specific entities or terms from paragraphs of text that require interpretation
  • Extracting tabular data
  • Extracting specific slides from presentations

We’re working hard to add support for more kinds of documents every day. We value any and all feedback about use cases and would love to learn if you are trying to extract something we don’t support today. You can reach us at

Other ways Impira can work for you

Even in cases where Impira AutoML can’t currently help with your extraction needs, Impira’s rich functionality can still help automate your workflows. Every file that you upload is available for storage, search, and retrieval. Read more about tips and tricks for searching, Impira Query Language (IQL), and integrations here.

All images and documents are run through state-of-the-art OCR models. Even in cases where the AutoML can’t reliably extract your data, users still see dramatic improvements in their efficiency by manually selecting text read through OCR rather than typing it in by hand.
