Supported documents

Documents that work well with Impira

Impira is built to automate data entry from your documents. Unlike OCR templates or pretrained machine learning models, Impira uses machine learning that utilizes geometric information in order to identify the values you want.

For example, the location on the page and the proximity to certain nearby words are clues that our system uses to identify the right values. This geometric information allows Impira to excel for structured and semi-structured documents like standard and custom forms, invoices, purchase orders, paystubs, bills, tax documents, and more.

You can start extracting information out of your documents in minutes using just one file as your first example. Impira then takes its learnings and apply it to the rest of the files in your Collection. After you review Impira's results after pulling information from the rest of your documents, you can export your data as a CSV or use our API to shape the data you want to see.


Impira can extract data from a wide variety of forms and can extract multiple field types, including text, dates, numbers, and checkboxes. Further, Impira is built to handle the handwriting, rotations, and zooms that you're likely to encounter with scans and images.


Impira handles the complexity and diversity inherent to processing invoices for accounts payable or spend analytics. Impira supports field types such as text, numbers, dates, checkboxes, and tables.

Documents that we don't support (yet)

Because Impira is optimized to use geometric information, there are several types of use cases that we don't currently support:

  • Specific entities or terms from paragraphs of text that require interpretation
  • Specific slides from presentations

We’re working hard to add support for more kinds of documents every day. We value any and all feedback about use cases and would love to learn if you are trying to extract something we don’t support today. Reach out to us at

Other ways Impira can work for you

Even for cases where Impira can’t currently help with your extraction needs, our rich functionality can still help automate your workflows. Every file that you upload is available for storage, search, and retrieval.

Read more about some tips and tricks for the searching, Impira Query Language (IQL), and integrations.

All images and documents are run through state-of-the-art OCR models. Even in imperfect use cases, users still see dramatic improvements in their efficiency by manually selecting text read through OCR rather than typing it in by hand.

© 2022 Impira Inc. All rights reserved. This site is built with Motif.