Amazon Identity Documents Extraction

The Amazon Identity Documents Service makes use of pre-trained ID doc models to extract Key-Value Pairs of invoice-related data making use of Amazon AnalyzeID API. The processing of invoices returns relevant information from passports, driver licenses, and other identity documentation issued by the US Government, such as Name, Surname, ID Number, and Date of Birth.

Key names are standardized within the response. For example, if your driver license says LIC# (license number) and passport says Passport No, Analyze ID response will return the standardized key as Document ID along with the raw key (e.g. LIC#). This standardization lets customers easily combine information across many IDs that use different terms for the same concept.

Possible use cases

  • Extracting invoice-related data.
  • Match Name and Surname fields to other supporting docs to verify the identity of a person.

Service Setup

  1. Open the Project Detail View of the project you would like to add the service to.
  2. Click on the Add Service button in the command bar.
  3. Select Amazon Identity Documents Service from the available Service Types.

  4. A new Service Configuration Wizard will open:
    (When navigating the Wizard, please make sure to use the Next Step button in the command bar to save any changes made).

    • Step 1 - Allows configuration of various service settings, including the name and description. The default settings are sufficient for most use cases.
    • Step 2 - Allows adding User Defined Categories to train the service on.
    • Step 3 - Verification of any documents that have been processed.

Service Configuration Settings

The Microsoft OCR Service can be configured by the user as a flexible solution. The following Settings are available:

Setting Type Required Type Description
ArchivingStrategy Optional Days before documents get deleted.
BatchSize Hidden Processing batch size.
DocumentProcessedStatus Optional Document status used to denote that a document has been processed.
Enabled Hidden Enable or disable the service.
ExecuteBeforeProcess   When set up as a child service, specify whether this service should be executed before the parent service gets executed.
ExecuteAfterProcess   When set up as a child service, specify whether this service should be executed after the parent service gets executed.
IsVerification Optional Save the lines and words for verification.
Password Optional Used for service authentication. Custom Code can be used to set the password. Can be set per document.
RemoveComments Optional Remove human comments from a document.

Add and Process Documents

  1. In the Amazon Identity Documents Service click on Inbox button.
  2. Select the Status you want to upload and use Status None or Received for new documents that have not been processed yet.
  3. Select an optional category if you know the category for the document, if you don’t want to select one just click on “No selection”.
  4. Find the files on your Local machine and upload them. The demo’s test files can be found at the following link: Click here.
  5. After all the documents have been uploaded you can check the documents to be processed, click on Processed Checked to process the documents.

It is recommended to only process a few documents at a time, especially if it is a new service to properly test if you receive the results you want before processing everything.

View Processed Documents

  1. In the Amazon Identity Documents Service click on the Outbox button.
  2. You can view your Processing results by opening a processed doc for verification.