Documents can sometimes have layouts that look almost identical, with only a small difference in their visual templates. As a best practice, we recommend creating layout variations to account for these minor differences, as described in Adding a Variation to a Layout. Even with layout variations, the system may classify documents incorrectly. To prevent that from happening, we’ve introduced settings that help the system improve classification accuracy by using indications from the document’s pages after the system’s initial matching. These settings force the system to match documents based specifically on the layout identifier.
If the layout identifier in the document matches the expected ID in the layout variation, the system will classify the document according to that layout variation. This matching is valid regardless of whether the initial confidence score was below or above the threshold. The system is effectively forcing the match, ensuring accuracy.
If the identifier doesn’t match the expected ID in the layout variation, the document is either sent for Manual Classification or to Document Drift Management.
When Manual Classification is off, the document goes directly to Document Drift Management.
When it’s on, the document undergoes further review in Manual Classification.
These settings are essential for reducing errors and ensuring that documents are always matched to the correct layout variation.
Settings
The following settings are available at the flow level to boost Classification confidence:
Classify Using Layout ID — Enabling this setting allows Structured documents to be matched using a layout identifier. When this setting is enabled, the system checks for a matching layout identifier in the document. If the identifier matches the expected one in the layout variation, the document is classified accordingly. If it doesn't match, the document is either sent for further review or to Document Drift Management, preventing misclassification.
Bypass Classification by Layout ID — This setting should be enabled when certain layouts do not contain a layout identifier. It bypasses validation by layout identifier if the matched layout variation doesn’t have an identifier specified. In these cases, the bypass allows the system to continue classifying documents even without layout identifiers, ensuring that documents are still processed but not necessarily tied to a specific layout variation.
To enable these settings:
In the left-hand sidebar, click Flows.
Click on the name of the flow you want to enable the settings for, and click Edit Flows.
Click Start Document Processing Subflow.
In the Settings Type drop-down list, click Classification.
Select the checkbox for Classify Using Layout ID.
Enabling Classify Using Layout ID will force Structured document matching by layout identifier, if one is present in both the document and in the Structured layout variation.
If some layout variations do not have defined layout identifiers, select the checkbox for the Bypass Classification by Layout ID setting. That way, the system will continue classifying documents without using layout identifiers.