V40 Release Notes

Versions v40.1.x and v40.2.x are available to SaaS customers only.

40.2.11 (4 Jun 2025)

Updates

This version includes a number of updates that optimize our internal testing and deployment processes.

40.2.10 (22 May 2025)

Document Renderer Block

Fixed

Processing images through the Document Renderer Block — We've fixed image-identification and -handling issues that caused submissions to halt in the Document Renderer Block in the following situations:

The Image Correction and Mobile Processing Classification settings for the flow were enabled, and:
- PNG files were being processed, and the Quality Document Rendering setting for the flow was set to 80, or
- JPEG files were being processed, and the Quality Document Rendering setting for the flow was set to 40.

40.2.9 (13 May 2025)

Updates

This version includes a number of updates that optimize our internal testing and deployment processes.

40.2.8 (24 Apr 2025)

Updates

This version includes a number of updates that optimize our internal testing and deployment processes.

40.2.7 (10 Apr 2025)

User Experience

Fixed

"Updated at" timestamps — We've fixed an issue that caused "Updated at" timestamps (e.g., in the upper-right corner of the Submissions and Flows pages) to be in the user's time zone instead of the time zone specified in the SYSTEM_TIME_ZONE ".env" variable.

Tables

Fixed

Processing submissions with nested tables — We've resolved an issue that caused submissions containing nested tables to halt in Machine Transcription. The issue occurred when all rows in a child table were deleted during Manual Identification.

40.2.6 (27 Mar 2025)

Updates

This version includes a number of updates that optimize our internal testing and deployment processes.

40.2.5 (12 Mar 2025)

Flow Blocks

Fixed

Processing images through the Document Renderer Block — We've resolved an issue in the Document Renderer Block that caused submissions to halt if they contained 16-bit images or images with transparency channels.

40.2.4 (6 Mar 2025)

Training Data Management

Fixed

Previewing pages in Training Data Management (TDM) for Classification models — We've fixed an issue that caused delays in loading preview images of pages in TDM for Classification models.

40.2.3 (19 Feb 2025)

Updates

This version includes a number of updates that optimize our internal testing and deployment processes.

40.2.2 (3 Feb 2025)

Updates

This version includes a number of updates that optimize our internal testing and deployment processes.

40.2.1 (21 Jan 2025)

Version 40.2.0 was not released and is not supported.

License Keys

Updated

Archival of layouts and releases on license downgrade – If you enter a license key for a license package that is at a lower level than your current package, the system archives any layouts and releases that include features that are not offered in the new license package. This update makes affected layouts and releases unavailable for use, preventing errors from occurring and submissions from being halted.

More information about entering license keys can be found in License Packages and Feature Availability.

Internationalization

Updated

Support for German translations – You can now provide interface-text translations for the de-DE locale (German, Germany).

For more information on translating the application’s interface, see Providing a Translated User Interface.

Importing updated versions of translation files – When an updated version of a translation file is imported, the system validates that the updated file contains the same number of translation strings as the original file. If it does not, the import will fail. This update prevents unsupported translations from being uploaded, and it ensures that a translation is available for each string that can be translated.

Models

Updated

Enhancements to segmentation for Signature fields – We’ve enhanced the ability to detect signatures in documents, addressing previous performance challenges. We’ve improved the detection accuracy for signatures using real-world training data.

Independent model — Signature segmentation is now trained as a standalone model, separate from text and checkbox segmentation, allowing it to focus exclusively on identifying signatures in documents. The model now produces more reliable and complete bounding boxes around signatures.
Improved detection accuracy — Bounding boxes from text segmentation are now used to refine signature segmentation, ensuring that fragmented or partial signatures are consolidated for improved accuracy.

Training Data Management

Updated

Enhanced filtering and search in the Model History table for Identification models – To make it easier to find particular model versions in the Model History table, we've introduced the following options:

Filtering — You can filter the contents of the Model History table by creation date, last-deploy date, source, and trainer version.
Searching — We've added the ability to search for model versions by name.
Sorting — You can sort the table's contents by the following columns:
- Name
- Date created
- Version
- Source
- Last deploy

Additionally, you can choose which columns are included in the table by clicking the menu next to the Filter drop-down list and clicking the Manage columns… option.

For more information about navigating TDM for Identification, see TDM for Identification Models.

Fixed

Migration of data to Training Data Management – We've fixed a data-migration issue that caused database deadlocks to occur when training data was sent to Training Data Management. This issue affected data coming from completed submissions that contained more than 500 pages.

Flow Blocks

New

File Filter Block – With the File Filter Block, you can prevent files that meet certain criteria from being included in submissions. It is supported for use with the Email Listener Input Block and with API integrations.

The File Filter Block allows you to:

Block files or allow files with particular extensions.
Block files whose size does not meet the minimum that you specify.
Block image files with heights and widths that are less than particular minimums.

Additionally, you can choose to have the filters applied to all files in a submission or only to image files.

The File Filter Block is not included in the flow provided in the application but can be included in custom flows.

To learn more about the File Filter Block, see Flow Blocks.

Updated

"Scope" setting for HTTP REST Blocks – We've added a Scope setting to HTTP REST Blocks, which allows you to specify a scope for requests authorized with OAuth 2.0. This setting is available only if the block's Authorization Type is set to OAuth 2.0 Client Credentials. More information on HTTP Rest Blocks can be found in API Blocks.

Page-size limitation for the Document Renderer Block – The Document Renderer Block supports documents containing pages up to 600mm x 600mm in size. Ensure that your documents’ pages fit within these dimensions to avoid rendering issues. Learn more in our Flow Blocks article.

Machine Classification

Fixed

Initialization times for Machine Classification Blocks – We've fixed an issue that caused the initialization of the Machine Classification Block to take up to an hour if its flow's release contained a large number of unique pages (~5000 or more) and at least one Structured layout.

Supervision

New

Additional options for sending fields to Supervision – In v40.2, we’ve introduced the following options for sending specific fields to be reviewed by a keyer:

Send normalization errors to Supervision — If your organization needs to review normalization errors before downstream processing, you can enable the Force Normalization Errors To Supervision flow setting, which allows you to send such fields to Supervision. When enabled, normalization errors will be flagged for human review, ensuring data accuracy within the platform. By default, this setting is turned off to maintain existing workflows.
Send blank or missing Required fields to Supervision — Fields marked as “Required” in the Layout Editor can now be sent to Supervision when missing or left blank. When the Force Missing/Blank Fields To Supervision setting is enabled, blank fields are sent to Transcription Supervision, while missing fields are sent to Identification Supervision. Doing so ensures that critical fields are handled within Hyperscience. By default, this setting is disabled to avoid changes to current workflows.

For more information about available flow settings, see Flow Settings.

Permissions

New

Permission groups for machine accounts – You can now assign permission groups to machine accounts, just as you can for any other user. By default, machine accounts are assigned to the API User permission group, but you can assign or remove permission groups as needed. As a result, this feature allows you to assign granular permissions to machine accounts to match exactly how those accounts are used.

To learn how to assign permission groups to machine accounts, see Managing Machine Credentials.

Audit Log

Updated

"Changes" field in activity records – We've removed the Changes field from Audit Log activity records in the application. The field is still present in API responses.

More information about the Audit Log can be found in Audit Log (v39.1 and later).

Application Bundle

Updated

Removal of PgBouncer – We've removed pgbouncer from the application bundle. If your instance requires the use of PgBouncer, contact your Hyperscience representative for assistance.

Upgrade of PostgreSQL image to PostgreSQL 16.6 – We've updated the version of the PostgreSQL image included in the bundle to PostgreSQL 16.6.

SaaS

New

Allowlisting endpoints for outbound connections – If you need to integrate Hyperscience with systems that require outbound connections, a System Admin can create an allowlist of endpoints that those integrations can use. As a result, you no longer need to open a ticket with Hyperscience to create the allowlist on your behalf, reducing the time required to set up outbound connections.

To learn more about allowlisting endpoints, see IP Allowlisting.

Support for AWS IAM roles in Input Blocks and Output Blocks – You can now use Amazon Web Services (AWS) Identity Access Management (IAM) roles to authenticate AWS connections in the following blocks:

Message Queue Listener Input Block
S3 Listener Input Block
Message Queue Notifier Output Block
S3 Notifier Output Block

To use IAM roles, you first need to work with a Hyperscience Cloud Engineer to configure the roles in your instance. For more information, see IAM Roles for Input Blocks and Output Blocks.

40.1.7 (4 Feb 2025)

Updates

This version includes a number of updates that optimize our internal testing and deployment processes.

40.1.6 (17 Jan 2025)

Updates

This version includes a number of updates that optimize our internal testing and deployment processes.

40.1.5 (19 Dec 2024)

Submissions

Fixed

Submissions processing - We’ve resolved an issue causing submissions to halt by ensuring Ghostscript is placed in a folder /opt/hs/gs/bin/ the system can access. This fix allows submissions to process correctly without errors.

Multiple Occurrences

Updated

Checkboxes and Signatures - We’ve disabled multiple occurrences for non-entry field types (e.g. checkboxes and signatures).

Long-form Extraction

Updated

Clause field data type - We’ve removed the ability to add Clause field data types for structured layouts.

40.1.4 (30 Nov 2024)

Updates

This version includes a number of updates that optimize our internal testing and deployment processes.

40.1.3 (29 Nov 2024)

Flow Blocks

Updated

40.1.2 (18 Nov 2024)

Updates

This version includes a number of updates that optimize our internal testing and deployment processes.

40.1.1 (15 Nov 2024)

Updates

This version includes a number of updates that optimize our internal testing and deployment processes.

40.1.0 (12 Nov 2024)

Internationalization

Updated

Support for German translations — You can now provide interface-text translations for the de-DE locale (German, Germany).

To learn how to upload translation files, see Providing a Translated User Interface.

Submission Pre-processing

Updated

New Rotation Correction model – We’ve implemented a new page-level rotation-detection model, which improves performance for text-sparse documents, documents with noisy backgrounds (e.g., SSN cards, driver's licenses, birth certificates, etc.), and mixed-language documents.

Training

Updated

Deterministic Training Recovery – In v40, we introduced the Trainer Resiliency feature, which allows you to resume interrupted training tasks from their last saved checkpoint. Deterministic Training Recovery enhances Trainer Resiliency by ensuring that models' automation rates are unaffected by the use of saved checkpoint data in the completion of training tasks.

Training Data Management

Updated

Importing and exporting Classification models – With the updates included in v40.1, users can download trained Classification models and import them to other instances.

Note that these updates apply only to Classification models and not to their training data.

For more information, see TDM for Classification.

Accuracy

New

Field-level accuracy targets for transcriptions in Semi-structured documents – We've extended the field-level accuracy targets introduced in v40 to include the transcription of fields and table columns in Semi-structured documents. With this update, you can specify accuracy targets for the transcription of particular fields and table columns in a flow's settings.

Note that accuracy targets apply only to the transcription of fields and table columns and not to their identification. Field-level accuracy targets for identification will be available in an upcoming version.

For more information on setting field-level accuracy targets for Semi-structured documents, see Transcription Accuracy and Automation.

Flow Blocks

New

Google Cloud Platform Integration Blocks – We’ve introduced five new Google Cloud Platform (GCP) Blocks, which enable automation solutions requiring the usage of large language models (LLMs) or vision language models (VLMs). These blocks use Retrieval Augmented Generation (RAG) techniques to provide ground-truth data to LLMs and create high-accuracy responses:

Vertex AI Block (VLM/LLM)
BigQuery Block (BigQuery querying Block)
Vertex AI Embeddings Block (Embedding model)
Vector Search Create Block (Creating vector indexes)
Vector Search Query Block (Querying vector indexes)

Each one of these blocks helps to enable and build automation workflows that leverage Generative AI and RAG on GCP. The use of RAG techniques minimizes the likelihood of LLM hallucinations in the blocks’ output, providing the most relevant and accurate information possible.

For more information about these blocks, and for assistance in implementing them, contact your Hyperscience representative.

File Storage

Updated

Enhanced Google Cloud Storage (GCS) integration – If you are using a GCS bucket as your file store, you can take advantage of our enhanced GCS integration, which supports the use of Object Versioning and Application Default Credentials (ADC).

To learn more about this enhanced integration and how to configure it, see Google Cloud Storage.

Known Issues

Task Queue

Completing selected tasks from the Perform Tasks action – If a keyer selects the checkboxes for individual tasks in the Task Queue, clicks Actions, and then clicks Perform Tasks, no tasks appear in the table. A fix for this issue will be available in an upcoming patch version.

Reporting

Access to Reporting pages for users with custom roles – We’re fixing an issue that prevents users from accessing the entire Reporting section of the application if they do not have permission to access the Reporting Overview page (Reporting > Overview). This issue affects only those users who are in custom permission groups that have permission to access other pages in the Reporting section.

40.0.24 (3 Jun 2025)

Updates

This version includes a number of updates that optimize our internal testing and deployment processes.

40.0.23 (22 May 2025)

Manual Identification

Fixed

Manually re-identifying tables when tables are partially machine-identified — We've resolved an issue that prevented keyers from re-identifying tables when those tables were partially identified by the machine. When they attempted to do so, their changes to machine's identification weren't saved.

Document Renderer Block

Fixed

The Image Correction and Mobile Processing Classification settings for the flow were enabled, and:
- PNG files were being processed, and the Quality Document Rendering setting for the flow was set to 80, or
- JPEG files were being processed, and the Quality Document Rendering setting for the flow was set to 40.

40.0.22 (7 May 2025)

Updates

This version includes a number of updates that optimize our internal testing and deployment processes.

40.0.21 (24 Apr 2025)

Updates

This version includes a number of updates that optimize our internal testing and deployment processes.

40.0.20 (10 Apr 2025)

Classification

Updated

Font-rendering consistency in Microsoft documents — We’ve updated support for Microsoft fonts to ensure consistent rendering of Microsoft Office Open XML documents, such as .docx. In v40.0.19, certain .docx files displayed unexpected layout differences when converted to images, which impacted Structured Classification accuracy in downstream processes. This issue was caused by missing Microsoft fonts that were previously available as part of the LibreOffice library but were removed in the library’s later versions. This update restores alignment with the behavior observed in v40.0.08 and improves reliability for classification use cases that are dependent on layout structure.

NOTE: You may notice differences in how .txt files are rendered in v40.0.20 and later. These differences are due to a change in LibreOffice behavior introduced in their March 2025 release, which affects text-to-image conversion when exporting to PDF. To learn more, see LibreOffice’s Release Notes.

User Experience

Fixed

"Updated at" timestamps — We've fixed an issue that caused "Updated at" timestamps (e.g., in the upper-right corner of the Submissions and Flows pages) to be in the user's time zone instead of the time zone specified in the SYSTEM_TIME_ZONE ".env" variable.

40.0.19 (31 Mar 2025)

Updates

This version includes a number of updates that optimize our internal testing and deployment processes.

40.0.18 (6 Mar 2025)

Training Data Management

Fixed

Previewing pages in Training Data Management (TDM) for Classification models — We've fixed an issue that caused delays in loading preview images of pages in TDM for Classification models.

40.0.17 (28 Feb 2025)

Training Data Management

Fixed

Processing submissions containing large Semi-structured documents – We've fixed an issue in the migration of training data that caused deadlocks, high database-GPU usage, and halting when processing submissions that contained Semi-structured documents that had more than 500 pages.

40.0.16 (19 Feb 2025)

Updates

This version includes a number of updates that optimize our internal testing and deployment processes.

40.0.15 (3 Feb 2025)

Updates

This version includes a number of updates that optimize our internal testing and deployment processes.

40.0.14 (16 Jan 2025)

Updates

This version includes a number of updates that optimize our internal testing and deployment processes.

40.0.13 (19 Dec 2024)

Submissions

Fixed

40.0.12 (16 Dec 2024)

Updates

This version includes a number of updates that optimize our internal testing and deployment processes.

40.0.11 (29 Nov 2024)

Flow Blocks

Updated

40.0.10 (21 Nov 2024)

Large Language Model (LLM) Blocks

Fixed

Execution of LLM Install Flow – We've fixed an issue that caused the execution of the Hyperscience-provided LLM Install Flow to fail with the error ModuleNotFoundError: No module named 'authlib'.

40.0.9 (6 Nov 2024)

Internationalization

Updated

Support for German translations — You can now provide interface-text translations for the de-DE locale (German, Germany).

To learn how to upload translation files, see Providing a Translated User Interface.

40.0.8 (25 Oct 2024)

Updates

This version includes a number of updates that optimize our internal testing and deployment processes.

40.0.7 (15 Oct 2024)

Layouts and Models

Fixed

Messaging about latest layout and model versions not being live – We've resolved a version-comparison issue that caused incorrect "Latest version is not live" warning messages to appear on the details pages for layouts and models.

40.0.6 (7 Oct 2024)

Updates

This version includes a number of updates that optimize our internal testing and deployment processes.

40.0.5 (3 Oct 2024)

Connections

Updated

Specifying AWS regions for S3 Notifier connections – We've added an AWS Region setting to S3 Notifier Output Blocks, which allows you to specify the region of the S3 bucket that notifications are being sent to (e.g., us-west-2). Specifying a region helps to prevent location-constraint errors from occurring when attempting to connect to the notifications' S3 bucket.

Infrastructure

Updated

Refactoring of docker-compose files – We've refactored our docker-compose files by merging frontend and backend files into docker-compose.forms.<type>.yml. This update allows the starting of containers to be controlled from within Docker, and it includes the following changes:

hsfe_gunicorn_1 has been renamed to hsbe_gunicorn_1.
hsfe_nginx_1 has been renamed to hsbe_nginx_1.
The command for the script that reassociates field dictionaries after uploading a release has been updated to use the new names. For more information, see Adding a New Release.

Upgrades

Fixed

Upgrading to v40 – We've fixed a data-migration issue that caused upgrades to v40 to fail with unsupported operand type(s) errors. This issue affected instances that had run v35 within the past year, regardless of what version was being run immediately before the upgrade.

40.0.4 (26 Sept 2024)

Updates

This version includes a number of updates that optimize our internal testing and deployment processes.

40.0.3 (19 Sept 2024)

Versions 40.0.0-40.0.2 were not released and are not supported.

User Experience

New

Internationalization of the user interface – If your organization needs to provide a translated user interface (UI) for your keyers, a System Admin can upload translations of the interface's text in the languages of your choice. After a CSV containing the translations has been uploaded, a drop-down menu appears in the upper-right corner of the application, allowing any user to change the language of the UI text. By translating the application's text into your users' native languages, you can increase keyer productivity and expand the use of Hyperscience at your organization.

You can provide translations for up to three additional locales. Translations in languages read from right to left are not supported. In v40, only interfaces used by keyers (e.g., Supervision) are able to be translated with this feature. We will make more UI text available for translation in future versions of Hyperscience.

Note that Hyperscience does not offer translations of UI text, nor does the platform include a translation-management interface.

To learn how to provide translations of UI text, see Providing a Translated User Interface.

Enforcement of license packages – To ensure that you can use only those features we can support in your license package, any features not included in your license package are no longer available for use. This change will take effect upon entering a license key created under Hyperscience's Hypercell pricing plan. Instances with license keys created under non-Hypercell pricing are not affected and include all available Hyperscience features.

More details about the enforcement of license packages can be found in License Packages and Feature Availability.

For more information about our license packages and what is included in each one, contact your Hyperscience representative.

Layouts

New

Document Drift Management – Document Drift Management (also known as Layout Triage) helps you manage and classify unmatched pages more effectively. It automatically groups similar pages based on their visual patterns, allowing you to easily organize and create new layouts for documents that don’t match existing ones.

This feature replaces the previous "Find Potential Layouts" and "No Layout Found" processes, offering a more streamlined and effective way to handle unmatched pages.

Document Drift Management simplifies handling unmatched pages by letting you manually adjust groups and create accurate layouts directly from these pages. It works best with Structured documents, where the visual similarity between the pages is consistent.

To learn more, see Document Drift Management (Layout Triage).

Training

New

Trainer Resiliency – With Trainer Resiliency, interrupted training tasks can be resumed from their last saved checkpoint. This feature helps to reduce the potential of lost time when training a model in the event of network or other infrastructure failures.

You can save checkpoint data for the following types of models:

Field Identification
Table Identification
Long-form Extraction

By default, checkpoint data is saved every 30 minutes, but you can choose to save data at a different frequency. This data is saved in the /var/www/forms/forms/media directory. Note that Trainer Resiliency requires 6GB of additional server capacity in order to save checkpoint data.

This feature Is not enabled by default. You can enable it by adding variables to the ".env" file, or you can ask your Hyperscience representative for assistance.

To learn more about Trainer Resiliency, see Trainer Resiliency.

Models

New

Extended Model Compatibility – In v39 and earlier, models could only be used with flows that were created in the same version of Hyperscience as the models were trained on. While the application could run flows created in the previous two versions of Hyperscience, any retraining of those flows' models required that the flows be updated, as well. This limited compatibility caused delays in the deployment of newer, more performant models, as well as the overall time-to-value for Hyperscience in many use cases.

In v40 and later, models do not need to be trained in the same version that their flows were created in in order for them to be used in submission processing. For example, in v40 of the application, if you have a model trained in v38 and a flow created in v38, but you need to retrain the model, you can retrain the model in v40 and continue to use it with the v38 flow.

For more details on compatibility among application, flow, and model versions, as well as the impact of this update on the v40 upgrade process, see Compatibility Across Application, Flow, and Model Versions.

Updated

Training Data Management enhancements for Identification Models - We’ve made enhancements in the Training Data Management for Identification models. In v40, you will be able to see all models associated with the currently supported model versions in the Model History card. This enhancement provides a view of the entire model history, enabling you to revert to previously trained models. Another improvement in v40 is that the models that are rejected or undeployed will no longer disappear from the history view. Instead, they remain visible as part of the comprehensive model history, giving you better visibility and control over your model.

Accuracy

New

Field-level accuracy targets in Structured documents – In many use cases, some field transcriptions require higher levels of accuracy than others, like names or Social Security Numbers. With the updates included in v40, you can set accuracy targets that are specific to each of these fields in Structured documents. By allowing you to tailor accuracy requirements at the field level, this feature eliminates the need to create separate flows for critical transcriptions. It also prevents keyers from having to complete Transcription Supervision tasks for lower-value fields due to high accuracy targets at the flow level.

You can set field-level accuracy targets in the field dictionary or in a flow's settings. Any targets set at the flow level for a field override any targets for that field that have been set in the field dictionary. The accuracy achieved for each field you've set a specific target for can be found on the Accuracy page of the application (Reporting > Accuracy).

Note that setting field-level accuracy targets has no effect on the number of QA tasks that the system creates.

To learn how to set field-level accuracy targets, see Transcription Accuracy and Automation.

Flows

New

Document Renderer Block – The Document Renderer Block allows you to download documents as PDFs after they go through Classification—either Machine or Manual. You can configure the block in Document Processing Subflow V40. In the block settings, you can specify the page size (in inches or millimeters) and adjust the image quality. Note that higher quality results in larger file sizes, with a default quality of 50% that balances size and clarity. Once configured, you can find a download URL in each submission’s JSON output. Paste this URL after the core URL of your instance to initiate the PDF download (e.g., example.hyperscience.com/api/<URL>).

For more information, see Flow Blocks.

Updated

Packaging of Python 3.9 runtime – In preparation for Python 3.9's end-of-life in October 2025, we've supported both Python 3.9 and Python 3.11 in the past several versions of Hyperscience. To continue to support both versions of Python in v40, we’ve included the Python 3.9 runtime as a package that is automatically deployed if you are still using flows that depend on Python 3.9. Doing so gives your organization additional time to upgrade any Code Blocks or external packages that use Python 3.9 before support ends for that version of Python.

More information on deploying the Python 3.9 runtime in v40 can be found in Developing Flows.

Flows versioning and UX updates – To provide a better overall user experience, we've made the following enhancements to the Flows user interface and versioning:

All flows now on a single page — The lists of top-level flows and all flows in the instance no longer appear on separate pages. The Top-level Flows page has been removed, and all flows, both top-level and subflows, can be found on the Flows page of the application.
Behavior of the back button (<) on flow-related pages — We've updated the behavior of the Back (<) button on certain flow-related pages (e.g., pages in Flow Studio, flow-run pages) to create a more consistent user experience. We've also removed the button from some of these pages.
Versioning of On-Error and Notification subflows — Because their functionality does not change from version to version, we are no longer versioning the On-Error and Notification subflows that are included in each version of Hyperscience. These subflows are still included in v40, but they do not include any version designation in their names.

Default values for subflows and blocks – Default values declared in the manifest or input elements in the JSON files of flows are now taken into account when running those flows as subflows. By making the flow-development process more intuitive, this update enhances the development experience and makes the behavior of blocks and subflows more predictable.

Note that values entered specifically on the subflow-calling block take precedence over any default values in the subflow’s definition. After that, values in the input element of the flow's JSON are used before those specified in that file's manifest element.

This update may change the current behavior of any flows invoked as subflows if they don’t have explicit default values in their manifest or input elements.

Classification

Updated

Layout ID Matching for Structured Documents – Layout identifiers help the system improve classification accuracy by using distinguishing features from documents. With the flow-level settings introduced in v40, you can specify how layout identifiers are used during the Classification process:

Classify using Layout ID — When this setting is enabled, the system checks for a matching layout identifier in the document. If the identifier matches the expected one in the layout variation, the document is classified accordingly. If it doesn't match, the document is either sent for further review or to Document Drift Management, preventing misclassification.
Bypass Classification by Layout ID — This setting bypasses validation by layout identifier if the matched layout variation doesn’t have an identifier specified.

Pre-computing data for the classification of Structured documents – To increase the overall efficiency of the classification process, we've updated the system to pre-compute data for the classification of Structured documents in each release. This pre-computation occurs when the first submission enters a flow that is using a release that contains Structured documents.

Unstructured Extraction

Updated

Multiple Occurrences for Unstructured Extraction – We’ve enhanced the Unstructured Extraction (UNLP) model to support the extraction of multiple occurrences of fields. With this update, you can now capture and extract various instances of the same field within a document with long segments of text. You can also select the engine type from the UI.

For more information, see Training a New Field Identification Model.

Reporting

New

Infrastructure metrics in the Usage Report – In v40, we’ve added key infrastructure information to the Usage Report:

Database version
Operating system details
Kubernetes version
Helm chart version

To learn more, see Usage Report.

Authentication

Updated

Support for SAML_STAFF_PERMISSION_ROLES – We've removed support for the SAML_STAFF_PERMISSION_ROLES ".env' file variable. The variable’s functionality was never enabled in the application, so there is no action required on your part as a result.

For more information on configuring SAML, see SAML.

Additional OIDC configuration options – We've added the following ".env" variables for OIDC configurations:

HS_OIDC_VERIFY_KID — Indicates whether the OIDC client verifies the kid field in JWT tokens. Defaults to true.
HS_OIDC_VERIFY_SSL — Indicates whether the OIDC client verifies the SSL certificates of the OpenID provider's responses. This value can be a boolean value or a path to a certificate bundle. Defaults to true.

For more information about these and other OIDC configuration options, see OpenID Connect (OIDC).

Fixed

Resetting passwords for locked-out users in SaaS self-service user management – We've fixed an issue that prevented passwords from being reset for locked-out users. The issue affected SaaS deployments with self-service user management enabled.

Infrastructure

New

Logging NGINX-related issues – To enable you to debug NGINX-related issues more effectively, we've introduced the NGINX_ERROR_LOG_LEVEL ".env" file variable. You can use this variable to specify the minimum severity level an issue must have in order for it to be logged by the syslog utility. The default value is info.

More details on NGINX_ERROR_LOG_LEVEL and its possible values can be found in Security.

Updated

Support for Ubuntu 16 and RHEL 7 – From v40 onward, we are not supporting the use of Ubuntu 16 or RHEL 7 with Hyperscience.

To learn more about our supported operating systems, see Infrastructure Requirements.

Support for PostgreSQL 12 – Beginning in v40, the Hyperscience application will no longer support PostgreSQL 12.For more information about our supported databases, see Infrastructure Requirements.