Written by Ana Canteli on May 13, 2022
Last week we discussed the zonal OCR engine included as standard in OpenKM. It takes part in many tasks that make document management easier, faster, and more affordable for users in their daily work. By extracting the text of documents, it lets a person find any document from any word they remember; it also enables document types to be identified, and documents to be indexed, categorized, and assigned metadata groups automatically, to name just a few examples.
OpenKM's document management software can be integrated with any third-party application thanks to its web services and the complete API described in its documentation. Today we will talk about the possibility of using OpenKM with an advanced optical character recognition application.
Advanced zonal OCR expands the number of cases in which optical character recognition can be successfully applied to the document and business processes an organization chooses.
OCR provides tangible benefits, primarily when used for specific tasks:
OpenKM offers an advanced optical character recognition solution. This option enables the management of large volumes of documentation. Even when the documents are all of the same type (for example, invoices), processing is not limited by the design differences between them. Keep in mind that the invoices an organization receives come from many different entities, so the critical values of this type of file can appear in different places from one invoice to the next. For open-source OCR engines, this is a limitation that can only be overcome by defining templates that tell the OCR application where to find the values it has to read.
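To make the template limitation concrete, here is a minimal sketch of template-based zonal extraction. It is not OpenKM's implementation: the `Word`, `Zone`, and `extract_fields` names, the coordinates, and the sample data are all hypothetical, and the OCR step itself is assumed to have already produced positioned words.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Word:
    """One recognized word and the top-left corner of its box (page units)."""
    text: str
    x: float
    y: float


@dataclass(frozen=True)
class Zone:
    """Rectangle in which a template expects to find one field."""
    left: float
    top: float
    right: float
    bottom: float

    def contains(self, word: Word) -> bool:
        return (self.left <= word.x <= self.right
                and self.top <= word.y <= self.bottom)


def extract_fields(words, template):
    """Return {field: text} by joining the words that fall inside each zone."""
    fields = {}
    for name, zone in template.items():
        hits = [w for w in words if zone.contains(w)]
        hits.sort(key=lambda w: (w.y, w.x))  # top to bottom, then left to right
        fields[name] = " ".join(w.text for w in hits)
    return fields


# Hypothetical template describing one supplier's invoice layout.
SUPPLIER_A = {
    "invoice_number": Zone(400, 40, 580, 60),
    "total": Zone(400, 700, 580, 720),
}

# Words as a zonal OCR pass might report them for a supplier A invoice.
supplier_a_words = [
    Word("INV-001", 410, 50),
    Word("121.00", 420, 710),
    Word("Thanks", 50, 760),
]

# The same values printed elsewhere on another supplier's invoice.
supplier_b_words = [
    Word("INV-001", 60, 120),
    Word("121.00", 60, 500),
]

matched = extract_fields(supplier_a_words, SUPPLIER_A)
missed = extract_fields(supplier_b_words, SUPPLIER_A)
```

With supplier A's layout, both fields are found; with supplier B's layout the same template returns empty strings, which is why a template-based engine needs one template per layout.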
Advanced optical character recognition allows the capture of metadata such as date, invoice number, concept, units, price, tax base, VAT, subtotal, total, etc., regardless of the design of each invoice. This advantage multiplies the possibilities of automated document management for the company.
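As an illustration of what can be done with the captured values, the sketch below turns raw strings into a typed record and checks that tax base plus VAT equals the total. The `InvoiceMetadata` class, the `parse_invoice` function, the dictionary keys, and the one-cent tolerance are assumptions made for this example; they are not part of OpenKM or of any particular OCR product.

```python
from dataclasses import dataclass
from datetime import date
from decimal import Decimal


@dataclass(frozen=True)
class InvoiceMetadata:
    """A subset of the fields an OCR pass might capture from an invoice."""
    invoice_number: str
    issue_date: date
    tax_base: Decimal
    vat: Decimal
    total: Decimal

    def is_consistent(self, tolerance: Decimal = Decimal("0.01")) -> bool:
        """True when tax base + VAT matches the total within the tolerance."""
        return abs(self.tax_base + self.vat - self.total) <= tolerance


def parse_invoice(raw: dict) -> InvoiceMetadata:
    """Convert raw captured strings into typed values.

    Raises KeyError for a missing field and ValueError or
    decimal.InvalidOperation for a value that cannot be parsed.
    """
    return InvoiceMetadata(
        invoice_number=raw["invoice_number"].strip(),
        issue_date=date.fromisoformat(raw["date"]),
        tax_base=Decimal(raw["tax_base"]),
        vat=Decimal(raw["vat"]),
        total=Decimal(raw["total"]),
    )


captured = {
    "invoice_number": " INV-001 ",
    "date": "2022-05-13",
    "tax_base": "100.00",
    "vat": "21.00",
    "total": "121.00",
}
invoice = parse_invoice(captured)
```

A record that fails `is_consistent()` is a good candidate for manual review, since a mismatch usually means that one of the amounts was misread.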
Here we present a standard use case of invoice management using OpenKM integrated with an advanced OCR solution.
The organization will have an application in which the documentation is generated or recorded: delivery notes, invoices, tickets, purchase orders, orders, etc.
We start with invoices, particularly supplier invoices. The documentation is deposited in OpenKM.
The documentation to be analyzed can come from different sources and in different formats. Whether it arrives as an email attachment or as the result of a scan, it may be a single file made up of several pages, each of which is an invoice.
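Separating a multi-page file into individual invoices can be sketched as a simple grouping step. The function below is a hypothetical illustration working on already extracted page texts: by default every page is treated as its own invoice, as described above, and an optional predicate (an assumption added here) lets continuation pages stay attached to the invoice they belong to.

```python
def split_into_invoices(pages, starts_new_invoice=lambda page: True):
    """Group a sequence of pages into invoices.

    pages: page contents in reading order (plain strings in this sketch).
    starts_new_invoice: returns True when a page begins a new invoice.
    The default treats every page as a separate invoice.
    """
    documents = []
    for page in pages:
        if not documents or starts_new_invoice(page):
            documents.append([page])      # open a new invoice
        else:
            documents[-1].append(page)    # continuation of the current invoice
    return documents


scan = ["INVOICE 1 ...", "INVOICE 2 ...", "INVOICE 3 ..."]
one_per_page = split_into_invoices(scan)

mixed = ["INVOICE 1 ...", "(continued)", "INVOICE 2 ..."]
grouped = split_into_invoices(mixed, lambda page: page.startswith("INVOICE"))
```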
In OpenKM, an entry route and an exit route are configured for the documentation. In the simplest scenario, an input file (a PDF, for example) contains a single sheet: one invoice. An input document can usually have several exit points, so when preparing the taxonomy we can create one folder per client. There will be a subfolder that acts as the entry point for the documentation. The invoices are organized by date (year/month/day) in this folder.
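The folder layout described above can be sketched as two small path helpers. The root `/okm:root/clients`, the `inbox` and `invoices` folder names, and the function names are assumptions chosen for this example; a real deployment would use whatever taxonomy the organization defines.

```python
from datetime import date
from pathlib import PurePosixPath

# Hypothetical base folder holding one subfolder per client.
CLIENTS_ROOT = PurePosixPath("/okm:root/clients")


def entry_folder(client: str) -> str:
    """Folder where incoming documents for a client are deposited."""
    return str(CLIENTS_ROOT / client / "inbox")


def invoice_folder(client: str, invoice_date: date) -> str:
    """Dated folder (year/month/day) where a processed invoice is filed."""
    return str(
        CLIENTS_ROOT
        / client
        / "invoices"
        / f"{invoice_date:%Y}"
        / f"{invoice_date:%m}"
        / f"{invoice_date:%d}"
    )


inbox = entry_folder("ACME")
filed = invoice_folder("ACME", date(2022, 5, 13))
```

Zero-padded month and day components keep the folders in chronological order when they are sorted by name.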
This advanced optical character recognition system includes configuration services within OpenKM. The technical team will be in charge of implementing the document input route, the import service, error control (the action protocol to follow when something fails), and document separation, among other features.
When documentation arrives in the input folder, OpenKM transfers it to the advanced OCR application for capture, processing, and separation. The documentation then goes to the output path, where it is ready for use.
If the advanced optical character recognition application detects an error, the user receives a notification that includes the wrong or defective file. The file can also be placed in a folder in OpenKM, where the user can manage the incident. All these operations are usually carried out in the background so that they do not affect the system's performance for the user.
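The flow from input folder to output path, including the error branch, can be sketched with local directories standing in for OpenKM folders. Everything here is illustrative: `process_inbox`, `fake_ocr`, and the folder names are hypothetical, and a real integration would call OpenKM's API and the OCR application instead of the local file system.

```python
import shutil
import tempfile
from pathlib import Path


def process_inbox(inbox: Path, outbox: Path, errors: Path, ocr, notify) -> dict:
    """Run every file in inbox through ocr and route it by outcome.

    Successful files are moved to outbox; failing files are moved to
    errors and reported through notify(file_name, reason).
    """
    for folder in (outbox, errors):
        folder.mkdir(parents=True, exist_ok=True)

    results = {}
    for path in sorted(p for p in inbox.iterdir() if p.is_file()):
        try:
            metadata = ocr(path)
        except Exception as exc:  # any OCR failure goes to the error folder
            shutil.move(str(path), str(errors / path.name))
            notify(path.name, str(exc))
            continue
        results[path.name] = metadata
        shutil.move(str(path), str(outbox / path.name))
    return results


def fake_ocr(path: Path) -> dict:
    """Stand-in for the OCR step: rejects empty files, counts characters."""
    text = path.read_text()
    if not text:
        raise ValueError("empty file")
    return {"chars": len(text)}


with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    inbox = root / "inbox"
    inbox.mkdir()
    (inbox / "good.txt").write_text("invoice 001")
    (inbox / "bad.txt").write_text("")

    notifications = []
    results = process_inbox(
        inbox, root / "out", root / "errors", fake_ocr,
        lambda name, reason: notifications.append((name, reason)),
    )
    routed_ok = sorted(p.name for p in (root / "out").iterdir())
    routed_bad = sorted(p.name for p in (root / "errors").iterdir())
    left_in_inbox = list(inbox.iterdir())
```

Running a function like this from a scheduled background job, rather than in the user's request, matches the idea of keeping the processing out of the user's way.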
This represents just one case of many scenarios in which advanced optical character recognition can improve your company's day-to-day document management. Contact us for more information.
North America: Please call +1 646 206 6071.
Monday - Friday: 08:00 am - 17:00 pm EDT for immediate assistance.
Europe Spain: Please call +34 605 074 544.
Monday - Friday: 09:00 am - 14:00 pm, 16:00 pm- 19:00 pm CEST for immediate assistance.