document classification

What if the OCR misses a field or value?

What if the OCR misses a field or value?

You have a software package that relies on optical character recognition (OCR) to classify, pick up words, numbers or phrases from a document.  As long as the quality of the document is mostly clean, everything works well.  However, what happens when the document arrives and the quality is simply, not good?  Does the software give up and run away with its tail between its legs? Are there any options to classify or capture anything on these documents?  

Defining Your Project Plan For Automated Redaction

Defining Your Project Plan For Automated Redaction

Are you planning on completing an automated redaction project for your company?

Have you already defined your project scope?

Deciding an automated redaction vendor can be overwhelming. You’ll have to do some vendor scoping to figure out who is best for your company.

Defining Your Automated Redaction Project Scope

Defining Your Automated Redaction Project Scope

Defining your organization's project scope for an automated redaction project

Not sure where to start when it comes to implementing automated redaction for your business?

Build a solid foundation for a successful automated redaction project by following the best practice tips for defining your project scope.

5 Examples of how document type and quality can affect accuracy

Document formatting and quality can have a dramatic effect on OCR and rules accuracy when data capture is concerned. The 5 examples shown below are meant to educate a potential or current data capture user on what can cause accuracy to rise or fall.  Although sometimes it’s hard or impossible to correct the issues that cause accuracy to fall, there are generally steps that can be taken to help prevent them.

Top 3 Optical Character Recognition (OCR) Misconceptions

Top 3 Optical Character Recognition (OCR) Misconceptions

Optical Character Recognition can be an extremely powerful tool, but there are many things that an OCR engine can’t actually handle, that often times get overlooked. Below I have listed out the top 3 most common misconceptions of an OCR engine.

Swiper, Yes Swiping!

Swiper, Yes Swiping!

Extract’s value extends well beyond just the information that is automatically captured.

There are many additional productivity tools in our UI that increase the speed and accuracy of users. With features like auto-zooming and highlighting all data on the image, there is a lot to talk about, but I’m going to dedicate the remainder of this blog post to my personal favorite: Extract’s swiping tools.

Is Government Ahead of the Game When it Comes to Technology?

Is Government Ahead of the Game When it Comes to Technology?

Interesting question, isn’t it? Within the past 30 days I have had more interaction with government entities than I have had in the past 30+ years…combined. Within the past 30-days, I miraculously figured out how to accomplish the following: obtain a marriage license, apply for certified copies of my marriage license, begin my name change process, start a LLC, file for a FEIN, and setup payroll. 

Are you still living in ancient times?

Are you still living in ancient times?

Early 2017, the Government Business Council and Veritas did extensive research and built a survey to figure out if federal organizations are living out “principles of transparency, participation, and collaboration.”

This survey shed some light on some troubling information regarding federal agencies and their data management practices.

OCR Software? The Magical Tool

OCR Software? The Magical Tool

The OCR Process is the Equivalent of Magic

Imagine having a mountain of scanned and faxed paper documents in your office… oy vey—what a mess!

Now, imagine waving a magic wand over the mountainous pile of papers that automatically classifies the documents, captures the information and designates any potential confidential or otherwise important information.