Extracting Data

What is the GDPR and How Does it Affect Your Data?

The GDPR (General Data Protection Regulation) is a new EU regulation that takes effect in the first half of 2018. It will significantly affect businesses that handle the personal data of people in the EU. Read on to learn how it will affect and protect your data.

What if the OCR misses a field or value?

You have a software package that relies on optical character recognition (OCR) to classify documents and pick up words, numbers, or phrases from them. As long as the document is reasonably clean, everything works well. But what happens when a document arrives and the quality is simply not good? Does the software give up and run away with its tail between its legs? Are there any options for classifying or capturing anything on these documents?

PCI Compliance

As it becomes increasingly clear that data breaches are a fact of modern-day life, it is also clear that we need to think about protecting consumer information in more diverse ways.

One way to protect data is simply to redact it. The problem is that the amount and type of information that needs to be redacted varies widely and often changes from state to state.

Defining Your Implementation Phase

The Implementation Phase is the third step in proceeding with an automated redaction vendor. If you’re in this phase, you’re close to the go-live for your project.

The first thing that you must do in this phase is collect sample documents. This involves gathering random and targeted sample documents in scope for the redaction project. Once your files and samples are gathered, you must send them to your software vendor.

Defining Your Project Plan For Automated Redaction

Are you planning on completing an automated redaction project for your company?

Have you already defined your project scope?

Deciding on an automated redaction vendor can be overwhelming. You’ll have to do some vendor scoping to figure out who is the best fit for your company.

Bots Versus Humans

First came Alien, then Predator, followed by Alien versus Predator. The Alien series chronicles the battle between humans and a mysterious lifeform whose lifecycle has just begun. Predator was based on an extraterrestrial hunter stalking commandos in Central America and the citizens of Los Angeles. When the two series merged, the result was an epic battle between the two legends.

5 Examples of how document type and quality can affect accuracy

Document formatting and quality can have a dramatic effect on OCR and rules accuracy where data capture is concerned. The 5 examples shown below are meant to show a potential or current data capture user what can cause accuracy to rise or fall. Although it is sometimes hard or impossible to correct the issues that cause accuracy to fall, there are generally steps that can be taken to help prevent them.

NCSC Highlights Redaction Technology Advancements

Last month the National Center for State Courts (NCSC) coordinated an Automated Redaction Proof of Concept (PoC) with several vendors. 

The PoC was made possible thanks to funding from the State Justice Institute (SJI). Its purpose was to provide accuracy benchmarks to courts considering implementing automated redaction technology. NCSC plans to issue the results prior to CTC, publish them on its website, and present them to various target groups and at conferences.

Extract, redact all my documents.

The latest, most popular technological innovations share a common theme: automating your life to make it easier and more efficient. Every single day, new gadgets are released that complete tasks we never imagined could be done without a human.

Swiper, Yes Swiping!

Extract’s value extends well beyond just the information that is automatically captured.

There are many additional productivity tools in our UI that increase the speed and accuracy of users. With features like auto-zooming and highlighting all data on the image, there is a lot to talk about, but I’m going to dedicate the remainder of this blog post to my personal favorite: Extract’s swiping tools.

Are you still living in ancient times?

In early 2017, the Government Business Council and Veritas conducted extensive research and built a survey to find out whether federal organizations are living out “principles of transparency, participation, and collaboration.”

The survey shed light on some troubling information about federal agencies and their data management practices.

Leverage OCR to improve your workflows

It’s easy to mistake Optical Character Recognition (OCR) for a one-trick pony.

After all, pulling text out of an image to make it usable in other applications is an impressive trick. Don’t be content to think that’s all OCR can do for you, though. By combining OCR output with other technologies, it’s possible to make substantial improvements to workflows throughout an organization. Incoming document workflows are the first and most obvious place where OCR can make a major impact.