Machine Learning Best Practices

Machine Learning Best Practices

A brief overview of understanding machine learning, its challenges, and best practices for implementing a program in your business.  We also discuss how we use machine learning at Extract to deliver results better than you thought possible.  Read on to learn more.

What if the OCR misses a field or value?

What if the OCR misses a field or value?

You have a software package that relies on optical character recognition (OCR) to classify, pick up words, numbers or phrases from a document.  As long as the quality of the document is mostly clean, everything works well.  However, what happens when the document arrives and the quality is simply, not good?  Does the software give up and run away with its tail between its legs? Are there any options to classify or capture anything on these documents?  

PCI Compliance

PCI Compliance

As it becomes more and more clear that data breaches are a fact of modern day life, it is also clear we need to think about protecting consumer information in more diverse ways. 

One way to protect data is to simply redact it.  The problem is that the amount and type of information that needs to be redacted is diverse and often changes from state to state. 

Wisconsin Becomes Technology Hub

Wisconsin Becomes Technology Hub

Taiwanese electronics manufacturer Foxconn recently announced plans to open a facility. This facility would build LCD screens for computers, televisions, and automobiles. The plans are to build it between Milwaukee and Chicago. The Company has pledged to invest $10 billion dollars to build the factory and have it open by 2020.

Defining Your Implementation Phase

Defining Your Implementation Phase

The Implementation Phase is the third step in proceeding with an automated redaction vendor. If you’re in this phase, you’re close to the go-live for your project.

The first thing that you must do in this phase is collect sample documents. This involves gathering random and targeted sample documents in scope for the redaction project. Once your files and samples are gathered, you must send them to your software vendor.

Defining Your Project Plan For Automated Redaction

Defining Your Project Plan For Automated Redaction

Are you planning on completing an automated redaction project for your company?

Have you already defined your project scope?

Deciding an automated redaction vendor can be overwhelming. You’ll have to do some vendor scoping to figure out who is best for your company.

Bots Versus Humans

Bots Versus Humans

First came Alien, then Predator, followed by Alien versus Predator. The Alien series chronicles the battle between humans and a mysterious lifeform whose lifecycle has just begun. Predator was based on an extraterrestrial hunter stalking commandos in Central America and the citizens of Los Angeles.  When the two series merged, it featured an epic battle between the two legends.

3 ways to redact a document

3 ways to redact a document

Ensure all sensitive information in the document has been removed. 

The exclusion of private information from sensitive documents is something that individuals in different fields and job roles need to worry about. From the single business proprietor who needs to redact personal or business financial information to government agencies protecting policy data, redacting sensitive information properly is key. For those who need to redact a document the right way, some basic steps based on expert advice can come in handy.

Defining Your Automated Redaction Project Scope

Defining Your Automated Redaction Project Scope

Defining your organization's project scope for an automated redaction project

Not sure where to start when it comes to implementing automated redaction for your business?

Build a solid foundation for a successful automated redaction project by following the best practice tips for defining your project scope.

5 Examples of how document type and quality can affect accuracy

Document formatting and quality can have a dramatic effect on OCR and rules accuracy when data capture is concerned. The 5 examples shown below are meant to educate a potential or current data capture user on what can cause accuracy to rise or fall.  Although sometimes it’s hard or impossible to correct the issues that cause accuracy to fall, there are generally steps that can be taken to help prevent them.

Top 3 Optical Character Recognition (OCR) Misconceptions

Top 3 Optical Character Recognition (OCR) Misconceptions

Optical Character Recognition can be an extremely powerful tool, but there are many things that an OCR engine can’t actually handle, that often times get overlooked. Below I have listed out the top 3 most common misconceptions of an OCR engine.