Solved

How accurate is Code Green's (a DLP tool) OCR

Posted on 2016-07-23
1
64 Views
Last Modified: 2016-07-24
Googling around, there are tools that measure accuracy of OCR (converting image to characters).

Has anyone measured Code Green's OCR using any of these tools or has some indications of
Code Green's OCR accuracy?
0
Comment
Question by:sunhux
1 Comment
 
LVL 63

Accepted Solution

by:
btan earned 500 total points
ID: 41725848
not so sure about code green OCR accuracy but I understand Forcepoint has this OCR support but limited as its OCR engine does not support handwriting based image; nor are images containing text that is skewed more than 10 degrees. To share further, how FP does its check is
All other PDF documents, including hybrid files containing both searchable text and scanned text, are sent to the default Data Security extractor, not the OCR server. Should the system fail to extract text from a PDF, it is forwarded to the OCR server.
https://www.websense.com/content/support/library/data/v78/help/ocr_main.aspx

another candidate is Core DLP from GTB Tech is strong in OCR engine
Core Detection & Analysis Algorithms

Methods for describing sensitive content are abundant.  They can be divided into two categories: precise methods and imprecise methods.

Precise methods are, by definition, those that involve Content Registration and trigger almost zero false positive incidents.

All other methods are imprecise.  They include:  keywords, lexicons, regular expressions, extended regular expressions, meta data tags, Bayesian analysis, statistical analysis such as Machine Learning, etc.

Combined with the proprietary algorithms, GTB's AccuMatchTM detection algorithms have virtually zero false positives and a very high resilience to data modifications including:

Excerpting, inserting, file type conversion, formatting,    ASCII ->UNICODE conversion,     UNIX–Windows conversion,   partial data match, and so on.
https://gttb.com/data-loss-prevention/core-dlp-technology/

I will suggest you ask Code Green to share and compare against the above two DLP engine - if they do not even know these two provider I do see that they may be quite far off in improving their OCR leadership, likewise if they do know, there should be accuracy matrix to share on its limits
0

Featured Post

Enterprise Mobility and BYOD For Dummies

Like “For Dummies” books, you can read this in whatever order you choose and learn about mobility and BYOD; and how to put a competitive mobile infrastructure in place. Developed for SMBs and large enterprises alike, you will find helpful use cases, planning, and implementation.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Ransomware is a malware that is again in the list of security  concerns. Not only for companies, but also for Government security and  even at personal use. IT departments should be aware and have the right  knowledge to how to fight it.
Smart phones, smart watches, Bluetooth-connected devices—the IoT is all around us. In this article, we take a look at the security implications of our highly connected world.
Sending a Secure fax is easy with eFax Corporate (http://www.enterprise.efax.com). First, Just open a new email message.  In the To field, type your recipient's fax number @efaxsend.com. You can even send a secure international fax — just include t…
The Email Laundry PDF encryption service allows companies to send confidential encrypted  emails to anybody. The PDF document can also contain attachments that are embedded in the encrypted PDF. The password is randomly generated by The Email Laundr…

733 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question