Solved

How accurate is Code Green's (a DLP tool) OCR

Posted on 2016-07-23
Last Modified: 2016-07-24
Googling around, I found that there are tools that measure OCR accuracy (that is, how accurately an image is converted to characters).

Has anyone measured Code Green's OCR with any of these tools, or does anyone have some indication of
Code Green's OCR accuracy?
Question by:sunhux

Accepted Solution

by:
btan earned 500 total points
ID: 41725848
I am not sure about Code Green's OCR accuracy, but I understand that Forcepoint has OCR support with some limits: its OCR engine does not support handwritten images, nor images containing text that is skewed by more than 10 degrees. To share further, this is how Forcepoint (FP) describes its check:
All other PDF documents, including hybrid files containing both searchable text and scanned text, are sent to the default Data Security extractor, not the OCR server. Should the system fail to extract text from a PDF, it is forwarded to the OCR server.
https://www.websense.com/content/support/library/data/v78/help/ocr_main.aspx
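
To make the routing described in that quote concrete, here is a minimal sketch of a "try the default extractor first, fall back to OCR" flow. It is only an illustration of the idea, not Forcepoint's code: `extract_pdf_text`, `default_extractor` and `ocr_extractor` are hypothetical names, and the two extractors are passed in as plain callables.

```python
from typing import Callable, Optional, Tuple

# Hypothetical type: an extractor takes raw PDF bytes and returns text, or None on failure.
Extractor = Callable[[bytes], Optional[str]]


def extract_pdf_text(pdf_bytes: bytes,
                     default_extractor: Extractor,
                     ocr_extractor: Extractor) -> Tuple[str, str]:
    """Return (route, text), where route is "default" or "ocr"."""
    # Step 1: the PDF goes to the default text extractor first.
    text = default_extractor(pdf_bytes)
    if text and text.strip():
        return "default", text
    # Step 2: only if no text could be extracted is the file forwarded to OCR.
    text = ocr_extractor(pdf_bytes)
    return "ocr", text or ""
```

The practical consequence is that OCR accuracy only matters for the files that reach the second step, so any accuracy test should use PDFs and images that have no extractable text layer.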

Another candidate is Core DLP from GTB Tech, which is strong on the OCR engine side. From their site:
Core Detection & Analysis Algorithms

Methods for describing sensitive content are abundant.  They can be divided into two categories: precise methods and imprecise methods.

Precise methods are, by definition, those that involve Content Registration and trigger almost zero false positive incidents.

All other methods are imprecise.  They include:  keywords, lexicons, regular expressions, extended regular expressions, meta data tags, Bayesian analysis, statistical analysis such as Machine Learning, etc.

Combined with the proprietary algorithms, GTB's AccuMatch™ detection algorithms have virtually zero false positives and a very high resilience to data modifications including:

Excerpting, inserting, file type conversion, formatting, ASCII -> UNICODE conversion, UNIX-Windows conversion, partial data match, and so on.
https://gttb.com/data-loss-prevention/core-dlp-technology/

I suggest you ask Code Green to share its OCR accuracy figures and compare them against the two DLP engines above. If they do not even know these two providers, I would say they are quite far from OCR leadership; if they do know them, they should have accuracy metrics to share, including the limits of their engine.
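
If the vendor cannot give figures, you can measure it yourself. The usual metric behind OCR accuracy tools is character accuracy, which is 1 minus the character error rate (edit distance between the known text and the OCR output, divided by the length of the known text). Below is a minimal sketch in plain Python. It assumes you can get the OCR'd text out of the product somehow (for example from an incident or forensics view); that part is an assumption on my side and is not shown, and the function names are my own.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance: minimum insertions, deletions and substitutions to turn a into b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def char_accuracy(reference: str, ocr_output: str) -> float:
    """Character accuracy = 1 - (edit distance / reference length), floored at 0."""
    if not reference:
        return 1.0 if not ocr_output else 0.0
    cer = levenshtein(reference, ocr_output) / len(reference)
    return max(0.0, 1.0 - cer)


# Example: one wrong character out of four.
print(char_accuracy("abcd", "abxd"))  # 0.75
```

Run this over a set of test images where you already know the exact text (clean scans, skewed scans, low resolution, and so on) and you get a comparable number per product.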