Solved

How accurate is Code Green's (a DLP tool) OCR

Posted on 2016-07-23
1
77 Views
Last Modified: 2016-07-24
Googling around, there are tools that measure accuracy of OCR (converting image to characters).

Has anyone measured Code Green's OCR using any of these tools or has some indications of
Code Green's OCR accuracy?
0
Comment
Question by:sunhux
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
1 Comment
 
LVL 64

Accepted Solution

by:
btan earned 500 total points
ID: 41725848
not so sure about code green OCR accuracy but I understand Forcepoint has this OCR support but limited as its OCR engine does not support handwriting based image; nor are images containing text that is skewed more than 10 degrees. To share further, how FP does its check is
All other PDF documents, including hybrid files containing both searchable text and scanned text, are sent to the default Data Security extractor, not the OCR server. Should the system fail to extract text from a PDF, it is forwarded to the OCR server.
https://www.websense.com/content/support/library/data/v78/help/ocr_main.aspx

another candidate is Core DLP from GTB Tech is strong in OCR engine
Core Detection & Analysis Algorithms

Methods for describing sensitive content are abundant.  They can be divided into two categories: precise methods and imprecise methods.

Precise methods are, by definition, those that involve Content Registration and trigger almost zero false positive incidents.

All other methods are imprecise.  They include:  keywords, lexicons, regular expressions, extended regular expressions, meta data tags, Bayesian analysis, statistical analysis such as Machine Learning, etc.

Combined with the proprietary algorithms, GTB's AccuMatchTM detection algorithms have virtually zero false positives and a very high resilience to data modifications including:

Excerpting, inserting, file type conversion, formatting,    ASCII ->UNICODE conversion,     UNIX–Windows conversion,   partial data match, and so on.
https://gttb.com/data-loss-prevention/core-dlp-technology/

I will suggest you ask Code Green to share and compare against the above two DLP engine - if they do not even know these two provider I do see that they may be quite far off in improving their OCR leadership, likewise if they do know, there should be accuracy matrix to share on its limits
0

Featured Post

Simplifying Server Workload Migrations

This use case outlines the migration challenges that organizations face and how the Acronis AnyData Engine supports physical-to-physical (P2P), physical-to-virtual (P2V), virtual to physical (V2P), and cross-virtual (V2V) migration scenarios to address these challenges.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Email attacks are the most common methods for initiating ransomware and phishing scams. Attackers want you to open an infected attachment or click a malicious link, and unwittingly download malware to your machine. Here are 7 ways you can stay safe.
Recovering from what the press called "the largest-ever cyber-attack", IT departments worldwide are discussing ways to defend against this in the future. In this process, many people are looking for immediate actions while, instead, they need to tho…
This video is the first in a two-part series that discusses PaperPort's "Send To Bar" feature . This first video tutorial explains the purpose of the Send To Bar, how to use it, and how to hide unwanted items that are automatically created on it whe…
Along with being a a promotional video for my three-day Annielytics Dashboard Seminor, this Micro Tutorial is an intro to Google Analytics API data.

695 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question