Text within a Image format

Security/Privacy related question.  Can text be detected in say a .jpg, . bmp, etc. type file formats?  I know using text within .pdfs can be with OCR.  When I say "detected" I mean with use of a SIEM, DLP or other event driven software?  Not referring to steganography or obfuscation of text in anyway.  Just simple text detection in a jpg or bmp format.  Much thanks.
Who is Participating?
AlanConnect With a Mentor ConsultantCommented:
Absolutely - do a search for:

"Optical Character Recognition" OR OCR

Open in new window

You will find masses of information out there.

Paul SauvéRetiredCommented:
in fact, you can use software like Bullzip PDF printer to print ANY file as PDF then use OCR to extract a text file...

I would note that, if you are printing something to PDF that has the actual text within it, then you don't need OCR at all - the text will be in the PDF.

If you are printing an image to PDF, then you are probably wasting your time, as most OCR software will work with most image formats, not just PDF.

Solution provided.
Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.

All Courses

From novice to tech pro — start learning today.