Improve company productivity with a Business Account.Sign Up

x
?
Solved

Scanned pages into pdf

Posted on 2006-06-27
3
Medium Priority
?
321 Views
Last Modified: 2010-04-17
hello all does anyone  know how are scanned pages containing text converted into pdf and what is the format in which text is stored in the pdf. Can this text be directly extracted from the pdf or some technique like OCR is required to be used.

Thanks
0
Comment
Question by:jhav1594
1 Comment
 
LVL 1

Accepted Solution

by:
jm021196 earned 2000 total points
ID: 16994505
It reallly depends on what app you are using and the quality of the page.

If the PDF Converting program which is being used to take the image from the scanner can recognise the text as text then its stored as text in the PDF File.

If the converting program cannot recognise it as text then it gets saved in a variety of image formats depending on which one suits it best. There really is no way to tell how its saved in advance.

PDF Files use a combination of vector, raster and text formates to give the best compression and viewability and so converting to PDF is a very difficult thing to undo... especiall if its not possible to tell in advance if its going to be in text or not.

I would suggest that a OCR system is the best way forward.

Thanks
mitch
0

Featured Post

Free Tool: Subnet Calculator

The subnet calculator helps you design networks by taking an IP address and network mask and returning information such as network, broadcast address, and host range.

One of a set of tools we're offering as a way of saying thank you for being a part of the community.

Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.

Join & Write a Comment

If you are a mobile app developer and especially develop hybrid mobile apps then these 4 mistakes you must avoid for hybrid app development to be the more genuine app developer.
AngularJS web development a very simple procedure. So, to put it, in short, AngularJS’ stand out features are – Two-way data binding, MVC structure, directives, templates, dependency injections and testing.
Progress
Introduction to Processes

589 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question