Column position differs in text output in multiple page PDF report conversion to text

Dear Expert,

Thanks for sharing this excellent information.  I noticed that when I create text file using the xPDF for a tabular report and if the report is of multiple page then column position of is inconsistent in each page of text output. For example if a column starting at position 10 on the first page, it is at column 11 or 12 or 9 in subsequent position. How do we achieve the text output in all pages in sync? Please advise.
Ankit-PKAsked:
Who is Participating?
I wear a lot of hats...

"The solutions and answers provided on Experts Exchange have been extremely helpful to me over the last few years. I wear a lot of hats - Developer, Database Administrator, Help Desk, etc., so I know a lot of things but not a lot about one thing. Experts Exchange gives me answers from people who do know a lot about one thing, in a easy to use platform." -Todd S.

Joe Winograd, Fellow&MVEDeveloperCommented:
If you haven't viewed my 5-minute EE video Micro Tutorial on Xpdf's PDFtoText utility, I recommend doing that (it seems that you don't need the first part in the 3-part series):
Xpdf - Convert PDF Files to Plain Text Files - Part 3

You'll notice that the video mentions different options. I suggest experimenting with these five output format options:

(1) -layout
(2) -lineprinter
(3) -raw
(4) -table
(5) none of the above, which means it will use the default output format

One of these five may work better for you.

Also, it's possible that the page-break character is causing some, if not all, of the differences, so you should experiment with eliminating it in the output text file, which is achieved with this option:

-nopgbrk

Regards, Joe
0

Experts Exchange Solution brought to you by

Your issues matter to us.

Facing a tech roadblock? Get the help and guidance you need from experienced professionals who care. Ask your question anytime, anywhere, with no hassle.

Start your 7-day free trial
Joe Winograd, Fellow&MVEDeveloperCommented:
Selecting a different output format mode often solves problems like this. Also, the -nopgbrk is sometimes the issue, as is the -fixed option, which specifies the character pitch/width in points (for -layout, -table, -lineprinter modes).
0
It's more than this solution.Get answers and train to solve all your tech problems - anytime, anywhere.Try it for free Edge Out The Competitionfor your dream job with proven skills and certifications.Get started today Stand Outas the employee with proven skills.Start learning today for free Move Your Career Forwardwith certification training in the latest technologies.Start your trial today
Document Imaging

From novice to tech pro — start learning today.

Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.