Solved

How to extract a Unicode string from a PDF file programmatically

Posted on 2008-10-20
Last Modified: 2013-12-25
I have a PDF file that contains Unicode data, and I want to extract that data from the PDF into a database. How can I extract the Unicode text using a VB6 or VB.NET application? Is there an SDK or utility that can do this? Thanks.
Question by:bhaveshgujjar
3 Comments

Accepted Solution

by:
jorgesv13 earned 500 total points
ID: 22762140
Yes, you can use iTextSharp (http://itextsharp.sourceforge.net) to manipulate PDF files from .NET.
The article at http://www.codeproject.com/KB/cs/PDFToText.aspx
provides complete working code, built on iTextSharp, for extracting the text from a PDF file.
Although it is written in C#, you can compile it as a library and reference it from your VB.NET project.
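
For reference, here is a minimal VB.NET sketch of the same idea. It is not the code from the CodeProject article. It assumes a later iTextSharp 5.x release, which ships a PdfTextExtractor class in the iTextSharp.text.pdf.parser namespace, and it assumes the PDF's fonts carry a usable Unicode mapping (without one, the extracted text may come out garbled). The module name, the ExtractText helper, and the command-line path argument are made up for illustration.

```vbnet
Imports System
Imports System.Text
Imports iTextSharp.text.pdf
Imports iTextSharp.text.pdf.parser

Module PdfTextDemo
    ' Hypothetical helper: returns the text of every page as one .NET String.
    ' .NET strings are UTF-16, so Unicode characters survive as long as the
    ' PDF itself maps its glyphs back to Unicode.
    Function ExtractText(ByVal pdfPath As String) As String
        Dim reader As New PdfReader(pdfPath)
        Dim sb As New StringBuilder()
        Try
            ' iTextSharp page numbers start at 1, not 0.
            For pageNumber As Integer = 1 To reader.NumberOfPages
                sb.AppendLine(PdfTextExtractor.GetTextFromPage(reader, pageNumber))
            Next
        Finally
            reader.Close()
        End Try
        Return sb.ToString()
    End Function

    Sub Main(ByVal args As String())
        ' args(0) is assumed to be the path of the PDF to read.
        Console.OutputEncoding = Encoding.UTF8
        Console.WriteLine(ExtractText(args(0)))
    End Sub
End Module
```

When you write the returned string to the database, use a Unicode column type (for example NVARCHAR on SQL Server) and a parameterized command so the text is not converted to a single-byte code page on the way in.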


Question has a verified solution.
