Still celebrating National IT Professionals Day with 3 months of free Premium Membership. Use Code ITDAY17

x
?
Solved

How to extract unicode string from pdf file using programmatically

Posted on 2008-10-20
3
Medium Priority
?
1,518 Views
Last Modified: 2013-12-25
I have unicode based data file of pdf. i want that data to extract from pdf to database so how can i extract that unicode data using vb6 or vb.net application? is there any sdk or utility to get solution. thanks
0
Comment
Question by:bhaveshgujjar
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
3 Comments
 
LVL 8

Accepted Solution

by:
jorgesv13 earned 2000 total points
ID: 22762140
Yes, you can use iTextSharp (http://itextsharp.sourceforge.net) for manipulating PDF files from .NET.
In this URL: http://www.codeproject.com/KB/cs/PDFToText.aspx
you can find a fully-working code for extracting the text from PDF file, which uses iTextSharp.
Altough is in C#, you can compile it as a library and use it on your VB.Net project.
0

Featured Post

Get your Conversational Ransomware Defense e‑book

This e-book gives you an insight into the ransomware threat and reviews the fundamentals of top-notch ransomware preparedness and recovery. To help you protect yourself and your organization. The initial infection may be inevitable, so the best protection is to be fully prepared.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

The Adobe PDF proprietary file format is recognized as secure and formulated. But these PDF files are also prone to corruption and any external threat like virus attacks, improper storage can hit PDF file integrity.This type of damages can make cruc…
I was working on a PowerPoint add-in the other day and a client asked me "can you implement a feature which processes a chart when it's pasted into a slide from another deck?". It got me wondering how to hook into built-in ribbon events in Office.
In this video, we show how to convert an image-only PDF file into a PDF Searchable Image file, that is, a file with both the image (typically from scanning) and text, which is created in an automated fashion with Optical Character Recognition (OCR) …
In this fourth video of the Xpdf series, we discuss and demonstrate the PDFinfo utility, which retrieves the contents of a PDF's Info Dictionary, as well as some other information, including the page count. We show how to isolate the page count in a…
Suggested Courses

715 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question