Link to home
Start Free TrialLog in
Avatar of AlexKostrub
AlexKostrubFlag for Ukraine

asked on

content analysis of pdf file

Is there a software for content analysis of pdf file (i.e. finding how many of defined phrases are there in such file) availible for download(any type of licence)? If not, I need ideas how to implement such software.
Avatar of sathaihost

start coding and use open source library. here is a nice library for you to analyze your pdf file. 
And this is how to analyze word phrase in PDF with c# using iTextSharp

I hope this help you.
Avatar of AlexKostrub


I can't find in Prase.cs of iTextLibrary how to access an element of text of pdf.
I also still want to know if there are any software able to perform content analysis of pdf.
this link is where you could find the phrase class;

why do you try program like to analyze your content. just a quick google search, havn't tried it before.
You misunderstood me: I found the Prace.cs class but I did not found in it how to access elements of text of pdf. I found some shareware but I wanted expert advice. Also is not eligible for me because it works only with MS Word.
Avatar of AlexKostrub
Flag of Ukraine image

Link to home
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
There is no other acceptable solution