Link to home
Start Free TrialLog in
Avatar of AlexKostrub
AlexKostrubFlag for Ukraine

asked on

content analysis of pdf file

Is there a software for content analysis of pdf file (i.e. finding how many of defined phrases are there in such file) availible for download(any type of licence)? If not, I need ideas how to implement such software.
Avatar of sathaihost
sathaihost

start coding and use open source library. here is a nice library for you to analyze your pdf file.
http://csharp-source.net/open-source/pdf-libraries 
And this is how to analyze word phrase in PDF with c# using iTextSharp

http://www.java2s.com/Open-Source/CSharp/PDF/iTextSharp/iTextSharp/text/Phrase.cs.htm

I hope this help you.
Avatar of AlexKostrub

ASKER

I can't find in Prase.cs of iTextLibrary how to access an element of text of pdf.
I also still want to know if there are any software able to perform content analysis of pdf.
this link is where you could find the phrase class;
http://www.java2s.com/Open-Source/CSharp/PDF/iTextSharp/iTextSharp/text/Phrase.cs.htm

why do you try program like http://www.mywritertools.com/ to analyze your content. just a quick google search, havn't tried it before.
You misunderstood me: I found the Prace.cs class but I did not found in it how to access elements of text of pdf. I found some shareware but I wanted expert advice. Also www.mywritertools.com is not eligible for me because it works only with MS Word.
ASKER CERTIFIED SOLUTION
Avatar of AlexKostrub
AlexKostrub
Flag of Ukraine image

Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
There is no other acceptable solution